citizenhicks

indie ml engineer

About Me

indie ml engineer and ai enthusiast specializing in multi-agent systems and autonomous ai architectures. building the next generation of multi agent applications that enable complex startegic modeling, planning, and tool use. passionate about creating intelligent systems that can decompose problems, coordinate across specialized agents, and deliver sophisticated solutions through orchestrated workflows. focused on pushing the boundaries of what agents can achieve in real-world applications especially in model development and validation in the fincancial services sector.

Multi-Agent Systems LangChain LangGraph smolagents Agent Orchestration ReAct Framework Monte Carlo Tree Search Autonomous Workflows LLM Agents Symbolic Reasoning Probability theory Reinforcement learning

Projects

pegasus

Multi-agent financial analysis platform featuring and ai agent that processes complex financial documents with tool use. Pegasus implements sophisticated agent workflows where specialized agent handle document ingestion, analysis, and report generation through task decomposition and tool use.

  • Agentic system with a single ai for document processing, analysis, and synthesis
  • Agent-driven PDF parsing with context-aware chunking strategies and memory systems
  • Tool-using agent with access to multimodal capabilities for comprehensive document understanding

Entropix

Closely following and contributing to the Entropix project - an innovative approach to adaptive sampling and dynamic inference strategies for language models, enabling more intelligent and context-aware generation through entropy-based decision making.

GRAPES: Graph-based Reasoning and Planning with Ensemble Systems

An advanced multi-agent reasoning framework that enhances language model capabilities through Monte Carlo Tree Search. Implements strategic planning and ensemble decision-making for complex problem-solving tasks.

  • Tech Stack: Python, asyncio, Pydantic, Agent Orchestration
  • Achievement: Improved accuracy from 62.50% to 78.12% on complex reasoning tasks through dual-agent collaboration
  • Features: Tree-based search strategies, ensemble voting, adaptive agent selection

TD3-HER for Robotic Control

Implementation of autonomous robotic agents using Twin Delayed Deep Deterministic Policy Gradient with Hindsight Experience Replay. This project demonstrates agent-based learning in sparse reward environments, where robotic agents must discover successful strategies through autonomous exploration and goal-conditioned learning.

  • Tech Stack: Python, TensorFlow, Gymnasium, Panda-Gym, RL
  • Features: Autonomous goal discovery, experience replay for agent learning, twin-agent architecture for stable training
  • Applications: Robotic manipulation tasks with autonomous agent control

CyberSweat (also known as workout.AI)

One of the first applications to leverage generative AI for personalized fitness planning. CyberSweat uses GPT-3.5 as a backend that understands user goals, preferences, and constraints to generate fully customized workout plans. The app adapts recommendations based on user feedbacks.

  • Tech Stack: Swift, SwiftUI, Firebase, OpenAI API
  • Features: Pioneering use of ai for dynamic workout generation and personalization
  • Impact: Early demonstration of practical applications in health and fitness

closed sourcecode

Skills

Agent Frameworks LangChain LangGraph smolagents Tool Development Memory Systems Planning Algorithms Task Decomposition Python PyTorch AWS Swift FastAPI Async Programming Vector Databases next.js

GitHub Stats

GitHub Stats