DeepGit
DeepGit copied to clipboard
Deep research agent to help you find the best GitHub repositories đľď¸!
DeepGit
DeepGit 2.0 đ¤ŻÂ â now hardwareâaware & ColBERTâpowered
DeepGit
DeepGit is an advanced, Langgraph-based agentic workflow designed to perform deep research across GitHub repositories. It intelligently searches, analyzes, and ranks repositories based on user intentâeven uncovering less-known but highly relevant tools. DeepGit infuses hybrid dense retrieval with advanced cross-encoder re-ranking and comprehensive activity analysis into a unified, open-source platform for intelligent repository discovery
Try out the Lite version here đ§âđ
DeepGit-lite is a lightweight version of DeepGit running on zero GPU on Hugging Face Space here.
It may not perform as well as the full version, but it's great for a quick first-hand preview.
The latest release makes it even deeper, smarter, and faster:
| New feature | What it gives you |
|---|---|
| âď¸Â Multiâdimensional ColBERT v2 embeddings | Fineâgrained tokenâlevel similarity for nuanced matches that singleâvector embeddings miss. |
| đŠ Smart Hardware Filter | Tell DeepGit your device specs â CPU-only, low RAM, or mobile. It filters out repos that wonât run smoothly, so you only see ones that fit your setup. |
DeepGit still unifies hybrid dense retrieval, crossâencoder reâranking, activity & quality analysisâbut now every step is both smarter and leaner.
âď¸Â How It Works â Agentic Workflow v2
When the user submits a query, the DeepGit Orchestrator Agent triggers a relay of expert tools:
-
Query Expansion
An LLM turns your naturalâlanguage question into highâsignal GitHub tags for precise searching. -
Hardware Spec Detector
The same pass infers your wording for hints like âGPUâpoorâ, âlowâmemoryâ, or âmobileâonlyâ and records the constraint. -
ColBERTâv2 Semantic Retriever
Every README & doc block is embedded with multiâdimensional token vectors; MaxSim scoring surfaces nuanced matches. -
CrossâEncoder Reâranker
A lightweight BERT (MiniLMâLâ6âv2) reâorders the top K results for passageâlevel accuracy. -
Hardwareâaware Dependency Filter
The reasoning engine inspects each repoâsrequirements.txt /Âpyproject.tomland discards any that canât run on your declared hardware. -
Community & Code Insight
Collects stars, forks, issue cadence, commit history, plus quick codeâquality metrics. -
Multiâfactor Ranking & Delivery
Merges all scores into one ranking and serves a clean table with links, similarity %, and âRuns on cpuâonlyâ badges where relevant.
đ Goals
-
Uncover Hidden Gems:
Surface powerful but under-the-radar open-source tools. Now comes with hardware spec filter too. -
Empower Research:
Build an intelligent discovery layer over GitHub tailored for research-focused developers. -
Promote Open Innovation:
Open-source the entire workflow to foster transparency and collaboration in research.
đĽď¸ User Interface
DeepGit provides an intuitive interface for exploring repository recommendations. The main page where users enter raw natural language query. This is the primary interaction point for initiating deep semantic searches.
Output: Showcases the tabulated results with clickable links and different threshold scores, making it easy to compare and understand the ranking criteria.
đ§ Recommended Environment
- Python: 3.11+ (The repo has been tested on Python 3.11.x)
- pip: 24.0+ (Ensure you have an up-to-date pip version)
đ¨âđ Setup Instructions
1. Clone the Repository
git clone https://github.com/zamalali/DeepGit.git
cd DeepGit
2. Create a Virtual Environment (Recommended)
python3 -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
3. Upgrade pip (Optional but Recommended)
pip install --upgrade pip
4. Install Dependencies
pip install -r requirements.txt
5. đ Running DeepGit via App
To run DeepGit locally, simply execute:
python app.py
đ ď¸ Troubleshooting
- Python Version: Use Python 3.11 or higher as the repo has been tested on Python 3.11.x.
- pip Version: Make sure youâre running pip 24.0 or later.
- Dependency Issues: If you encounter any, try reinstalling in a new virtual environment.
đ ď¸ Running DeepGit
For a detailed documentation on using DeepGit, Check out here.
DeepGit leverages Langgraph for orchestration. To launch the Langsmith dashboard and start the workflow, simply run:
langgraph dev
This command opens the Langsmith dashboard where you can enter your raw queries in a JSON snippet and monitor the entire agentic workflow.
DeepGit on Docker
For instructions on using Docker with DeepGit, please refer to our Docker Documentation.