Name	Name	Last commit message	Last commit date
Latest commit History 78 Commits
.github	.github
api	api
atlasstack.egg-info	atlasstack.egg-info
clients	clients
docs	docs
k8s	k8s
kong	kong
monitoring	monitoring
scripts	scripts
services	services
shared	shared
tests	tests
training	training
.env.example	.env.example
.gitignore	.gitignore
.pre-commit-config.yaml	.pre-commit-config.yaml
ArchitectureMap.tsx	ArchitectureMap.tsx
Makefile	Makefile
README.md	README.md
atlas.py	atlas.py
docker-compose.yml	docker-compose.yml
nixpacks.toml	nixpacks.toml
package-lock.json	package-lock.json
pyproject.toml	pyproject.toml
pytest.ini	pytest.ini
railway.json	railway.json
requirements.txt	requirements.txt
test_app2.py	test_app2.py
test_hf.py	test_hf.py
test_hf_raw.py	test_hf_raw.py
test_parsers.py	test_parsers.py
vercel.json	vercel.json

AtlasStack

AtlasStack is an autonomous software engineering engine that analyzes GitHub repositories using Gemini 1.5 Flash and Qwen2.5-Coder, finds bugs, explains architecture, and proposes fixes — including an embedded web IDE experience.

Highlights

Deep Repo Analysis: Clones any public GitHub repo and produces a structured report: architecture, data flow, important files, security fixes, and a health score.
Embedded Web IDE: A full VS Code-like development environment directly in the browser.
"Explain Like I'm 10": Toggleable ELI5 summaries for every codebase analysis.
Lite Mode Backend: Runs entirely on Python + SQLite — no Docker needed.
Premium CLI Tool: Analyze repositories and view history directly from your terminal.
Autonomous Training Data Collection: Captures scan inputs and LLM outputs to build a high-quality fine-tuning dataset automatically.
Enterprise Authentication: Secured by Clerk, providing out-of-the-box SSO, OAuth, and user management.

Screenshots

Landing Page

Architecture

AtlasStack is built as a distributed system of specialized services coordinated via an API Gateway.

flowchart TD
  A[Client / UI / CLI] -->|REST| B(API Gateway)
  B -->|Queue| C(Analysis Worker)
  C -->|Model calls| D(LLM Service)
  C -->|Index| E(Knowledge Service)
  E -->|Search| B
  subgraph Data Stores
    F[(PostgreSQL)]
    G[(Redis)]
    H[(Neo4j)]
    I[(Weaviate)]
  end
  B --> F
  B --> G
  C --> F
  C --> G
  E --> H
  E --> I

For more details, see the Architecture Deep Dive.

Tech Stack

Core

Frontend: React, TypeScript, Vite, Monaco Editor, Tailwind CSS, Framer Motion
Backend (Lite Mode): Python 3.12, FastAPI, SQLite, Gemini 1.5 Flash (Google) or Qwen2.5-Coder (HuggingFace)

Enterprise Infrastructure (Optional)

Message Broker: RabbitMQ
Graph Database: Neo4j (Code dependency maps)
Vector Database: Weaviate (Semantic code search)
Caching & Sessions: Redis
Primary Data: PostgreSQL
Observability: Prometheus, Grafana, Jaeger (Tracing)
Orchestration: Docker Compose, Kubernetes

Quick Start (Lite Mode — no Docker)

Prerequisites

Python 3.12+
Node.js 18+
A Gemini API Key (Recommended) or a HuggingFace token

1) Configure environment

cp .env.example .env
cp clients/web/.env.example clients/web/.env.local
# Edit .env — at minimum set HF_TOKEN
# Edit clients/web/.env.local — set VITE_CLERK_PUBLISHABLE_KEY

2) Install Python dependencies

# Recommended: Use a virtual environment
python -m venv .venv
.\.venv\Scripts\Activate.ps1

pip install -r requirements.txt
pip install -e .  # Registers the 'atlas' command

3) Start the backend (Lite Mode)

python test_app2.py
# → http://localhost:8005
# → API docs at http://localhost:8005/docs

4) Start the frontend

cd clients/web
npm install
npm run dev
# → http://localhost:3000

5) Use it

Open http://localhost:3000
Click Sign In / Register → create an account
Enter a public GitHub repo URL and click Start Analysis
The AI will clone the repo, analyze it, and return a full report

No AI Key? The analysis endpoint falls back to a mock response. Set GEMINI_API_KEY or HF_TOKEN in .env to enable real AI analysis.

IDE Integration

Visual Studio Code

The AtlasStack VS Code extension brings AI-driven analysis directly to your workspace.

Build/Install: Open clients/vscode and follow the README to build the .vsix.
Connect: Open VS Code settings and set atlasstack.serverUrl to your backend (default: http://localhost:8005).
Analyze: Use the AtlasStack icon in the Activity Bar to run deep repository scans.

Terminal CLI

The AtlasStack CLI is a powerful tool for developers who live in the terminal.

Installation & Setup

Create Environment:

python -m venv .venv
.\.venv\Scripts\Activate.ps1  # Windows
# source .venv/bin/activate   # macOS/Linux

Install CLI:

pip install -r requirements.txt
pip install -e .

Command Reference

Command	Description
`atlas scan .`	Scan the current local directory
`atlas analyze <url>`	Analyze a remote GitHub repository
`atlas history`	View your past analysis records
`atlas view <id>`	See detailed results for a specific scan
`atlas config`	Update your Hugging Face credentials

API Endpoints

Method	Path	Auth	Description
POST	`/api/v1/analysis/mvp`	No	Full repo analysis
GET	`/api/v1/analyses`	Yes	List your past analyses
GET	`/api/v1/analyses/{id}`	Yes	Get analysis detail
POST	`/api/v1/repositories`	Yes	Register a repo
GET	`/api/v1/repositories`	Yes	List your repos
POST	`/api/v1/repositories/{id}/analyze`	Yes	Trigger analysis
GET	`/health`	No	Health check

Full interactive docs at http://localhost:8005/docs

Project Layout

atlasstack/
├── clients/
│   ├── web/              # Vite + React frontend (landing page + IDE)
│   └── vscode/           # VS Code extension
├── services/
│   ├── api/              # FastAPI gateway (auth, repos, analysis)
│   ├── analysis/         # Celery workers (AST, security, perf)
│   ├── llm/              # LLM inference service
│   └── knowledge/        # Neo4j + Weaviate knowledge graph
├── shared/               # Shared Pydantic models + utilities
├── k8s/                  # Kubernetes manifests
├── monitoring/           # Prometheus + Grafana + Jaeger configs
├── test_app2.py          # Lite Mode entry point
└── docker-compose.yml    # Full stack deployment

AI Model Training

AtlasStack supports custom model fine-tuning via training/. You can run the pipeline to adapt the analysis engine to specific coding standards or languages.

# Run Supervised Fine-Tuning (SFT)
make train-sft

# Run RLHF pipeline
make train-rlhf

Autonomous Data Collection

AtlasStack can automatically harvest its own analysis results to build a specialized dataset for future training.

Storage: Data is saved in JSONL format for easy ingestion by the DatasetLoader.
Default Path: training/datasets/collected_scans.jsonl
Toggle: Enable or disable via COLLECT_TRAINING_DATA in your .env.

Every collected entry includes the system instruction, the repository context, and the structured LLM response, along with metadata like the repo URL and scan timestamp.

Developer Reference

Makefile Commands

Command	Description
`make up`	Start full Docker stack
`make test`	Run unit + integration tests
`make lint`	Run flake8 and pylint
`make format`	Format code with black/isort
`make benchmark`	Run performance test suite
`make clean`	Remove all containers & cache

🚀 Deployment Guide

Docker Compose (Production-ready)

For a full enterprise-grade deployment with PostgreSQL, Redis, and RabbitMQ:

Prepare Environment:

cp .env.example .env
# Fill in required secrets (HF_TOKEN, DATABASE_URL, etc.)

Launch Stack:
```
docker-compose up -d
```

Run Migrations:

docker-compose exec api alembic upgrade head

Kubernetes (K8s)

Deployment manifests are located in /k8s.

kubectl apply -f k8s/

🔧 Troubleshooting

Issue	Solution
White Screen in IDE	Ensure `Cross-Origin-Opener-Policy` headers are set. Check browser console for Clerk errors.
Analysis Timed Out	Increase `MAX_ANALYSIS_TIME` in `.env`. Default is 300s.
"git clone" fails	Verify the repository is public or your `GITHUB_TOKEN` has proper scopes.
Database connection error	If using `LITE_MODE=false`, ensure PostgreSQL is running and `DATABASE_URL` is correct.
Alembic migration failed	Run `ENVIRONMENT=development alembic upgrade head` to see detailed logs.

Configuration Reference

Variable	Default	Description
`HF_TOKEN`	-	HuggingFace Token (required for real AI)
`GEMINI_API_KEY`	-	Google Gemini API Key (Recommended)
`ENVIRONMENT`	`production`	Set to `development` for verbose logging and bypass strict security checks
`JWT_SECRET`	-	Secret for JWT signing (Required in production)
`LITE_MODE`	`true`	Toggle between SQLite and full Postgres stack
`VITE_CLERK_PUBLISHABLE_KEY`	-	Clerk Public Key for Frontend Auth
`DEFAULT_MODEL`	`Qwen2.5-Coder`	The LLM used for analysis
`COLLECT_TRAINING_DATA`	`true`	Toggle scan data harvesting
`RATE_LIMIT_REQUESTS`	`100`	Max requests per window
`RATE_LIMIT_WINDOW`	`60`	Window size in seconds

See .env.example for the full list of configuration options.

Security Notes

Authentication: All user authentication, session management, and SSO is handled securely via Clerk (frontend) and JWT (backend).
Vulnerability Scanning: CI/CD includes automated scans via bandit, safety, and semgrep.
Execution Sandboxing: Repository analysis runs in isolated temporary directories with strict timeouts and size limits.
LLM Data Privacy: Local LLM inference via Lite Mode ensures your codebase never leaves your infrastructure (unless using external HuggingFace/Google Inference APIs).

License

Apache License 2.0 — see LICENSE for details.

Folders and files

Latest commit

History

Repository files navigation

AtlasStack

Highlights

Screenshots

Landing Page

Architecture

Tech Stack

Core

Enterprise Infrastructure (Optional)

Quick Start (Lite Mode — no Docker)

Prerequisites

1) Configure environment

2) Install Python dependencies

3) Start the backend (Lite Mode)

4) Start the frontend

5) Use it

IDE Integration

Visual Studio Code

Terminal CLI

Installation & Setup

Command Reference

API Endpoints

Project Layout

AI Model Training

Autonomous Data Collection

Developer Reference

Makefile Commands

🚀 Deployment Guide

Docker Compose (Production-ready)

Kubernetes (K8s)

🔧 Troubleshooting

Configuration Reference

Security Notes

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages