Lists (1)
Sort Name ascending (A-Z)
Stars
TradingAgents: Multi-Agents LLM Financial Trading Framework
Real-time speech translation β macOS & Windows, free TTS, no server, your API keys only
π PageIndex: Document Index for Vectorless, Reasoning-based RAG
A client-side library that converts any HTML element into a fully editable PowerPoint slide. **dom-to-pptx** transforms DOM structures into pixel-accurate `.pptx` content, preserving gradients, shaβ¦
Your own personal AI assistant. Any OS. Any Platform. The lobster way. π¦
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop β all through one unified, production-reβ¦
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
A complete kitchen scene designed by Lightwheel for simulation and interaction in Isaac Sim.
Computer Vision Annotation Tool (CVAT) is a leading platform for building high-quality visual datasets for vision AI. It offers open-source, cloud, and enterprise products, as well as labeling servβ¦
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
A video call application that recognizes gestures (signal language) and converts them into text and sound.
Papers for Video Anomaly Detection, released codes collection, Performance Comparision.


