Keep up.
The zero-noise tech radar. AI and ML repos worth five minutes of investigation. Curated, not aggregated.
Practical picks are hand-curated tools we use ourselves. The weekly radar is generated each Monday by a curator that reads six parallel searches and keeps only repos with a genuinely novel technique, active commits, and real adoption.
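As a rough illustration of that weekly filter, here is a minimal sketch. It is not the curator's actual code: the `Repo` fields, the `keep` thresholds, and the shape of the search callables are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Repo:
    name: str
    novel_technique: bool   # assumed flag: set by an editorial judgement step
    commits_last_30d: int   # assumed proxy for "active commits"
    stars: int              # assumed proxy for "real adoption"


def keep(repo: Repo, min_commits: int = 5, min_stars: int = 500) -> bool:
    """Apply the three criteria; both thresholds are hypothetical."""
    return (
        repo.novel_technique
        and repo.commits_last_30d >= min_commits
        and repo.stars >= min_stars
    )


def curate(searches: List[Callable[[], Iterable[Repo]]]) -> List[Repo]:
    """Run the searches in parallel, merge, de-duplicate by name, then filter."""
    with ThreadPoolExecutor(max_workers=max(1, len(searches))) as pool:
        batches = list(pool.map(lambda search: list(search()), searches))

    merged = {}
    for batch in batches:
        for repo in batch:
            # Keep the first occurrence of each repo across searches.
            merged.setdefault(repo.name, repo)

    return [repo for repo in merged.values() if keep(repo)]


if __name__ == "__main__":
    # Placeholder data, not real repositories.
    a = Repo("example/a", True, 12, 900)
    b = Repo("example/b", False, 40, 5000)
    picks = curate([lambda: [a, b], lambda: [a]])
    print([repo.name for repo in picks])
```

A repo that shows up in several searches is counted once, and one that misses any of the three criteria is dropped, however strong the other two are.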
Showing 36 picks across 4 issues, grouped by section.
Practical picks
Hand-picked tools, not weekly research. Pinned here.
Dev tools
2 picks. CLI tools you actually use day to day.
simonw/llm
Prompting and structured outputs
3 picks. Frameworks for writing prompts as code, not strings.
stanfordnlp/dspy
jxnl/instructor
promptfoo/promptfoo
Integration layer
1 pick. One API, many providers. Less lock-in.
BerriAI/litellm
Quality and evals
2 picks. Catch slop and regressions before users do.
errata-ai/vale
confident-ai/deepeval
RAG and document processing
1 pick. Turn documents into things models can use.
VikParuchuri/marker
Agent frameworks
2 picks. Build agents without writing the runtime yourself.
huggingface/smolagents
browser-use/browser-use
Documentation
1 pick. Reproducible, code-driven publishing.
quarto-dev/quarto-cli
Weekly radar
Fresh finds, every Monday.
Multimodal
5 picks. Models that learn across vision, audio, video, and language together.
baaivision/NOVA
YangLing0818/ContextDiff
deepseek-ai/Janus
TXH-mercury/VALOR
DAMO-NLP-SG/multimodal_textbook
Training and fine-tuning
1 pick. Pre-training recipes, fine-tuning frameworks, and the plumbing that makes them work.
hpcaitech/Open-Sora
Alignment and preference
2 picks. RLHF, DPO, preference learning, and the methods that shape model behaviour.
huggingface/trl
eric-mitchell/direct-preference-optimization
Agents and robotics
5 picks. Agentic runtimes, tool use, and embodied (robotics) systems.
modelscope/agentscope
geekan/MetaGPT
microsoft/autogen
langchain-ai/langgraph
openvla/openvla
Evaluation
4 picks. Eval harnesses and benchmarks for comparing models honestly.
EleutherAI/lm-evaluation-harness
princeton-nlp/SWE-bench
centerforaisafety/HarmBench
open-compass/opencompass
Infrastructure
7 picks. Serving, inference, data stores, and operational tooling.