Article How the LiteLLM PyPI Supply Chain Attack Happened — and What to Do If You're Affected davidberenstein1957 • Mar 25 • 1
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 18 items • Updated 6 days ago • 290
Article Phare LLM benchmark V2: Reasoning models don't guarantee better security davidberenstein1957 • Dec 16, 2025 • 10
Article LLM vulnerability scanner for dynamic & multi-turn Red Teaming JMJM • Sep 25, 2025 • 2
Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs davidberenstein1957 • May 7, 2025 • 42
Article RealPerformance, A Dataset of Language Model Business Compliance Issues davidberenstein1957 • Jul 21, 2025 • 4
Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs davidberenstein1957 • Jul 2, 2025 • 16
RealHarm: A Collection of Real-World Language Model Application Failures Paper • 2504.10277 • Published Apr 14, 2025 • 10
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 265
Article License to Call: Introducing Transformers Agents 2.0 +1 m-ric, lysandre, pcuenq • May 13, 2024 • 137
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22, 2024 • 126