Quickpanda

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

upvoted a paper 8 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

liked a model 8 months ago

RabotniKuma/Fast-Math-R1-14B

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 5 days ago • 47

upvoted a paper 8 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 19

liked a model 8 months ago

RabotniKuma/Fast-Math-R1-14B

Text Generation • 15B • Updated Jul 15 • 76 • 3

liked a dataset 9 months ago

zwhe99/DeepMath-103K

Viewer • Updated May 29 • 103k • 18.3k • 285

updated a model 9 months ago

Quickpanda/deepcoder-14b-preview-awq

15B • Updated Apr 14 • 5 • 2

published a model 9 months ago

Quickpanda/deepcoder-14b-preview-awq

15B • Updated Apr 14 • 5 • 2

updated a model 9 months ago

Quickpanda/deepseek-14b-sft-dpo4-awq

15B • Updated Apr 13 • 4

published a model 9 months ago

Quickpanda/deepseek-14b-sft-dpo4-awq

15B • Updated Apr 13 • 4

liked 2 models over 1 year ago

mlx-community/Llama-3-8B-Instruct-1048k-4bit

Text Generation • 1B • Updated Apr 29, 2024 • 259 • 25

refuelai/Llama-3-Refueled

Text Generation • 8B • Updated May 9, 2024 • 8.27k • • 190

liked a Space over 1 year ago

modelscope-studio

🚀

A third-party component library based on Gradio.

upvoted an article over 1 year ago

Article

Merge Large Language Models with mergekit

Jan 9, 2024

•

147

liked a Space almost 2 years ago

moondream2

🌔

439

a tiny vision language model