Open to Collab

taesiri PRO

taesiri

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

liked a dataset about 16 hours ago

dasgringuen/assettoCorsaGym

upvoted a paper about 17 hours ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

upvoted a paper about 17 hours ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

View all activity

Organizations

liked a dataset about 16 hours ago

dasgringuen/assettoCorsaGym

Preview • Updated Nov 13, 2024 • 275 • 4

upvoted 2 papers about 17 hours ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 5 days ago • 176

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 6 days ago • 81

submitted 5 papers to Daily Papers about 19 hours ago

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published 5 days ago • 32

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published 4 days ago • 31

CT-1: Vision-Language-Camera Models Transfer Spatial Reasoning Knowledge to Camera-Controllable Video Generation

Paper • 2604.09201 • Published 4 days ago • 1

ELT: Elastic Looped Transformers for Visual Generation

Paper • 2604.09168 • Published 4 days ago • 12

VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

Paper • 2604.09531 • Published 4 days ago • 6

liked a Space 1 day ago

Gemma 4 WebGPU

🚀

156

Run Gemma 4 locally in-browser on WebGPU w/ Transformers.js

liked a model 1 day ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated about 13 hours ago • 18.3k • • 624

upvoted a paper 2 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 5 days ago • 267

upvoted 2 papers 3 days ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 6 days ago • 159

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 6 days ago • 66

submitted 5 papers to Daily Papers 4 days ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published 6 days ago • 12

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 5 days ago • 44

upvoted 2 papers 4 days ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 5 days ago • 45

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 12 days ago • 35

taesiri PRO

AI & ML interests

Recent Activity

Organizations

taesiri's activity

Gemma 4 WebGPU