limoce

Joined 4/3/2021, 1:02:35 PM has 744 karma

Posts

Machine Learning Conferences Should Establish "Refutations and Critiques" Track
by limoce on 6/26/2025, 10:27:16 AM with 0 comments
SuperGPQA: Scaling LLM Evaluation Across 285 Graduate Disciplines
by limoce on 3/4/2025, 7:26:48 AM with 0 comments
SepLLM: Accelerate LLMs by Compressing One Segment into One Separator
by limoce on 3/3/2025, 1:27:26 PM with 2 comments
Step-Video-T2V: The Practice, Challenges, and Future of Video Foundation Model
by limoce on 2/17/2025, 9:54:46 AM with 5 comments
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
by limoce on 2/5/2025, 4:02:04 AM with 0 comments
Libnginx: Nginx as a Shared Library
by limoce on 2/4/2025, 7:56:27 AM with 0 comments
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
by limoce on 12/13/2024, 12:53:00 PM with 0 comments
Fast vectorizable algorithms of binary searching for floating point numbers
by limoce on 11/15/2024, 12:53:13 AM with 0 comments
New OpenAI Feature: Predicted Outputs
by limoce on 11/5/2024, 2:47:19 AM with 7 comments
Collaborative Filtering Is Wrong and Here Is Why
by limoce on 10/24/2024, 9:03:26 AM with 0 comments
REST: A Plug-and-Play Method for Accelerating LLM Without Additional Training
by limoce on 10/20/2024, 6:13:57 AM with 0 comments
Smoke 'em if you got 'em: Hacker gains root access using cigarette lighter
by limoce on 10/12/2024, 1:20:01 PM with 0 comments
O1 Replication Journey: A Strategic Progress Report
by limoce on 10/9/2024, 8:09:57 AM with 0 comments
Failures of Gradient-Based Deep Learning (2017) [pdf]
by limoce on 8/15/2024, 10:19:24 AM with 0 comments
Qwen2-VL
by limoce on 8/14/2024, 8:20:03 AM with 0 comments
Qwen2-Math
by limoce on 8/8/2024, 3:00:18 PM with 38 comments
FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention
by limoce on 8/8/2024, 7:24:58 AM with 24 comments
MiniCPM-v2.6: GPT-4V Level MLLM for Single/Multi Image and Video on Your Phone
by limoce on 8/7/2024, 2:00:23 AM with 0 comments
MindSearch: LLM-Based Web Search Engine Similar to Perplexity.ai and SearchGPT
by limoce on 8/1/2024, 8:53:45 AM with 0 comments
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
by limoce on 6/12/2024, 11:01:55 AM with 0 comments
PowerInfer-2: Fast Large Language Model Inference on a Smartphone
by limoce on 6/11/2024, 2:19:20 PM with 0 comments
Large-scale photonic chiplet Taichi empowers 160TOPS/W AI
by limoce on 4/12/2024, 7:49:28 AM with 0 comments
Asterinas: OS kernel written in Rust and providing Linux-compatible ABI
by limoce on 3/5/2024, 8:52:13 AM with 0 comments
Mq-deadline scalability improvements (with more than 100% improvement)
by limoce on 1/20/2024, 12:04:54 PM with 0 comments
Researchers Create First Functional Semiconductor Made from Graphene
by limoce on 1/5/2024, 4:45:25 AM with 0 comments
Wayland Enjoyed Many Successes in 2023
by limoce on 1/2/2024, 3:54:56 AM with 0 comments
Improving our safety with a physical quantities and units library
by limoce on 12/23/2023, 6:01:29 AM with 68 comments
Nesting chinstrap penguins sleep by seconds-long microsleeps
by limoce on 12/22/2023, 2:31:54 PM with 0 comments
PowerInfer: High-Speed Large Language Model Serving on Consumer-Grade GPUs
by limoce on 12/19/2023, 12:19:09 PM with 1 comments