software-engineering
711 articles in this category (Page 7 of 30)
AI News, New Releases, Software Engineering
AntAngelMed: Optimizing 103B-Parameter Medical LLMs via 1/32 MoE Activation
AntAngelMed is a 103B-parameter open-source medical LLM that uses a 1/32 MoE activation ratio to deliver 200+ tokens/s while outperforming proprietary models on OpenAI's HealthBench.
AI News, Large Language Model, Software Engineering
Mastering LLM Distillation: Soft-Label, Hard-Label, and Co-distillation Strategies
LLM distillation transfers reasoning capabilities from a teacher model to a student model, reducing costs while maintaining performance through techniques such as soft-label distillation, hard-label distillation, and co-distillation.
AI News, Agentic AI, Software Engineering
NadirClaw: Building Cost-Aware LLM Routing with Local Prompt Classification
NadirClaw introduces an intelligent local routing layer that classifies prompts into simple and complex tiers, enabling dynamic switching between Gemini Flash and Pro to reduce LLM costs by up to 50%.