AI Daily – 2025-12-12 (Morning)
Tags: Agent tool invocation, AI Model, ARC-AGI-1 breakthrough, Context Window, GDPval benchmark test, GPT-5.2, GPT-5.2 Thinking model, Knowledge base update, OpenAI, Professional work capability, Statistical learning theory, SWE-Bench Pro record, Visual capabilities

AI Daily – 2025-07-11 (Morning)
Tags: 256k Context Window, Benchmark Testing, Context Window, Elon Musk Quote Reference, Grok 4, Grok 4 Heavy, HLE Benchmark Testing, large language model, Long-Text Comprehension Capability, Mathematical Reasoning, Model Bias, xAI