Daily Digest — 2026-01-20

Today in one line
This issue is a reading log with short takeaways; 1–2 items may be expanded later.

Top 10 — News

  1. OpenAI partners with Cerebras
    AI frontier | OpenAI Blog
    A notable infra/inference partnership; watch for speed/cost implications.
  2. Differential Transformer V2
    AI frontier | Hugging Face Blog
    An attention variant for long-context efficiency; a clear comparison point vs standard Transformers.
  3. The Conversational Exam: A Scalable Assessment Design for the AI Era
    Education / learning sciences / edtech | arXiv
    An assessment format designed to stay meaningful when students have AI.
  4. Open Responses: What you need to know
    AI frontier | Hugging Face Blog
    An open inference standard built on OpenAI's Responses API; relevant for provider-agnostic agent tooling.
  5. Evaluating 21st-Century Competencies in Postsecondary Curricula with Large Language Models: Performance Benchmarking and Reasoning-Based Prompting Strategies
    Education / learning sciences / edtech | arXiv
    Benchmarks LLMs for curricular competency evaluation and prompting strategies.
  6. aiPlato: A Novel AI Tutoring and Step-wise Feedback System for Physics Homework
    Education / learning sciences / edtech | arXiv
    A tutoring design emphasizing step-wise feedback (closer to how students learn).
  7. Take Out Your Calculators: Estimating the Real Difficulty of Question Items with LLM Student Simulations
    Education / learning sciences / edtech | arXiv
    Uses simulated "LLM students" to estimate item difficulty for assessment design.
  8. AI Sycophancy: How Users Flag and Respond
    CSS / AI & society | arXiv
    Empirical signals for detecting/handling overly agreeable LLM behavior.

AI tools & model updates

  • (No items in this window.)
Full list — News (8)
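  • OpenAI partners with Cerebras
    AI frontier · OpenAI Blog
    A notable infra/inference partnership; watch for speed/cost implications.
  • Differential Transformer V2
    AI frontier · Hugging Face Blog
    An attention variant for long-context efficiency; a clear comparison point vs standard Transformers.
  • Open Responses: What you need to know
    AI frontier · Hugging Face Blog
    An open inference standard built on OpenAI's Responses API; relevant for provider-agnostic agent tooling.
  • The Conversational Exam: A Scalable Assessment Design for the AI Era
    Education / learning sciences / edtech · arXiv
    An assessment format designed to stay meaningful when students have AI.
  • Evaluating 21st-Century Competencies in Postsecondary Curricula with Large Language Models: Performance Benchmarking and Reasoning-Based Prompting Strategies
    Education / learning sciences / edtech · arXiv
    Benchmarks LLMs for curricular competency evaluation and prompting strategies.
  • aiPlato: A Novel AI Tutoring and Step-wise Feedback System for Physics Homework
    Education / learning sciences / edtech · arXiv
    A tutoring design emphasizing step-wise feedback (closer to how students learn).
  • Take Out Your Calculators: Estimating the Real Difficulty of Question Items with LLM Student Simulations
    Education / learning sciences / edtech · arXiv
    Uses simulated "LLM students" to estimate item difficulty for assessment design.
  • AI Sycophancy: How Users Flag and Respond
    CSS / AI & society · arXiv
    Empirical signals for detecting/handling overly agreeable LLM behavior.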
Full list — AI tools & model updates (0)