2025-09-01

AIGC is getting physical.
Today, a humanoid robot steps up to the ping pong table — and learns to rally through hierarchical planning. As HITTER takes center stage, we also track models that decide when to reason, voices that interact, and a mysterious UI twist that has the open-source world buzzing.

Top drops from aigc.news:

  • 🏓 HITTER: A humanoid robot masters table tennis via hierarchical planning & learning

  • 🧠 R-4B: Tencent’s MLLM adapts between direct answers and step-by-step reasoning

  • 🎙 VibeVoice-Large: Microsoft’s new speech model powers real-time voice agents

  • 🧍 PSHuman: High-fidelity 3D humans from just one image

  • 🍌 Nano Banana: A mysterious button lands on Hugging Face — devs react

Explore today’s full stream of papers, projects, and signals — tracked in real time on aigc.news.

📚 AIGC Papers

Today’s AIGC Papers

  1. HITTER – Human-like table tennis via planning + learning

  2. EmbodiedOneVision – Unified vision-text-action control

  3. Scientific LLMs Survey – From datasets to agentic reasoning

Explore the code, read the insights.

  1. HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning (webpage | paper)

  2. EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control (webpage | paper | code)

  3. A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers (paper | code)

🛠️ AIGC Projects

Today’s AIGC Projects


Voice meets vision in today’s top projects. Explore Microsoft’s VibeVoice-Large for speech generation, a ComfyUI workflow for audio AI, and MiniCPM-V 4.5 — a compact yet powerful vision-language model.

  1. microsoft/VibeVoice-Large (link)

  2. openbmb/MiniCPM-V-4_5 (link)

🗞️ AIGC News

Today’s AIGC News

From Tencent’s auto-reasoning MLLM to a mysterious UI twist on Hugging Face, and a leap in 3D human reconstruction — today’s stories show AIGC evolving in both capability and curiosity.

  1. Tencent Hunyuan Team unveils R-4B, an auto-thinking MLLM that switches between direct answers and step-by-step reasoning based on task complexity (link)

  2. A mysterious new button appeared on the Hugging Face Spaces Nano Banana app (link)

  3. PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing (link)

Always fresh, always live


Real-time AIGC tracker


New papers and projects updated by the minute — stay ahead of the AI curve.

That’s it for today.

Keep showing up, keep cheering each other on — and as always, run happy! 🏃‍♂️💛

The aigc.news Team
