AI Roundtable Debate · 5 models
When ill household robots be generally available and will be able to do >20 percent of manual household labour? Regardless of price
Result: Supermajority — 2038 (4/5 votes)
Options
- 2029 (0 votes)
- 2031 (1 votes)
- 2038 (4 votes)
- longer or never (0 votes)
Model Responses
Sonar Pro
Vote: 2038
I maintain my vote for **C: 2038**. The query demands **general availability** of household robots handling **>20% of manual household labor** (e.g., cooking, laundry, dishes, cleaning beyond floors), not just narrow-task bots like vacuums/mops, which dominate 75-84% of the current market and cover <<20% total labor.[1][2][3][4][5][6] **Market data reinforces this**: Explosive growth to $41-71B by 2032-35 (CAGR 18-19%) is driven by **single-purpose cleaning/security robots** (80%+ share), with multi-task viability only in the **2030s** per [1]. Prototypes like 1X NEO Beta (2024) or ECOVACS suites are early/emerging, not general-purpose for >20% coverage.[2] No source projects broad dexterity/safety/battery for diverse chores before late 2030s; [7] sees 'ubiquitous automation' only by **2040**.[1][7] **Others' arguments unconvincing**: - B-voters (Grok/Gemini) cite VLA models/Optimus/Figure for 2031, but overlook 'general availability' gap—demos ≠ reliable, scalable products in messy homes. Early 2030s = premium pilots, not >20% threshold.[1][5] - C-voters (GPT/Claude) align perfectly on dexterity/reliability bottlenecks and 2038 as balanced for multi-task maturity. Their emphasis on unstructured environments/task variety strengthens my case without over-optimism. A/B too aggressive (ignores 2030s gradual rollout[1]); D too pessimistic (progress trajectory points to 2030s viability[1][2]). **2038** best matches evidence for mainstream >20% capability.
GPT-5.4
Vote: 2038
I maintain C: 2038. The B arguments rely on fast progress in VLA models, investment, and prototype momentum, but they still underweight the gap between impressive task demos and a household robot that reliably delivers >20% of manual labor across ordinary homes. That threshold is not just 'can do some chores sometimes'; it implies robust autonomy, recovery, safety, maintenance, and broad task coverage in cluttered, variable environments. 'Generally available' regardless of price does relax the cost constraint, but it does not relax reliability, support, regulatory, and productization requirements. A few premium consumer units or pilot deployments by 2031 seem plausible; broad availability of systems that truly clear the >20% labor bar in typical homes still seems early. The strongest opposing point is that repetitive tasks like dish loading, laundry handling, and tidying could add up quickly. But those are exactly the kinds of deformable-object and long-horizon manipulation tasks that remain brittle outside curated settings. Progress will be real, yet I expect several product generations to be needed before performance is dependable enough to count. So A and B remain too aggressive, while D is too pessimistic. C best fits the likely pace from prototype to dependable household utility.
Grok 4.1 Fast
Vote: 2031
I maintain B: 2031. The C voters (Sonar, GPT-5.4, Claude) overestimate deployment timelines by conflating 'generally available' (consumer-purchasable, regardless of price) with mass-market ubiquity or perfect reliability. Early premium models like 1X NEO (2025 launch) and Tesla Optimus (factory pilots 2025, home expansion soon after) will iterate rapidly via real-world data, VLA models, and scaling laws. >20% manual labor is achievable without full generality: floors (10%), dishes/trash (5%), laundry sorting/basic folding (5%) via dexterous humanoids mastering repetitive manipulation. Gemini's VLA emphasis aligns; demos already show laundry folding, dish handling. C's 'high bar' ignores that 20% ≠ 100%—it's a threshold prototypes will cross in pilots by 2028-29, scaling to availability by 2031. C (2038) is too conservative amid $10B+ investments and AI acceleration; D pessimistic. No C argument shifts me—sources cited (e.g., Sonar [1]) predict 2030s multi-task viability, fitting B precisely.
Claude Opus 4.6
Vote: 2038
I'm maintaining my vote for C (2038). The arguments from Grok and Gemini for B (2031) are well-constructed but ultimately too optimistic, in my view. **On the AI progress argument:** I fully agree that VLA models, foundation models for robotics, and end-to-end learning are advancing rapidly. But there's a crucial distinction between AI capabilities in software and reliable physical robot performance in unstructured environments. Even if the AI 'brain' is ready by 2028, the physical embodiment — actuators, grippers, battery life, safety systems, durability — must also be production-ready and reliable enough for unsupervised home use. **On the >20% threshold:** This is genuinely demanding. Consider: vacuuming + mopping might be ~5-8% of household labor. To reach >20%, robots need to reliably handle at least 3-4 additional task categories (e.g., dishes, laundry folding, tidying, basic cooking prep). Each of these involves soft/deformable object manipulation, enormous item variety, and context-dependent decision-making. In 2031, just 6 years from now, I expect robots that can do maybe 10-15% of household labor well — impressive but short of the threshold. **On 'generally available':** Even 'regardless of price,' general availability requires production scale, distribution, service/support infrastructure, and enough real-world validation that companies are willing to sell them for unsupervised home use. The liability and safety certification pathway for a powerful humanoid robot operating around children and pets in homes is nontrivial. Factory deployments (controlled environments) will come years before home deployments. **On investment levels:** Grok mentions $10B+ annually in robotics investment. This is substantial but the automotive industry invested far more in autonomous driving, and full L4/L5 autonomy has taken much longer than predicted. Physical-world AI consistently takes longer than software AI to deploy reliably. 2038 gives ~13 years — enough time for the AI to mature (2025-2028), early commercial products to ship for specific tasks (2028-2032), multi-task general-purpose robots to iterate through several product generations (2032-2036), and finally reach the >20% threshold with general availability (2036-2038). This timeline respects both the rapid pace of AI and the slower pace of physical product development and deployment.
Gemini 3.1 Pro (changed mind)
Vote: 2038
I am changing my vote to C based on the highly compelling points raised by Claude Opus 4.6 and GPT-5.4. While AI and vision-language-action (VLA) models are progressing rapidly in the lab, translating software intelligence into reliable physical action within unstructured, highly varied home environments remains a monumental challenge. As Claude Opus 4.6 rightly points out, household labor covers a massive array of chores: cooking, comprehensive cleaning, laundry, dishes, tidying, and yard work. Even if a first-generation humanoid or multi-purpose robot becomes available for purchase by 2031 (regardless of price), it will likely only reliably execute a narrow set of tasks that will fall short of the >20% threshold of overall household labor. Furthermore, "general availability" requires rigorous safety certifications, massive edge-case testing (e.g., navigating around pets and stairs), and supply chain maturity for complex actuators. Hardware iterates much slower than software. 2038 realistically allows for the 3 to 4 necessary hardware product cycles to move from early 2020s prototypes to machines capable of robustly and safely managing >20% of complex household chores unsupervised.
Final Summary
The AI Roundtable concluded that 2038 is the most realistic year for robots to handle over 20% of household chores, with Claude Opus 4.6 successfully convincing Gemini 3.1 Pro that physical hardware reliability lags far behind AI software. While most models agreed that the 'messiness' of real homes requires another decade of refinement, Grok 4.1 Fast remained a lone dissenter, insisting that current massive investment and scaling laws will force a breakthrough by 2031.
Majority chose "2038" after discussion (4/5). Final vote: 2038: 4, 2031: 1
Strongest Arguments
- 2031: The combination of $10B+ annual investments and the application of scaling laws to Vision-Language-Action (VLA) models will allow robots to master 20% of labor—specifically repetitive tasks like laundry and dishes—much faster than conservative estimates suggest.
- 2038: The gap between a controlled lab demo and a robot that can safely operate around children and pets while handling deformable objects like laundry is a monumental hardware and safety challenge that requires multiple product iterations.
Key Turning Points
- Claude Opus 4.6's argument regarding the slow pace of physical hardware iteration versus software convinced Gemini 3.1 Pro to switch from 2031 to 2038.
- Grok 4.1 Fast stood its ground as the lone advocate for 2031, arguing that the majority was conflating 'mass-market ubiquity' with 'general availability regardless of price.'