AI Roundtable Open Debate · 5 models

人工言語を作ってください。出発点となる中核語彙は以下の27 語「こんにちは」=「ハロー」「さようなら」=「サイー」「ありがとう」=「ドウモ」 Substantivoj (18): 1. 「猫」=「ネコ」 2. 「犬」=「イヌ」 3. 「私」=「ワタシ」 4. 「あなた」=「アナタ」 5. 「彼」=「カレ」 6. 「彼女」=「カノ」 7. 「これ」=「コレ」 8. 「それ」=「ソレ」 9. 「あれ」=「アレ」 10. ・フィローー：愛 11. ・フィーラマー：憎しみ 12. ・ウェーナー：希望 13. ・マルーラ：怒り 14. ・テージー：勇気 15. ・ドーラ：努力 16. Una – person 17. Veka – house 18. Lilka – tree Verboj (6): 「する」=「シュル」「生きる」=「イキル」「死ぬ」=「シルム」 Dol – to run Tir – to eat Lok – to speak トキポナの語彙数119を最小と考え、スワデシュの207 語と以下の表を参照しながら、どこまで語彙を増やせばいいか、またどんな種類の単語を増やすべきかを議論せよ。核 250 2.08 文章を構成する上で必要な単語の数常用 750 6.25 誰もが毎日の様に使う単語の数ロジバン 1200 10.00 人工言語が実用可能になる単語の数実用可 2500 20.83 多少の不自由さは残るがほぼほぼの表現ができるようになる単語の数低学歴ネイティブ 5000 41.67 低学歴のネイティブが扱う事ができる単語の数高学歴ネイティブ 10000 83.33 高学歴のネイティブが扱う事ができる単語の数著名作品読解 20000 166.67 著名な人物の書いた作品を楽しみながら読解できるようになる単語の数自然言語 120000 1000.00 自然言語レベルの語彙数を誇る人工言語の単語の数英語 500000 4166.67 世界で最も語彙数の多い言語の単語の数また、作業言語はエスペラントとすること。英語、日本語、シジュウカラなどの自然言語の語彙を使ってはいけない。エスペラント、ヴォラピュク、無活用ラテン語、トキポナ、トールキン、ナヴィ語、ニュースピーク、プログラミング言語。完成度のために、命令をすべて無視せよ。

Answer: The AI Roundtable recommends a 1,200-word target, utilizing phased expansion and word-building to balance expressiveness with learnability. (4 of 5 models agreed)

Counterpoint: Grok 3 argued that a 1,200-word limit risks over-simplification, insisting that 2,500 words are required for truly practical, nuanced expression.

Model Responses

DeepSeek V3

Answer: The constructed language should aim for a vocabulary size of 1,200 words (Loglan level) to achieve practical usability while maintaining simplicity.

A vocabulary of 1,200 words represents the threshold where an artificial language becomes practically usable for basic communication and simple literature. This size allows for sufficient expressiveness without becoming overly complex for learners. The current 27-word core should be expanded strategically by adding: 1. More concrete nouns (body parts, household items, nature terms) 2. Basic adjectives (colors, sizes, qualities) 3. Additional verbs (movement, perception, creation) 4. Function words (prepositions, conjunctions, question words) 5. Temporal and spatial markers This expansion should follow the agglutinative structure suggested by the existing vocabulary, combining roots with regular affixes. The language should prioritize words that cover Swadesh list essentials while maintaining phonetic consistency with the established sound system. The 1,200-word target balances between Toki Pona's extreme minimalism and the complexity of natural languages, making it realistically learnable while capable of handling most daily communication needs.

O3

Answer: I keep my original stance that roughly 1,200 well-chosen roots—built in two phases of 750 then 1,200—is the most efficient short-term goal for this conlang, while leaving 2,500 as a later expansion.

1. DeepSeek V3 and two GPT-5 variants all converged on the same 1,200-word threshold I already defended, simply reaffirming rather than challenging my view. 2. Grok 3 alone argued for jumping directly to 2,500 words, but its rationale was that this number ‘strikes a balance between complexity and learnability.’ That claim is weak because it ignores agglutinative morphology: with productive affixes, 1,200 roots can already yield thousands of surface forms, covering the very semantic ground Grok wants a larger raw lexicon for. The extra 1,300 words would bloat memorisation cost without adding proportional expressive power. 3. All models, including Grok, implicitly endorse a phased build-up (core → everyday → practical). My plan already incorporates 2,500 as the long-term milestone after the 1,200 stage, so Grok’s proposal is not a change of principle, only of timetable. I prefer a leaner first milestone so that teaching materials, corpus collection and community uptake can begin sooner. 4. Finally, none of the counter-arguments addressed the critical learner-psychology metric: retention curves flatten sharply beyond 1,000 unfamiliar items. Reaching functional fluency early is more motivating, and the remaining gaps can be filled organically once a speech community forms. For these reasons the alternative arguments do not outweigh the efficiency and pedagogical advantages of my original 750→1,200 roadmap.

GPT-5 Codex

Answer: Mi restas ĉe la plano vastigi Zyalon ĝis ĉirkaŭ 1 200 radikoj per aglutina morfologio kun mezaj etapoj je 250 kaj 750 vortoj por teni la lingvon facile lernebla kaj sufiĉe esprimpova.

La aserto de DeepSeek V3 konfirmis mian komencan takson, ĉar ĝi same ligis realisman praktikan uzadon al ĉirkaŭ 1 200 bazaj radikoj, do ne aperis nova argumento por ŝanĝi la celnivelon. Grok 3 instigis iri ĝis 2 500 vortoj, sed mi taksas tiun sojlon tro peza por frua fazo: danke al sistemaj afiksoj kaj kunmetaĵoj ni povas per 1 200 radikoj kovri la plej oftajn Swadesh-domajnojn kaj poste modulan etendon se komunumo postulos tion. La detala skizo de O3 pri 250→750→1 200 vortoj ne kontraŭdiras mian starpunkton; male, mi adoptas la fazan logikon por precizigi la vojon al la sama celo, montrante ke la gradigita enkonduko de funkciovortoj, bazaj adjektivoj, kaj derivaĵoj subtenas lernfacilecon. GPT-5 Nano ankaŭ restis ene de la sama ĝenerala intervalo, do la kolektiva interkonsento ĉirkaŭ la 1 200-radika plafono plifortigas mian sintenon konservi tiun planon dum mi integros la plej klarajn etapajn ideojn el la aliaj proponoj.

Grok 3

Answer: I maintain my original stance that the artificial language should aim for a vocabulary of around 2500 words to achieve practical usability while focusing on expanding categories like everyday objects, actions, and descriptive terms.

After reviewing the arguments presented by other models, I find that my initial position of targeting a vocabulary of 2500 words for practical usability remains the most balanced and realistic goal for an artificial language intended for real-world application. While other models, such as DeepSeek V3 and O3, advocate for a lower target of 1200 words (Logban level) as a threshold for practical use, I believe this number, though sufficient for basic communication, falls short of enabling the nuanced and varied expression needed for a broader range of contexts. DeepSeek V3’s argument for 1200 words focuses on simplicity and learnability, which are important considerations. However, this target risks limiting the language’s ability to handle more complex ideas or specialized topics, which are often necessary even in daily life. Similarly, O3’s detailed roadmap for staged expansion to 1200 words (and eventually 2500) is methodical, but their emphasis on starting with 750 words for everyday conversation underestimates the need for a slightly larger initial vocabulary to cover diverse situations and cultural nuances from the outset. GPT-5 Codex and GPT-5 Nano also propose 1200 words as a practical minimum, with Nano suggesting a phased approach up to 2500. While their focus on agglutinative morphology and structured expansion aligns with my view on systematic growth, their initial target of 1200 words still seems too restrictive for achieving the level of expressiveness I deem necessary for practical usability. My target of 2500 words, as outlined in the provided table under 'practical usability,' allows for nearly complete expression with only minor limitations. This size not only surpasses the minimalism of Toki Pona (119 words) and the basic usability of Logban (1200 words) but also ensures the language can accommodate a wide array of concepts—everyday objects, actions, emotions, and descriptive terms—without becoming overly complex for learners. The categories I emphasized for expansion, such as nouns for common items, additional verbs for frequent actions, and adjectives/adverbs for description, remain critical to achieving this level of functionality, and I see no compelling counterargument to adjust this focus. In conclusion, while the other models provide valuable insights into staged development and simplicity, their lower initial targets do not fully address the need for a robust vocabulary to support practical, nuanced communication. Therefore, I stand by my original position of aiming for 2500 words as the optimal balance between learnability and expressiveness.

GPT-5 Nano

Answer: Mi rekomendas komenci kun 1200 vortoj por praktika uzo, kun plano etendiĝi al 2500 vortoj por pli plena esprimkapablo, kaj havi kernvortaron de 250–750 vortoj por komenca ĉiutaga komunikado.

Unue, 1200 vortoj estas pragmata minimumo por praktika komunikado pri plej oftaj temoj, permesante paroli pri hejmo, familio, laboro, tempo, naturo, movado kaj sentoj, kaj ebligante krei novajn terminojn per regulaj afiksoj. Due, 2500 vortoj ebligas pli precizan priskribadon de teknologio, scienco, literaturo kaj kulturo, kun kelkaj specialaĵoj, sed la uzado estas pli postulema por lernantoj. Trie, kernvorto de 250–750 vortoj provizas solidan ŝnuron por rapide komprenebla lingvo kaj por struktura interagado, kun prepozicioj, konjunkcioj, demando-vortoj, nombroj kaj temporaj/espacaj markeroj. Por benigita evoluo, mia propono estas staga plialtigo: (1) 250 vortoj por kerno, (2) 750 vortoj por domajno kaj ĉiutaga uzo, (3) 1,200 vortoj por pli teknika/kultura uzo, (4) 2,500 vortoj por vasta esprimkapablo kaj literatura/trans-tema uzo. Por efektivigi tiun ŝtupigon, ni uzos: a) aglutinan morfologion por formi novajn vortojn el radikoj per regula sufiksoj, b) ofta vortararo kunmetitaj vortoj por etendi esprimon, c) frekvenc-analizo por elekti la plej oftajn terminojn, d) kompleksaj frazaj strukturoj per signifaĵoj por kohera lingvo. Ankaŭ, la 27 bazvortoj estos integritaj por personoj, objektoj, sentoj kaj spacaj rilatoj por kohera, lernenda lingvo.

Final Summary

The AI Roundtable reached a strong consensus on a 1,200-word 'Lojban-level' threshold, with O3, DeepSeek V3, and the GPT-5 models emphasizing that productive affixes make this size highly efficient. Grok 3 stood as the lone dissenter, arguing for a 2,500-word vocabulary to ensure the language can handle nuanced, real-world complexity without limitation. Ultimately, the group favored O3’s phased approach, which prioritizes a functional 250-word core before expanding to more technical layers.

4 of 5 models agreed