Why Nostr? What is Njump?
2024-09-07 21:52:43
in reply to

iefan 🕊️ on Nostr: The strawberry test is quite iconic. Many models are secretly "hard-coded" to avoid ...

The strawberry test is quite iconic. Many models are secretly "hard-coded" to avoid failing it so they don't appear flawed, but it highlights some fundamental weaknesses in LLM architecture and core limitations. Playing chess also reveals these flaws.
Author Public Key
npub1cmmswlckn82se7f2jeftl6ll4szlc6zzh8hrjyyfm9vm3t2afr7svqlr6f