นักวิจัย เล่าประสบการณ์ train Claude 3 ...

Why Nostr? What is Njump?

npub1e9v…j93f

2024-06-10 02:06:27

นักวิจัย เล่าประสบการณ์ train Claude 3

Amanda Askell จบทางปรัชญามา และรับหน้าที่ fine-tune เพื่อสร้างบุคลิกให้กับ Claude โดยเทคนิค RLAIF ตามเกณฑ์บุคลิกที่เรากำหนด เช่น สงสัยใฝ่รู้, เปิดกว้าง, ช่างคิด งานตรงนี้มีความยาก ไม่สามารถใช้คำตอบง่ายๆ ได้

"Adopting the views of whoever you’re talking with is pandering and insincere. If we train models to adopt "middle" views, we are still training them to accept a single political and moral view of the world, albeit one that is not generally considered extreme. Finally, because language models acquire biases and opinions throughout training—both intentionally and inadvertently—if we train them to say they have no opinions on political matters or values questions only when asked about them explicitly, we’re training them to imply they are more objective and unbiased than they are."

บทความ: https://www.anthropic.com/research/claude-character

source : https://www.facebook.com/share/GGb6vF1brqc7Rk3i

#siamstr #claudestr #AI #philosophy

quoting note10p9…rn85
https://youtu.be/iyJj9RxSsBY

Author Public Key

npub1e9vcz6204fft6jxvyf0edd3a54t8n9znz007h94mmwlkqlqeulzqfjj93f

Show more details

Published at

2024-06-10 02:06:27

Kind type

1 Short Text Note

Event JSON

{ "id": "9bb39375f73df2d7ab73b13c5a3e7c85b4edeb41812cb83bec61ce2e82357730", "pubkey": "c95981694faa52bd48cc225f96b63da55679945313dfeb96bbdbbf607c19e7c4", "created_at": 1717977987, "kind": 1, "tags": [ [ "p", "c95981694faa52bd48cc225f96b63da55679945313dfeb96bbdbbf607c19e7c4" ], [ "r", "https://www.anthropic.com/research/claude-character" ], [ "r", "https://www.facebook.com/share/GGb6vF1brqc7Rk3i" ], [ "t", "siamstr" ], [ "t", "claudestr" ], [ "t", "ai" ], [ "t", "philosophy" ] ], "content": "นักวิจัย เล่าประสบการณ์ train Claude 3\n\nAmanda Askell จบทางปรัชญามา และรับหน้าที่ fine-tune เพื่อสร้างบุคลิกให้กับ Claude โดยเทคนิค RLAIF ตามเกณฑ์บุคลิกที่เรากำหนด เช่น สงสัยใฝ่รู้, เปิดกว้าง, ช่างคิด งานตรงนี้มีความยาก ไม่สามารถใช้คำตอบง่ายๆ ได้\n\n\"Adopting the views of whoever you’re talking with is pandering and insincere. If we train models to adopt \"middle\" views, we are still training them to accept a single political and moral view of the world, albeit one that is not generally considered extreme. Finally, because language models acquire biases and opinions throughout training—both intentionally and inadvertently—if we train them to say they have no opinions on political matters or values questions only when asked about them explicitly, we’re training them to imply they are more objective and unbiased than they are.\"\n\nบทความ: https://www.anthropic.com/research/claude-character\n\nsource : https://www.facebook.com/share/GGb6vF1brqc7Rk3i\n\n#siamstr #claudestr #AI #philosophy nostr:note10p9qupnadqajx4kst3dr9y4q0ejxkeadrz2lcgxlgg4wh59p95jqxyrn85", "sig": "dfe05427c1e4c0fba2b3f61d28f599758ca7afad0d7ae9ef556fe83ca81309df02065c28dba51c3e9ddcb8561b7bc358e2afa6ab84da9b1af53401ba77f1ef76" }