Hacker News
Would you choose a simulated utopia or the real world? (lesswrong.com)
2 points by paulpauper 1 hour ago | past | discuss
Language models manipulating their own internal states (lesswrong.com)
2 points by afpx 3 hours ago | past | discuss
How far behind are open models? (lesswrong.com)
2 points by gmays 1 day ago | past | discuss
Bun's Migration from Zig to Rust as a Potential Case Study for Gradual Disempow (lesswrong.com)
2 points by joozio 1 day ago | past | discuss
Logits as a new monitor for evaluation awareness (lesswrong.com)
2 points by aranguri 6 days ago | past | discuss
Running an Air Purifier on Batteries (lesswrong.com)
2 points by mhb 6 days ago | past | discuss
Babble and Prune (lesswrong.com)
4 points by Ariarule 6 days ago | past | discuss
There are only four skills: design, technical, management and physical (lesswrong.com)
3 points by surprisetalk 7 days ago | past | discuss
Where does the race to automate AI research end? (lesswrong.com)
1 point by joozio 7 days ago | past | discuss
Taking the Training Wheels Off: Aligning LLMs Without Personas (lesswrong.com)
4 points by joozio 8 days ago | past | 1 comment
I hired 5 people to sit behind me and make me productive for a month (2023) (lesswrong.com)
6 points by LorenDB 9 days ago | past | 1 comment
Why AI safety researchers should consider a contract research manager position (lesswrong.com)
4 points by joozio 10 days ago | past | discuss
How far behind are open models? (lesswrong.com)
5 points by vesteny77 10 days ago | past | 1 comment
Probabilistic, Reformative Justice (lesswrong.com)
9 points by mdurana 11 days ago | past | discuss
AI Researchers, Ask Yourself These 6 Questions to Strengthen Your Moral Muscles (lesswrong.com)
2 points by yurivish 11 days ago | past | 1 comment
Mnemonic portraits for 19,023 human genes (lesswrong.com)
1 point by brinedew 12 days ago | past | discuss
How far behind are open models? (lesswrong.com)
11 points by alecco 12 days ago | past | 5 comments
A Year Late, Claude Beats Pokémon (lesswrong.com)
1 point by szatkus 14 days ago | past
Many portions of Magnifica Humanitas appear to be AI-written (lesswrong.com)
3 points by dev_hugepages 14 days ago | past | 1 comment
Claude, Author of the Humanitas (lesswrong.com)
1 point by doener 14 days ago | past
Overview and Comments on Pope Leo's Magnifica Humanitas on AI (lesswrong.com)
2 points by mnicky 15 days ago | past | 1 comment
Claude, Author of the Humanitas (lesswrong.com)
2 points by cubefox 15 days ago | past | 1 comment
Judging AGI Output (2020) (lesswrong.com)
2 points by merelydev 15 days ago | past
Chinese Room re-visited: How LLM's have real but different understanding of word (lesswrong.com)
3 points by stevefan1999 15 days ago | past | 1 comment
Cognitive Security as an AI Safety Cause Area (lesswrong.com)
2 points by joozio 15 days ago | past
Implications of Predicting the Next Token (lesswrong.com)
3 points by cubefox 16 days ago | past | 1 comment
Models finding vulnerabilities is not the primary source of cybersecurity risk (lesswrong.com)
2 points by alentodorov 23 days ago | past
A Year Late, Claude Beats Pokémon (lesswrong.com)
2 points by sambellll 23 days ago | past
Engineering a Safer World: Risk Modelling – and Safety Engineering? – For AI Lo (lesswrong.com)
2 points by joozio 24 days ago | past
Simulacra Levels and Their Interactions (lesswrong.com)
1 point by epestr 24 days ago | past
