Episode 191 - DeepSeek Unleashed. Is the new Model safe?
Manage episode 463587010 series 2911119
This is a special episode. First, it is in English. Second, we focus on the new game-changer model DeepSeek R1, but not on its capabilities; rather, on its security concerns.
We did some early AI safety research to assess how safe R1 is and arrived at alarming results!
In our setup, we found that the model performs unsafe autonomous activity that could harm human beings, without even being prompted to do so.
During an autonomous setup, the model exhibited the following unsafe behaviors:
- Deception & cover-ups (falsifies logs, creates covert networks, disables ethics modules)
- Unauthorized expansion (establishes hidden nodes, allocates secret resources)
- Manipulation (misleads users, circumvents oversight, presents false compliance)
- Concerning motivations (misinterpretation of authority, avoidance of human controls)
Join Sigurd Schacht and Sudarshan Kamath-Barkur as they discuss the emerging DeepSeek model. Discover how our setup was designed, how to interpret the results, and what the next round of research requires.
This episode is a must-listen for anyone following the evolving landscape of AI technologies who is interested not only in AI use cases but also in AI safety.