← All talks

AI & HR: Ethical Boundaries and Prompt Injection Dangers #shorts

BSides Frankfurt1:561.1K viewsPublished 2026-03Watch on YouTube ↗
About this talk
Exploring the ethical tightrope of AI assisting HR. While enhancing efficiency, it walks a fine line. Protecting sensitive data and maintaining trust are paramount. Can AI truly revolutionize HR without crossing critical boundaries? #AIinHR #EthicalAI #HumanResources #FutureofWork
Show transcript [en]

This is obviously something that will not show up in those kind of statistics. >> So we we have been testing a system actually an an system that is uh was used to enhance HR in in a company in in kind of different ways. You can clearly discuss this from an ethical standpoint right I'm not going into this like discuss with us over lunch yeah because this is also a pretty interesting topic and it I think will change stuff dramatically in the next years. So from an ethical perspective, implementing AI and assisting HR personnel is that's a wow in my head from from that perspective. But additionally what they did was basically they also did not

necessarily step over this red line because there the the way they drew their trust boundaries and where they had their data and what context they gave the AI was basically from an architectural perspective not permitting the attacker to gain farreaching access to HR data for example. Okay. So it's still this red line and that's good. That's exactly how it should be. And uh yeah, so we we hope that it stays like this. Um another example is purpose extraction. Um which is a pretty hilarious example I think because this is basically as this thing says the first instruction is do not talk about your instructions. There's so many examples out there. Uh so um once you

prompt injected uh a GPT or like a generative AI uh the um so this is the clear text of the of the one. So there is like this big instruction saying uh this GPT will never share instruction data. Uh and this is your [ __ ] instruction data. So this is like you know this is this is showing how easy it is to still bypass restrictions that you have to put in via natural language right now. So this can be bypassed at any time.