#data owner
Explore tagged Tumblr posts
jcmarchi · 1 year ago
Text
How Bias Will Kill Your AI/ML Strategy and What to Do About It
New Post has been published on https://thedigitalinsider.com/how-bias-will-kill-your-ai-ml-strategy-and-what-to-do-about-it/
How Bias Will Kill Your AI/ML Strategy and What to Do About It
‘Bias’ in models of any type describes a situation in which the model responds inaccurately to prompts or input data because it hasn’t been trained with enough high-quality, diverse data to provide an accurate response. One example would be Apple’s facial recognition phone unlock feature, which failed at a significantly higher rate for people with darker skin complexions as opposed to lighter tones. The model hadn’t been trained on enough images of darker-skinned people. This was a relatively low-risk example of bias but is exactly why the EU AI Act has put forth requirements to prove model efficacy (and controls) before going to market. Models with outputs that impact business, financial, health, or personal situations must be trusted, or they won’t be used.
Tackling Bias with Data
Large Volumes of High-Quality Data
Among many important data management practices, a key component to overcoming and minimizing bias in AI/ML models is to acquire large volumes of high-quality, diverse data. This requires collaboration with multiple organizations that have such data. Traditionally, data acquisition and collaborations are challenged by privacy and/or IP protection concerns–sensitive data can’t be sent to the model owner, and the model owner can’t risk leaking their IP to a data owner. A common workaround is to work with mock or synthetic data, which can be useful but also have limitations compared to using real, full-context data. This is where privacy-enhancing technologies (PETs) provide much-needed answers.
Synthetic Data: Close, but not Quite
Synthetic data is artificially generated to mimic real data. This is hard to do but becoming slightly easier with AI tools. Good quality synthetic data should have the same feature distances as real data, or it won’t be useful. Quality synthetic data can be used to effectively boost the diversity of training data by filling in gaps for smaller, marginalized populations, or for populations that the AI provider simply doesn’t have enough data. Synthetic data can also be used to address edge cases that might be difficult to find in adequate volumes in the real world. Additionally, organizations can generate a synthetic data set to satisfy data residency and privacy requirements that block access to the real data. This sounds great; however, synthetic data is just a piece of the puzzle, not the solution.
One of the obvious limitations of synthetic data is the disconnect from the real world. For example, autonomous vehicles trained solely on synthetic data will struggle with real, unforeseen road conditions. Additionally, synthetic data inherits bias from the real-world data used to generate it–pretty much defeating the purpose of our discussion. In conclusion, synthetic data is a useful option for fine tuning and addressing edge cases, but significant improvements in model efficacy and minimization of bias still rely upon accessing real world data.
A Better Way: Real Data via PETs-enabled Workflows
PETs protect data while in use. When it comes to AI/ML models, they can also protect the IP of the model being run–”two birds, one stone.” Solutions utilizing PETs provide the option to train models on real, sensitive datasets that weren’t previously accessible due to data privacy and security concerns. This unlocking of dataflows to real data is the best option to reduce bias. But how would it actually work?
For now, the leading options start with a confidential computing environment. Then, an integration with a PETs-based software solution that makes it ready to use out of the box while addressing the data governance and security requirements that aren’t included in a standard trusted execution environment (TEE). With this solution, the models and data are all encrypted before being sent to a secured computing environment. The environment can be hosted anywhere, which is important when addressing certain data localization requirements. This means that both the model IP and the security of input data are maintained during computation–not even the provider of the trusted execution environment has access to the models or data inside of it. The encrypted results are then sent back for review and logs are available for review.
This flow unlocks the best quality data no matter where it is or who has it, creating a path to bias minimization and high-efficacy models we can trust. This flow is also what the EU AI Act was describing in their requirements for an AI regulatory sandbox.
Facilitating Ethical and Legal Compliance
Acquiring good quality, real data is tough. Data privacy and localization requirements immediately limit the datasets that organizations can access. For innovation and growth to occur, data must flow to those who can extract the value from it.
Art 54 of the EU AI Act provides requirements for “high-risk” model types in terms of what must be proven before they can be taken to market. In short, teams will need to use real world data inside of an AI Regulatory Sandbox to show sufficient model efficacy and compliance with all the controls detailed in Title III Chapter 2. The controls include monitoring, transparency, explainability, data security, data protection, data minimization, and model protection–think DevSecOps + Data Ops.
The first challenge will be to find a real-world data set to use–as this is inherently sensitive data for such model types. Without technical guarantees, many organizations may hesitate to trust the model provider with their data or won’t be allowed to do so. In addition, the way the act defines an “AI Regulatory Sandbox” is a challenge in and of itself. Some of the requirements include a guarantee that the data is removed from the system after the model has been run as well as the governance controls, enforcement, and reporting to prove it.
Many organizations have tried using out-of-the-box data clean rooms (DCRs) and trusted execution environments (TEEs). But, on their own, these technologies require significant expertise and work to operationalize and meet data and AI regulatory requirements. DCRs are simpler to use, but not yet useful for more robust AI/ML needs. TEEs are secured servers and still need an integrated collaboration platform to be useful, quickly. This, however, identifies an opportunity for privacy enhancing technology platforms to integrate with TEEs to remove that work, trivializing the setup and use of an AI regulatory sandbox, and therefore, acquisition and use of sensitive data.
By enabling the use of more diverse and comprehensive datasets in a privacy-preserving manner, these technologies help ensure that AI and ML practices comply with ethical standards and legal requirements related to data privacy (e.g., GDPR and EU AI Act in Europe). In summary, while requirements are often met with audible grunts and sighs, these requirements are simply guiding us to building better models that we can trust and rely upon for important data-driven decision making while protecting the privacy of the data subjects used for model development and customization.
0 notes
its-buers-naptime · 1 year ago
Text
“ His name is Planet Destroyer. “ “ So cool ! ! “
Tumblr media
@shinazugawa-bros-week-2024 day 6: pets
100 notes · View notes
melanodis · 1 year ago
Note
you should draw plex charlie and merlot shooting the shit
Tumblr media
merlot doesn't really like her all that much. he can see there's something abnormal about her. the way she moves, the way her eyes glimmer. he swears he can hear clicking and whirring underneath all the background noise of the mall.
Tumblr media
63 notes · View notes
orbleglorb · 4 months ago
Text
i know i've already done one of these, but now that i have a clearer view of what to make and how to make it, it would be helpful if you guys would fill out another shop interest survey! it really does help a lot.
thanks!
13 notes · View notes
smashwolfen · 1 year ago
Text
I'm screaming on the inside I WALK INTO WORK AND BUDDY HANDS ME THIS!!!!!!
Tumblr media
I mentioned I was looking for one and he said he had one and would look for it, and he waltzed in and just dhdjdndnfuxbdkxnsjee
HE DIDNT WANT ANYTHING FOR IT, I SAID I DIDN'T MIND PAYING HIM AND HE GAVE IT TO ME KNOWING IM A NERD AND WOULD TAKE CARE OF IT IM-
RJFHDJDMEKRBDJSKWJEJ
I don't deserve the luck I've been given in my pokemon collection endeavors ;;;w;;;
24 notes · View notes
arytha · 1 year ago
Text
Tumblr media
[ID from ALT: A fullbody digital drawing of my OC, Millenium, standing in front of a mirror that is reflecting her form after her death, Mimi. Millenium's hands are clutched in front of her, insecure, her back turned to the camera. Mimi is posing with confidence, smirking with her arms up and hands pressed against the mirror's surface. Millenium has her blonde hair down, wearing a simple sweater and pleated skirt. Mimi is wearing a flashy outfit, with a dress that fades into a transparent skirt, a seethrough bodysuit underneath, and unattached sleeves. Her hair is pink and pulled into high twintails, with her short bangs dyed black. She's wearing cat ear headphones, with a cat tail peeking out from her skirt. The room reflected in the mirror behind Mimi is glitchy, while the room outside the mirror is dark and dreary. End ID]
21 notes · View notes
scrybeoftheatre · 5 months ago
Text
guys i didnt have time to save any videos from tiktok. i am devastated the only deaths game content i can find was on tiktok. all my squid game content. guys
2 notes · View notes
literalsunhobi · 5 months ago
Text
my brain rn instead of letting me sleep and creating a million theories about jhope’s and bts’ schedules
Tumblr media
4 notes · View notes
sentient-rift · 9 months ago
Note
Some jerk "mooned" me the other day. That's gotta be the weirdest way to find out I was a werewolf...
Tumblr media
"Ha ha ha! That actually counts?! I don't know if I can get BEHIND that! I mean, I would take your word for it, BUTT it's just too ridiculous! I'm going to get to the BOTTOM of this!"
Tumblr media
"HA HA HA HA HA HA!! STOP IT, WELCH!! MY GUT HURTS!!!"
Tumblr media
"I don't understand why we're laughing. What does it mean to moon someone? Did someone steal the energy of the moon and blast it at someone?"
Tumblr media
"Oh, don't worry about it, Duo..."
2 notes · View notes
pierswife · 10 months ago
Text
Little things that I appreciate my pharmacist so much for: noticing that when I did manual refills for my scripts, I'd usually be a few days late doing it and cut it super close to not having enough meds until my next refill, then called one day and asked "Hey I noticed that your script needs to be filled but you haven't called yet, I just wanted to check up on you and make sure you were okay" and THEN asked if I wanted to be added to the auto-refill list so I'd just get a call whenever my meds were ready (I didn't know they did auto-refills cause I had to switch to them cause my old pharmacy went out if business)
Basically what I'm saying is my pharmacist and the pharmacy I go to in general is fucking amazing and I'm very thankful to have them
3 notes · View notes
moonchild-in-blue · 1 year ago
Text
Sorrows and Despair
(just found out one of my favourite ST fanartists has deactivated 😔 really wanted to get one of their pieces back on my queue for later and. well. gone)
4 notes · View notes
corvids-corner · 1 year ago
Text
Wearing full lab safety gear and observing the pit from a safe distance, taking notes on a waterproof notebook
4 notes · View notes
mingareco · 6 months ago
Text
Imagine the poor unknown Samsung Admin who has to work his ass of to cover and hide all the traffic coming through the damn thing.
He knows that if that info gets leaked there will be another crisis, so work for the unnamed and unseen web warrior it is.
The only issue that new Karen manager seems to be weirdly obsessed with this particular mid range Samsung fridge.
Headcanon that the batfam has a Samsung smart refrigerator or whatever it's called, and it is used entirely for doing work while in the kitchen. There has been justice league meetings held on that motherfucker and nuclear threats disengaged.
16K notes · View notes
evenshands · 3 months ago
Text
hey someone stop me from thinking too hard about the dog rescue logistics bc if I do I will fly into a rage
1 note · View note
accountsway · 4 months ago
Text
0 notes
sentient-rift · 1 year ago
Note
M!A Mystery food x curry time
Tumblr media
"Huh...? What does that mean?"
Tumblr media
"I don't know, but I like the sound of it!"
Tumblr media
"Well, as long as this is one of them Magic Anons, let's go crazy with it! How about we have a Curry Making Contest with a curtain mystery food being the main ingredient? And of course, Lanny Boy will be our taste tester!"
Tumblr media
"Hey, not a bad idea, Mxy. I'll get started on my curry right away..."
Tumblr media
"Sorry, Mayl, but that would be way too easy. It won't be any of us along the Curry, but the anons and anyone else out there who wants to join in! Think of it as an extension to that food ask meme we're currently doing."
Tumblr media
"Oh, that sound fun!"
Tumblr media
"So what are you guys waiting for? Start sending in your curries and see how Lanny Boy likes it!"
Tumblr media
"My taste buds are ready!"
(Basically, all you have to do is the same thing with this meme, only add +Curry Contest, and make sure to send in your special curry to Lan.)
2 notes · View notes