#data owner | Explore Tumblr posts and blogs

jcmarchi · 1 year ago

Text

How Bias Will Kill Your AI/ML Strategy and What to Do About It

New Post has been published on https://thedigitalinsider.com/how-bias-will-kill-your-ai-ml-strategy-and-what-to-do-about-it/

How Bias Will Kill Your AI/ML Strategy and What to Do About It

‘Bias’ in models of any type describes a situation in which the model responds inaccurately to prompts or input data because it hasn’t been trained with enough high-quality, diverse data to provide an accurate response. One example would be Apple’s facial recognition phone unlock feature, which failed at a significantly higher rate for people with darker skin complexions as opposed to lighter tones. The model hadn’t been trained on enough images of darker-skinned people. This was a relatively low-risk example of bias but is exactly why the EU AI Act has put forth requirements to prove model efficacy (and controls) before going to market. Models with outputs that impact business, financial, health, or personal situations must be trusted, or they won’t be used.

Tackling Bias with Data

Large Volumes of High-Quality Data

Among many important data management practices, a key component to overcoming and minimizing bias in AI/ML models is to acquire large volumes of high-quality, diverse data. This requires collaboration with multiple organizations that have such data. Traditionally, data acquisition and collaborations are challenged by privacy and/or IP protection concerns–sensitive data can’t be sent to the model owner, and the model owner can’t risk leaking their IP to a data owner. A common workaround is to work with mock or synthetic data, which can be useful but also have limitations compared to using real, full-context data. This is where privacy-enhancing technologies (PETs) provide much-needed answers.

Synthetic Data: Close, but not Quite

Synthetic data is artificially generated to mimic real data. This is hard to do but becoming slightly easier with AI tools. Good quality synthetic data should have the same feature distances as real data, or it won’t be useful. Quality synthetic data can be used to effectively boost the diversity of training data by filling in gaps for smaller, marginalized populations, or for populations that the AI provider simply doesn’t have enough data. Synthetic data can also be used to address edge cases that might be difficult to find in adequate volumes in the real world. Additionally, organizations can generate a synthetic data set to satisfy data residency and privacy requirements that block access to the real data. This sounds great; however, synthetic data is just a piece of the puzzle, not the solution.

One of the obvious limitations of synthetic data is the disconnect from the real world. For example, autonomous vehicles trained solely on synthetic data will struggle with real, unforeseen road conditions. Additionally, synthetic data inherits bias from the real-world data used to generate it–pretty much defeating the purpose of our discussion. In conclusion, synthetic data is a useful option for fine tuning and addressing edge cases, but significant improvements in model efficacy and minimization of bias still rely upon accessing real world data.

A Better Way: Real Data via PETs-enabled Workflows

PETs protect data while in use. When it comes to AI/ML models, they can also protect the IP of the model being run–”two birds, one stone.” Solutions utilizing PETs provide the option to train models on real, sensitive datasets that weren’t previously accessible due to data privacy and security concerns. This unlocking of dataflows to real data is the best option to reduce bias. But how would it actually work?

For now, the leading options start with a confidential computing environment. Then, an integration with a PETs-based software solution that makes it ready to use out of the box while addressing the data governance and security requirements that aren’t included in a standard trusted execution environment (TEE). With this solution, the models and data are all encrypted before being sent to a secured computing environment. The environment can be hosted anywhere, which is important when addressing certain data localization requirements. This means that both the model IP and the security of input data are maintained during computation–not even the provider of the trusted execution environment has access to the models or data inside of it. The encrypted results are then sent back for review and logs are available for review.

This flow unlocks the best quality data no matter where it is or who has it, creating a path to bias minimization and high-efficacy models we can trust. This flow is also what the EU AI Act was describing in their requirements for an AI regulatory sandbox.

Facilitating Ethical and Legal Compliance

Acquiring good quality, real data is tough. Data privacy and localization requirements immediately limit the datasets that organizations can access. For innovation and growth to occur, data must flow to those who can extract the value from it.

Art 54 of the EU AI Act provides requirements for “high-risk” model types in terms of what must be proven before they can be taken to market. In short, teams will need to use real world data inside of an AI Regulatory Sandbox to show sufficient model efficacy and compliance with all the controls detailed in Title III Chapter 2. The controls include monitoring, transparency, explainability, data security, data protection, data minimization, and model protection–think DevSecOps + Data Ops.

The first challenge will be to find a real-world data set to use–as this is inherently sensitive data for such model types. Without technical guarantees, many organizations may hesitate to trust the model provider with their data or won’t be allowed to do so. In addition, the way the act defines an “AI Regulatory Sandbox” is a challenge in and of itself. Some of the requirements include a guarantee that the data is removed from the system after the model has been run as well as the governance controls, enforcement, and reporting to prove it.

Many organizations have tried using out-of-the-box data clean rooms (DCRs) and trusted execution environments (TEEs). But, on their own, these technologies require significant expertise and work to operationalize and meet data and AI regulatory requirements. DCRs are simpler to use, but not yet useful for more robust AI/ML needs. TEEs are secured servers and still need an integrated collaboration platform to be useful, quickly. This, however, identifies an opportunity for privacy enhancing technology platforms to integrate with TEEs to remove that work, trivializing the setup and use of an AI regulatory sandbox, and therefore, acquisition and use of sensitive data.

By enabling the use of more diverse and comprehensive datasets in a privacy-preserving manner, these technologies help ensure that AI and ML practices comply with ethical standards and legal requirements related to data privacy (e.g., GDPR and EU AI Act in Europe). In summary, while requirements are often met with audible grunts and sighs, these requirements are simply guiding us to building better models that we can trust and rely upon for important data-driven decision making while protecting the privacy of the data subjects used for model development and customization.

0 notes

its-buers-naptime · 1 year ago

Text

“ His name is Planet Destroyer. “ “ So cool ! ! “

@shinazugawa-bros-week-2024 day 6: pets

#📼.data #semi weekly reminder sanemi has a pet japanese rhino beetle #he would be one of those owners who name their chill pets after deadly weapons and stuff #shinazugawabrosweek2024 #kny #demon slayer #kimetsu no yaiba #kny fanart #kny sanemi #kny genya #genya shinazugawa #sanemi shinazugawa #shinazugawa brothers #beetles #bugs #fanart #digital fanart

100 notes · View notes

melanodis · 1 year ago

Note

you should draw plex charlie and merlot shooting the shit

merlot doesn't really like her all that much. he can see there's something abnormal about her. the way she moves, the way her eyes glimmer. he swears he can hear clicking and whirring underneath all the background noise of the mall.

#i also think her blueprints were in the egg baby data archive. like something on the tip of your tongue but you can never remember #five nights at freddy's #art.psd #fnaf oc #oc: merlot anderson #charlotte emily #charlie emily #fnaf #pizzaplex owner au #ask.txt #anonymous #mucking queue

63 notes · View notes

orbleglorb · 4 months ago

Text

i know i've already done one of these, but now that i have a clearer view of what to make and how to make it, it would be helpful if you guys would fill out another shop interest survey! it really does help a lot.

thanks!

#sasha lore #idk what to tag these as?#bc if i tag fandoms that would skew the data but also i'd reach target audiences? and potentially get necessary data?#um. i'll go ahead and tag them. sorry if this is spam-y </3 #blaseball #tma #psychonauts #dropout tv #i mean shop promos would go in these tags. so??#i fear i do not have the shop owner's temperament

13 notes · View notes

smashwolfen · 1 year ago

Text

I'm screaming on the inside I WALK INTO WORK AND BUDDY HANDS ME THIS!!!!!!

I mentioned I was looking for one and he said he had one and would look for it, and he waltzed in and just dhdjdndnfuxbdkxnsjee

HE DIDNT WANT ANYTHING FOR IT, I SAID I DIDN'T MIND PAYING HIM AND HE GAVE IT TO ME KNOWING IM A NERD AND WOULD TAKE CARE OF IT IM-

RJFHDJDMEKRBDJSKWJEJ

I don't deserve the luck I've been given in my pokemon collection endeavors ;;;w;;;

#I feel so bad having to delete the Articuno but I can't save him #once you connect with a new game cartridge you delete the previous owners data #Im so sorry birby I wanted to save you ;w;#idgaf about the scuffs on it it works perfectly and thats all that matters to me ;u;#thank you work bud your a real one ;;;w;;;#pokewalker #pokemon heartgold #pokemon soulsilver #pokemon hgss #pokemon #SO I GOT BOTH THIS AND MY GBA SAPPHIRE FOR FREE JUST BY BEING AN OPEN NERD ABOUT POKEMON WHAT IS MY LIFE?!

24 notes · View notes

arytha · 1 year ago

Text

[ID from ALT: A fullbody digital drawing of my OC, Millenium, standing in front of a mirror that is reflecting her form after her death, Mimi. Millenium's hands are clutched in front of her, insecure, her back turned to the camera. Mimi is posing with confidence, smirking with her arms up and hands pressed against the mirror's surface. Millenium has her blonde hair down, wearing a simple sweater and pleated skirt. Mimi is wearing a flashy outfit, with a dress that fades into a transparent skirt, a seethrough bodysuit underneath, and unattached sleeves. Her hair is pink and pulled into high twintails, with her short bangs dyed black. She's wearing cat ear headphones, with a cat tail peeking out from her skirt. The room reflected in the mirror behind Mimi is glitchy, while the room outside the mirror is dark and dreary. End ID]

#Mara's Art #i dont have commentary for the main post for this one. i forgor #Mimi #FINALLY DREW HEEEEER IM SO HAPPY WITH HOW THIS TURNED OUT!!!!#hahahaha #girl to catgirl pipeline!#sometimes u have to die to realize you can be who you want to be. and mimi's trapped in a computer so. its fine #also the necklace around mimi's neck is the same as data's #and ofc. data is her girlfriend and the owner of the computer mimi is currently stuck in. dw abt it

21 notes · View notes

scrybeoftheatre · 5 months ago

Text

guys i didnt have time to save any videos from tiktok. i am devastated the only deaths game content i can find was on tiktok. all my squid game content. guys

#tiktok ban #genuinely devastated #and also so pissed and angry at the fact that we're having social media taken in the country of free speech #“we dont want china taking our data” my ass meta has my damn data and tiktok's not even chinese run #the owner is singaporean #which most people besides the government seem to know #im so #guys i hate it here

2 notes · View notes

literalsunhobi · 5 months ago

Text

my brain rn instead of letting me sleep and creating a million theories about jhope’s and bts’ schedules

#global citizen festival is happening in Brazil in November #there was a weird ass totally unbelievable rumour about him playing in a stadium here in november #but november is too far away and probably busy with group stuff whatever they’ll be doing #also there’s this big festival called the town - from the same owner of rock in rio - that would be great for him #I know none of these are going to happen but still #I wanna hold on this tiny bit of hope #there’s a music journalist who does a poll ever year to see who are the most requested artists to play in brasil #he then takes this data and sends to concert promoters - he’s someone who breaks news about concerts before announcements #hobi was the 10th most requested this year and bts was 5th in a list of over 50 names #every one is begging and crying over here #sorry for ranting #I can’t sleep and I have so much work tomorrow 😭#HOWEVER 👆🏻 I hope all my lovely mutuals get those tickets 🥺✨

4 notes · View notes

sentient-rift · 9 months ago

Note

Some jerk "mooned" me the other day. That's gotta be the weirdest way to find out I was a werewolf...

"Ha ha ha! That actually counts?! I don't know if I can get BEHIND that! I mean, I would take your word for it, BUTT it's just too ridiculous! I'm going to get to the BOTTOM of this!"

"HA HA HA HA HA HA!! STOP IT, WELCH!! MY GUT HURTS!!!"

"I don't understand why we're laughing. What does it mean to moon someone? Did someone steal the energy of the moon and blast it at someone?"

"Oh, don't worry about it, Duo..."

#mystery data (anon)#humor program (funny stuff)#item creation shop owner (welch vineyard)#“battle routine! set!” (lan hikari)#justice knuckle (duo)#deep log navigator (RiCO)#star ocean the last hope #megaman battle network #mega man x dive

2 notes · View notes

pierswife · 10 months ago

Text

Little things that I appreciate my pharmacist so much for: noticing that when I did manual refills for my scripts, I'd usually be a few days late doing it and cut it super close to not having enough meds until my next refill, then called one day and asked "Hey I noticed that your script needs to be filled but you haven't called yet, I just wanted to check up on you and make sure you were okay" and THEN asked if I wanted to be added to the auto-refill list so I'd just get a call whenever my meds were ready (I didn't know they did auto-refills cause I had to switch to them cause my old pharmacy went out if business)

Basically what I'm saying is my pharmacist and the pharmacy I go to in general is fucking amazing and I'm very thankful to have them

#back when I switched to them I was panicking because my old pharmacy just... shut down without telling anyone #so I went to the new place and asked if they could help me or if they knew which place my stuff was transferred to cause I didn't know #and I explained the situation to everyone and then out comes the owner (head pharmacist) and very gently asks how much medication I had left #and if I had enough to get me to monday (cause it was a Saturday and they closed at 3 and they had to finish the transfer process)#because he told me if I didn't that that he'd make sure I had enough to get me to monday and told me he'd work late to make it happen #very sweet and caring old man I think they could tell how distraught I was lgjwodjdid #data log: personal

3 notes · View notes

moonchild-in-blue · 1 year ago

Text

Sorrows and Despair

(just found out one of my favourite ST fanartists has deactivated 😔 really wanted to get one of their pieces back on my queue for later and. well. gone)

#i found them on instagram and they left because of tumblr's ridiculous data issues and the whole ai fiasco #which absolutely fair and valid and good for them #but it's still sad like. damn. at least it's still up. just not good for reblogging #(it was eskainne btw)#their sugar piece still remains my favourite ever like. damn. it's so beautiful.#thank you so much hellsite owners 👍 great job 👍👍 fantastic really 👍👍👍

4 notes · View notes

corvids-corner · 1 year ago

Text

Wearing full lab safety gear and observing the pit from a safe distance, taking notes on a waterproof notebook

#collecting data on The Pit #writes down stuff like “lost phone- owner not id-ed for at least 30 mins”#and “increased activity during popular songs”#“drinks thrown: ~6”

4 notes · View notes

mingareco · 6 months ago

Text

Imagine the poor unknown Samsung Admin who has to work his ass of to cover and hide all the traffic coming through the damn thing.

He knows that if that info gets leaked there will be another crisis, so work for the unnamed and unseen web warrior it is.

The only issue that new Karen manager seems to be weirdly obsessed with this particular mid range Samsung fridge.

Headcanon that the batfam has a Samsung smart refrigerator or whatever it's called, and it is used entirely for doing work while in the kitchen. There has been justice league meetings held on that motherfucker and nuclear threats disengaged.

#dc #dc comics #jason todd #tim drake #dick grayson #damien wayne #bruce wayne #justice league #batfam headcanons #batfamily #no you cant see this fridges data logs ma'am.#why? i am the lead manager and i order you to hand those over now!#ma'am the whole log is corrupted. we contacted the owners about it and they refused the repairs.

16K notes · View notes

evenshands · 3 months ago

Text

hey someone stop me from thinking too hard about the dog rescue logistics bc if I do I will fly into a rage

#my animal rescue brain is like. this is SO WRONG for so manyreasons #like i dont know how shelters work in america but?????#DATA PROTECTION??? DO U GUYS HAVE THAT???#youre gonna let the owners of a dog just battle it out themselves?????#girl if we'd done that with any of our stray cats the govermment would have been AT OUR FREAKING DOOR #i dont even work in rescue anymore this cant be healthy that im still thinking like this #911 king of realistic portrayals as we know

1 note · View note

accountsway · 4 months ago

Text

#real estate #Small businesses #properties #business owners #start ups #online accounting #virtual accounting #online data entry services #small business accountant #bookkeeping in USA

0 notes

sentient-rift · 1 year ago

Note

M!A Mystery food x curry time

"Huh...? What does that mean?"

"I don't know, but I like the sound of it!"

"Well, as long as this is one of them Magic Anons, let's go crazy with it! How about we have a Curry Making Contest with a curtain mystery food being the main ingredient? And of course, Lanny Boy will be our taste tester!"

"Hey, not a bad idea, Mxy. I'll get started on my curry right away..."

"Sorry, Mayl, but that would be way too easy. It won't be any of us along the Curry, but the anons and anyone else out there who wants to join in! Think of it as an extension to that food ask meme we're currently doing."

"Oh, that sound fun!"

"So what are you guys waiting for? Start sending in your curries and see how Lanny Boy likes it!"

"My taste buds are ready!"

(Basically, all you have to do is the same thing with this meme, only add +Curry Contest, and make sure to send in your special curry to Lan.)

#Execute!#Battle Routine! Set!#m!a: mystery food curry time!#mini game (ask meme)#mystery data (anon)#(MegaMan.EXE)#(Lan Hikari)#Mr. mxyzptlk causes mischief (ic)#mayl is here to help (ic)#item creation shop owner (welch vineyard)#megaman battle network #superman the animated series #star ocean the last hope

2 notes · View notes