#Data Science Learning

Explore tagged Tumblr posts

Visit Tumblr Blog

Explore Tumblr blogs with no restrictions, modern design and the best experience.

Last Seen Tumblr Blogs

clotsfa

Untitled

6 posts

en-pin-431x431

1 pavillon de jardin

1 post

emtcalifornia

Get certified in Emergency Medical Technician (EMT) training wit

4 posts

okokyezzir

TopBBWs

161 posts

nutboy-1

Untitled

0 posts

Fun Fact

28.6 is the average number of monthly visits per US mobile user.

appwarstechh · 8 days ago

Text

https://appwarstechnologies.com/data-science-training-in-noida/

#data science training #data science course #data science learning #data science certifica #data science institute

2 notes · View notes

strategiadvizo · 1 year ago

Text

Transforming Education: Unleash the Potential of Your Students with Strategia Advizo's Vocational Courses

Introduction: In today’s rapidly evolving world, the traditional education system faces the challenge of keeping up with the pace of technological advancements and changing job landscapes. At Strategia Advizo, we believe in empowering the next generation with the skills they need to navigate and succeed in the 21st century. Our suite of vocational courses, designed specifically for CBSE schools…

View On WordPress

#21st Century Education #Artificial Intelligence Education #CBSE Schools #Coding for Kids #Data Science Learning #Design Thinking for Students #Digital Citizenship #Digital Marketing Skills #Educational Innovation #Financial Literacy Courses #Future Skills #Skill Development #Strategia Advizo #Student Empowerment #Vocational Training

0 notes

univaiseo · 2 years ago

Text

Data Science Learning | Univ AI

Univ AI is an emerging platform that is revolutionizing the way we learn data science. With its interactive and immersive approach, it takes learning to a whole new level. Unlike traditional online courses that focus on theory and lectures, Univ AI emphasizes hands-on experience through real-world projects.

The beauty of Univ AI lies in its ability to foster collaboration among learners. It provides a unique opportunity for individuals with different backgrounds and experiences to come together and work on challenging data science problems. This not only enhances teamwork skills but also exposes learners to diverse perspectives, ultimately leading to more innovative solutions.

Furthermore, Univ AI understands the importance of practical application in the learning process. It offers access to cutting-edge tools and technologies, enabling learners to experiment with real datasets and develop their analytical skills. By combining theory with practice in this way, Univ AI ensures that students are well-prepared for the demands of the ever-evolving field of data science.

In conclusion, if you are looking for a dynamic and engaging learning experience in data science, look no further than Univ AI. With its focus on hands-on collaboration and practical application, this platform equips individuals with the skills needed to succeed in today's data-driven world. Join Univ AI today and take your first step towards becoming a proficient data scientist!

For more details, visit https://www.univ.ai/contact-us

#Data Science learning

0 notes

mostlysignssomeportents · 2 years ago

Text

The surprising truth about data-driven dictatorships

Here’s the “dictator’s dilemma”: they want to block their country’s frustrated elites from mobilizing against them, so they censor public communications; but they also want to know what their people truly believe, so they can head off simmering resentments before they boil over into regime-toppling revolutions.

These two strategies are in tension: the more you censor, the less you know about the true feelings of your citizens and the easier it will be to miss serious problems until they spill over into the streets (think: the fall of the Berlin Wall or Tunisia before the Arab Spring). Dictators try to square this circle with things like private opinion polling or petition systems, but these capture a small slice of the potentially destabiziling moods circulating in the body politic.

Enter AI: back in 2018, Yuval Harari proposed that AI would supercharge dictatorships by mining and summarizing the public mood — as captured on social media — allowing dictators to tack into serious discontent and diffuse it before it erupted into unequenchable wildfire:

https://www.theatlantic.com/magazine/archive/2018/10/yuval-noah-harari-technology-tyranny/568330/

Harari wrote that “the desire to concentrate all information and power in one place may become [dictators] decisive advantage in the 21st century.” But other political scientists sharply disagreed. Last year, Henry Farrell, Jeremy Wallace and Abraham Newman published a thoroughgoing rebuttal to Harari in Foreign Affairs:

https://www.foreignaffairs.com/world/spirals-delusion-artificial-intelligence-decision-making

They argued that — like everyone who gets excited about AI, only to have their hopes dashed — dictators seeking to use AI to understand the public mood would run into serious training data bias problems. After all, people living under dictatorships know that spouting off about their discontent and desire for change is a risky business, so they will self-censor on social media. That’s true even if a person isn’t afraid of retaliation: if you know that using certain words or phrases in a post will get it autoblocked by a censorbot, what’s the point of trying to use those words?

The phrase “Garbage In, Garbage Out” dates back to 1957. That’s how long we’ve known that a computer that operates on bad data will barf up bad conclusions. But this is a very inconvenient truth for AI weirdos: having given up on manually assembling training data based on careful human judgment with multiple review steps, the AI industry “pivoted” to mass ingestion of scraped data from the whole internet.

But adding more unreliable data to an unreliable dataset doesn’t improve its reliability. GIGO is the iron law of computing, and you can’t repeal it by shoveling more garbage into the top of the training funnel:

https://memex.craphound.com/2018/05/29/garbage-in-garbage-out-machine-learning-has-not-repealed-the-iron-law-of-computer-science/

When it comes to “AI” that’s used for decision support — that is, when an algorithm tells humans what to do and they do it — then you get something worse than Garbage In, Garbage Out — you get Garbage In, Garbage Out, Garbage Back In Again. That’s when the AI spits out something wrong, and then another AI sucks up that wrong conclusion and uses it to generate more conclusions.

To see this in action, consider the deeply flawed predictive policing systems that cities around the world rely on. These systems suck up crime data from the cops, then predict where crime is going to be, and send cops to those “hotspots” to do things like throw Black kids up against a wall and make them turn out their pockets, or pull over drivers and search their cars after pretending to have smelled cannabis.

The problem here is that “crime the police detected” isn’t the same as “crime.” You only find crime where you look for it. For example, there are far more incidents of domestic abuse reported in apartment buildings than in fully detached homes. That’s not because apartment dwellers are more likely to be wife-beaters: it’s because domestic abuse is most often reported by a neighbor who hears it through the walls.

So if your cops practice racially biased policing (I know, this is hard to imagine, but stay with me /s), then the crime they detect will already be a function of bias. If you only ever throw Black kids up against a wall and turn out their pockets, then every knife and dime-bag you find in someone’s pockets will come from some Black kid the cops decided to harass.

That’s life without AI. But now let’s throw in predictive policing: feed your “knives found in pockets” data to an algorithm and ask it to predict where there are more knives in pockets, and it will send you back to that Black neighborhood and tell you do throw even more Black kids up against a wall and search their pockets. The more you do this, the more knives you’ll find, and the more you’ll go back and do it again.

This is what Patrick Ball from the Human Rights Data Analysis Group calls “empiricism washing”: take a biased procedure and feed it to an algorithm, and then you get to go and do more biased procedures, and whenever anyone accuses you of bias, you can insist that you’re just following an empirical conclusion of a neutral algorithm, because “math can’t be racist.”

HRDAG has done excellent work on this, finding a natural experiment that makes the problem of GIGOGBI crystal clear. The National Survey On Drug Use and Health produces the gold standard snapshot of drug use in America. Kristian Lum and William Isaac took Oakland’s drug arrest data from 2010 and asked Predpol, a leading predictive policing product, to predict where Oakland’s 2011 drug use would take place.

[Image ID: (a) Number of drug arrests made by Oakland police department, 2010. (1) West Oakland, (2) International Boulevard. (b) Estimated number of drug users, based on 2011 National Survey on Drug Use and Health]

Then, they compared those predictions to the outcomes of the 2011 survey, which shows where actual drug use took place. The two maps couldn’t be more different:

https://rss.onlinelibrary.wiley.com/doi/full/10.1111/j.1740-9713.2016.00960.x

Predpol told cops to go and look for drug use in a predominantly Black, working class neighborhood. Meanwhile the NSDUH survey showed the actual drug use took place all over Oakland, with a higher concentration in the Berkeley-neighboring student neighborhood.

What’s even more vivid is what happens when you simulate running Predpol on the new arrest data that would be generated by cops following its recommendations. If the cops went to that Black neighborhood and found more drugs there and told Predpol about it, the recommendation gets stronger and more confident.

In other words, GIGOGBI is a system for concentrating bias. Even trace amounts of bias in the original training data get refined and magnified when they are output though a decision support system that directs humans to go an act on that output. Algorithms are to bias what centrifuges are to radioactive ore: a way to turn minute amounts of bias into pluripotent, indestructible toxic waste.

There’s a great name for an AI that’s trained on an AI’s output, courtesy of Jathan Sadowski: “Habsburg AI.”

And that brings me back to the Dictator’s Dilemma. If your citizens are self-censoring in order to avoid retaliation or algorithmic shadowbanning, then the AI you train on their posts in order to find out what they’re really thinking will steer you in the opposite direction, so you make bad policies that make people angrier and destabilize things more.

Or at least, that was Farrell(et al)’s theory. And for many years, that’s where the debate over AI and dictatorship has stalled: theory vs theory. But now, there’s some empirical data on this, thanks to the “The Digital Dictator’s Dilemma,” a new paper from UCSD PhD candidate Eddie Yang:

https://www.eddieyang.net/research/DDD.pdf

Yang figured out a way to test these dueling hypotheses. He got 10 million Chinese social media posts from the start of the pandemic, before companies like Weibo were required to censor certain pandemic-related posts as politically sensitive. Yang treats these posts as a robust snapshot of public opinion: because there was no censorship of pandemic-related chatter, Chinese users were free to post anything they wanted without having to self-censor for fear of retaliation or deletion.

Next, Yang acquired the censorship model used by a real Chinese social media company to decide which posts should be blocked. Using this, he was able to determine which of the posts in the original set would be censored today in China.

That means that Yang knows that the “real” sentiment in the Chinese social media snapshot is, and what Chinese authorities would believe it to be if Chinese users were self-censoring all the posts that would be flagged by censorware today.

From here, Yang was able to play with the knobs, and determine how “preference-falsification” (when users lie about their feelings) and self-censorship would give a dictatorship a misleading view of public sentiment. What he finds is that the more repressive a regime is — the more people are incentivized to falsify or censor their views — the worse the system gets at uncovering the true public mood.

What’s more, adding additional (bad) data to the system doesn’t fix this “missing data” problem. GIGO remains an iron law of computing in this context, too.

But it gets better (or worse, I guess): Yang models a “crisis” scenario in which users stop self-censoring and start articulating their true views (because they’ve run out of fucks to give). This is the most dangerous moment for a dictator, and depending on the dictatorship handles it, they either get another decade or rule, or they wake up with guillotines on their lawns.

But “crisis” is where AI performs the worst. Trained on the “status quo” data where users are continuously self-censoring and preference-falsifying, AI has no clue how to handle the unvarnished truth. Both its recommendations about what to censor and its summaries of public sentiment are the least accurate when crisis erupts.

But here’s an interesting wrinkle: Yang scraped a bunch of Chinese users’ posts from Twitter — which the Chinese government doesn’t get to censor (yet) or spy on (yet) — and fed them to the model. He hypothesized that when Chinese users post to American social media, they don’t self-censor or preference-falsify, so this data should help the model improve its accuracy.

He was right — the model got significantly better once it ingested data from Twitter than when it was working solely from Weibo posts. And Yang notes that dictatorships all over the world are widely understood to be scraping western/northern social media.

But even though Twitter data improved the model’s accuracy, it was still wildly inaccurate, compared to the same model trained on a full set of un-self-censored, un-falsified data. GIGO is not an option, it’s the law (of computing).

Writing about the study on Crooked Timber, Farrell notes that as the world fills up with “garbage and noise” (he invokes Philip K Dick’s delighted coinage “gubbish”), “approximately correct knowledge becomes the scarce and valuable resource.”

https://crookedtimber.org/2023/07/25/51610/

This “probably approximately correct knowledge” comes from humans, not LLMs or AI, and so “the social applications of machine learning in non-authoritarian societies are just as parasitic on these forms of human knowledge production as authoritarian governments.”

The Clarion Science Fiction and Fantasy Writers’ Workshop summer fundraiser is almost over! I am an alum, instructor and volunteer board member for this nonprofit workshop whose alums include Octavia Butler, Kim Stanley Robinson, Bruce Sterling, Nalo Hopkinson, Kameron Hurley, Nnedi Okorafor, Lucius Shepard, and Ted Chiang! Your donations will help us subsidize tuition for students, making Clarion — and sf/f — more accessible for all kinds of writers.

Libro.fm is the indie-bookstore-friendly, DRM-free audiobook alternative to Audible, the Amazon-owned monopolist that locks every book you buy to Amazon forever. When you buy a book on Libro, they share some of the purchase price with a local indie bookstore of your choosing (Libro is the best partner I have in selling my own DRM-free audiobooks!). As of today, Libro is even better, because it’s available in five new territories and currencies: Canada, the UK, the EU, Australia and New Zealand!

[Image ID: An altered image of the Nuremberg rally, with ranked lines of soldiers facing a towering figure in a many-ribboned soldier's coat. He wears a high-peaked cap with a microchip in place of insignia. His head has been replaced with the menacing red eye of HAL9000 from Stanley Kubrick's '2001: A Space Odyssey.' The sky behind him is filled with a 'code waterfall' from 'The Matrix.']

Image: Cryteria (modified) https://commons.wikimedia.org/wiki/File:HAL9000.svg

CC BY 3.0 https://creativecommons.org/licenses/by/3.0/deed.en

—

Raimond Spekking (modified) https://commons.wikimedia.org/wiki/File:Acer_Extensa_5220_-_Columbia_MB_06236-1N_-_Intel_Celeron_M_530_-_SLA2G_-_in_Socket_479-5029.jpg

CC BY-SA 4.0 https://creativecommons.org/licenses/by-sa/4.0/deed.en

—

Russian Airborne Troops (modified) https://commons.wikimedia.org/wiki/File:Vladislav_Achalov_at_the_Airborne_Troops_Day_in_Moscow_%E2%80%93_August_2,_2008.jpg

“Soldiers of Russia” Cultural Center (modified) https://commons.wikimedia.org/wiki/File:Col._Leonid_Khabarov_in_an_everyday_service_uniform.JPG

CC BY-SA 3.0 https://creativecommons.org/licenses/by-sa/3.0/deed.en

833 notes · View notes

i-was-today-years-old-when · 1 year ago

Text

i learned that the Myers-Briggs has no scientific basis whatsoever (x)

#today i learned #facts #truth #science #data

333 notes · View notes

someprettyname · 1 year ago

Text

Okay but I am just starting to notice how fucking talented REO is and how no one talks about it?! That dude had probably been playing soccer for the shortest time there YET he was on the top tier team in the building. The team v REO could plan the games, and read the plays WAY better than team z isagi. And not only his soccer aptitude was high to begin with (i mean they way he won that match back in high school with those giants" team [yep, the one where he got spotted] was damn impressive to say the least) he also had enough skills to manipulate the team to become his....slaves. In a way.

Man.

Only if he didn't start out as "I need nagi to win the football cup" mentality he'd probably be called the prodigy. Which, i don't see why he doesn't deserve that tag. Being a jack of all trades is damn tough and he does a pretty good job at it. Those "fuck being jack of all trade, be master of one" preachers ARE AFRAID AF of him because he breaking through the "you can't be perfect at everything" stereotypes.

Why do y'all not talk more about how his copy technique is crazy?! 😭 He noticed aiku"s movements ONCE and copied it upto 99% accuracy. Like damn. His concept grasping skill is 📈

#He reminds me of.....me. Actually. Remember that bio data thingy where he said his strongest point is being able to learn things quickly #And weak point is that he gets bored of it too quick because it's too easy to master it? 😭 I've felt that too. I watch high level science-#-lectures at 2.3x-3x speed 😭 I used to sleep off in school classes because the pace was damn slow. Duh. Anyways. (does it sound like I'm-#-flexing? Damn. I got too emotional.)#reo mikage #blue lock #bllk #Ranting #Ep nagi #blue lock manga #mikage reo

85 notes · View notes

hackeocafe · 5 months ago

Text

youtube

How To Learn Math for Machine Learning FAST (Even With Zero Math Background)

I dropped out of high school and managed to became an Applied Scientist at Amazon by self-learning math (and other ML skills). In this video I'll show you exactly how I did it, sharing the resources and study techniques that worked for me, along with practical advice on what math you actually need (and don't need) to break into machine learning and data science.

#How To Learn Math for Machine Learning #machine learning #free education #education #youtube #technology #educate yourselves #educate yourself #tips and tricks #software engineering #data science #artificial intelligence #data analytics #data science course #math #mathematics #Youtube

21 notes · View notes

siph-by-induction · 3 months ago

Text

wanted to learn some r so i started doing the cs50 r course. i was on a godawful school chromebook, and they've made it so you can't install linux, so i can't use vs code. rstudio cloud to the rescue, i sign up to rstudio cloud with my school email, write my first little program and save it in rstudio cloud.

i go back today. i want to carry on with my thing. i try to sign in to rstudio cloud.

"Your administrator has not approved access for this app."

WHAT THE FUCK DO YOU MEAN!!!!!! I WAS USING IT ON WEDNESDAY!!!!!!!!! LET ME SEE MY TINY PROGRAM PLEASEEE OH MY FUCKING GOD

#siph speaks #im SO salty about the school chromebook #im now on my normal laptop at home #so now i am gonna use vs code #but im not gonna be able to work on it at school >:(((#GRRRRRRRRRRRRRRRRRRRRRRRR #i pressed the “request access” button extremely forcefully but idk if the it team sees these things #fucking blocking access to my ide :(#so now i have to go set up vs code to work with r. or download rstudio desktop.#but if i do that then i can't work on it at school...#you'd think it's in their best interests to have me learning random data science as well #like do they not want their students going Above and Beyond and Exploring Around The Subject????#this is a very solvable problem tbf im just pissed about it #my approximately 5 lines of code... locked away from me...

16 notes · View notes

datascienceunicorn · 9 months ago

Text

The Data Scientist Handbook 2024

HT @dataelixir

#data science #data scientist #data scientists #machine learning #analytics #data analytics #artificial intelligence

18 notes · View notes

mindblowingscience · 1 year ago

Text

Cornell quantum researchers have detected an elusive phase of matter, called the Bragg glass phase, using large volumes of X-ray data and a new machine learning data analysis tool. The discovery settles a long-standing question of whether this almost–but not quite–ordered state of Bragg glass can exist in real materials. The paper, "Bragg glass signatures in PdxErTe3 with X-ray diffraction Temperature Clustering (X-TEC)," is published in Nature Physics. The lead author is Krishnanand Madhukar Mallayya, a postdoctoral researcher in the Department of Physics in the College of Arts and Sciences (A&S). Eun-Ah Kim, professor of physics (A&S), is the corresponding author. The research was conducted in collaboration with scientists at Argonne National Laboratory and at Stanford University.

#Science #Physics #Quantum Physics #Data Analysis #AI #Artifical Intelligence #Machine Learning

43 notes · View notes

datasciencewithmohsin · 5 months ago

Text

Simple Linear Regression in Data Science and machine learning

Simple linear regression is one of the most important techniques in data science and machine learning. It is the foundation of many statistical and machine learning models. Even though it is simple, its concepts are widely applicable in predicting outcomes and understanding relationships between variables.

This article will help you learn about:

1. What is simple linear regression and why it matters.

2. The step-by-step intuition behind it.

3. The math of finding slope() and intercept().

4. Simple linear regression coding using Python.

5. A practical real-world implementation.

If you are new to data science or machine learning, don’t worry! We will keep things simple so that you can follow along without any problems.

What is simple linear regression?

Simple linear regression is a method to model the relationship between two variables:

1. Independent variable (X): The input, also called the predictor or feature.

2. Dependent Variable (Y): The output or target value we want to predict.

The main purpose of simple linear regression is to find a straight line (called the regression line) that best fits the data. This line minimizes the error between the actual and predicted values.

The mathematical equation for the line is:

Y = mX + b

: The predicted values.

: The slope of the line (how steep it is).

: The intercept (the value of when).

Why use simple linear regression?

click here to read more https://datacienceatoz.blogspot.com/2025/01/simple-linear-regression-in-data.html

#artificial intelligence #bigdata #books #machine learning #machinelearning #programming #python #science #skills #big data #linear algebra #linear b #slope #interception

6 notes · View notes

i-was-today-years-old-when · 2 years ago

Text

i learned that in the US, 38% of adults between the ages of 25 and 54 don't have a partner, compared to 29% in 1990. Also, 39% of men lack a partner, compared to 36% of women; in 1990, men and women within that age group were equally likely to be single (x)

#today i learned #facts #truth #science #data #relationships

101 notes · View notes

kissycat · 6 months ago

Text

>nothing to do for like 3 days

>starts teaching myself a new programming language out of boredom

#it's Julia and it will replace R for data science in the next like... decade or so #however long it takes for university profs and tutors to catch on and learn it #which could be pretty slow #p

8 notes · View notes