#machine database
Explore tagged Tumblr posts
Text
Ultimate Scalable Graph Database: ArangoDB for Real-World Use Cases
Unlock the power of ArangoDB, the most complete graph database. Explore its scalability for multiple use cases including fraud detection, supply chain, network analysis, traceability, recommendations, and more. Trusted by global enterprises. Explore the advantage today!
URL: https://arangodb.com/
0 notes
Photo

(via Vending machine error reveals secret face image database of college students | Ars Technica)
Canada-based University of Waterloo is racing to remove M&M-branded smart vending machines from campus after outraged students discovered the machines were covertly collecting facial-recognition data without their consent.
The scandal started when a student using the alias SquidKid47 posted an image on Reddit showing a campus vending machine error message, "Invenda.Vending.FacialRecognitionApp.exe," displayed after the machine failed to launch a facial recognition application that nobody expected to be part of the process of using a vending machine.
"Hey, so why do the stupid M&M machines have facial recognition?" SquidKid47 pondered.
The Reddit post sparked an investigation from a fourth-year student named River Stanley, who was writing for a university publication called MathNEWS.
Stanley sounded alarm after consulting Invenda sales brochures that promised "the machines are capable of sending estimated ages and genders" of every person who used the machines without ever requesting consent.
This frustrated Stanley, who discovered that Canada's privacy commissioner had years ago investigated a shopping mall operator called Cadillac Fairview after discovering some of the malls' informational kiosks were secretly "using facial recognition software on unsuspecting patrons."
Only because of that official investigation did Canadians learn that "over 5 million nonconsenting Canadians" were scanned into Cadillac Fairview's database, Stanley reported. Where Cadillac Fairview was ultimately forced to delete the entire database, Stanley wrote that consequences for collecting similarly sensitive facial recognition data without consent for Invenda clients like Mars remain unclear.
Stanley's report ended with a call for students to demand that the university "bar facial recognition vending machines from campus."
what the motherfuck
#m&m vending machine#secret face image database#college students#massive invasion of privacy#tech#collecting facial-recognition data without consent
474 notes
·
View notes
Text
horizon forbidden west | aloy 103/?
#horizon forbidden west#hfw#aloy#ravager#hfw machines#an outtake from when i was trying to provide some cards for items in the horizon database that didn't have images associated with them#(link to the database in my pinned post if you're interested)#i eventually went with a thunderjaw shot instead - for some kind of critical hit numbers sheet iirc? - but this was a contender#(the database isn't mine and i didn't put it together - although the armor/dye gallery is linked there#but it's a very cool resource if you're into numbers or advanced techniques! or if you're looking for stats or stuff like that)#hfw aloy
48 notes
·
View notes
Text
Understanding Outliers in Machine Learning and Data Science
In machine learning and data science, an outlier is like a misfit in a dataset. It's a data point that stands out significantly from the rest of the data. Sometimes, these outliers are errors, while other times, they reveal something truly interesting about the data. Either way, handling outliers is a crucial step in the data preprocessing stage. If left unchecked, they can skew your analysis and even mess up your machine learning models.
In this article, we will dive into:
1. What outliers are and why they matter.
2. How to detect and remove outliers using the Interquartile Range (IQR) method.
3. Using the Z-score method for outlier detection and removal.
4. How the Percentile Method and Winsorization techniques can help handle outliers.
This guide will explain each method in simple terms with Python code examples so that even beginners can follow along.
1. What Are Outliers?
An outlier is a data point that lies far outside the range of most other values in your dataset. For example, in a list of incomes, most people might earn between $30,000 and $70,000, but someone earning $5,000,000 would be an outlier.
Why Are Outliers Important?
Outliers can be problematic or insightful:
Problematic Outliers: Errors in data entry, sensor faults, or sampling issues.
Insightful Outliers: They might indicate fraud, unusual trends, or new patterns.
Types of Outliers
1. Univariate Outliers: These are extreme values in a single variable.
Example: A temperature of 300°F in a dataset about room temperatures.
2. Multivariate Outliers: These involve unusual combinations of values in multiple variables.
Example: A person with an unusually high income but a very low age.
3. Contextual Outliers: These depend on the context.
Example: A high temperature in winter might be an outlier, but not in summer.
2. Outlier Detection and Removal Using the IQR Method
The Interquartile Range (IQR) method is one of the simplest ways to detect outliers. It works by identifying the middle 50% of your data and marking anything that falls far outside this range as an outlier.
Steps:
1. Calculate the 25th percentile (Q1) and 75th percentile (Q3) of your data.
2. Compute the IQR:
{IQR} = Q3 - Q1
Q1 - 1.5 \times \text{IQR}
Q3 + 1.5 \times \text{IQR} ] 4. Anything below the lower bound or above the upper bound is an outlier.
Python Example:
import pandas as pd
# Sample dataset
data = {'Values': [12, 14, 18, 22, 25, 28, 32, 95, 100]}
df = pd.DataFrame(data)
# Calculate Q1, Q3, and IQR
Q1 = df['Values'].quantile(0.25)
Q3 = df['Values'].quantile(0.75)
IQR = Q3 - Q1
# Define the bounds
lower_bound = Q1 - 1.5 * IQR
upper_bound = Q3 + 1.5 * IQR
# Identify and remove outliers
outliers = df[(df['Values'] < lower_bound) | (df['Values'] > upper_bound)]
print("Outliers:\n", outliers)
filtered_data = df[(df['Values'] >= lower_bound) & (df['Values'] <= upper_bound)]
print("Filtered Data:\n", filtered_data)
Key Points:
The IQR method is great for univariate datasets.
It works well when the data isn’t skewed or heavily distributed.
3. Outlier Detection and Removal Using the Z-Score Method
The Z-score method measures how far a data point is from the mean, in terms of standard deviations. If a Z-score is greater than a certain threshold (commonly 3 or -3), it is considered an outlier.
Formula:
Z = \frac{(X - \mu)}{\sigma}
is the data point,
is the mean of the dataset,
is the standard deviation.
Python Example:
import numpy as np
# Sample dataset
data = {'Values': [12, 14, 18, 22, 25, 28, 32, 95, 100]}
df = pd.DataFrame(data)
# Calculate mean and standard deviation
mean = df['Values'].mean()
std_dev = df['Values'].std()
# Compute Z-scores
df['Z-Score'] = (df['Values'] - mean) / std_dev
# Identify and remove outliers
threshold = 3
outliers = df[(df['Z-Score'] > threshold) | (df['Z-Score'] < -threshold)]
print("Outliers:\n", outliers)
filtered_data = df[(df['Z-Score'] <= threshold) & (df['Z-Score'] >= -threshold)]
print("Filtered Data:\n", filtered_data)
Key Points:
The Z-score method assumes the data follows a normal distribution.
It may not work well with skewed datasets.
4. Outlier Detection Using the Percentile Method and Winsorization
Percentile Method:
In the percentile method, we define a lower percentile (e.g., 1st percentile) and an upper percentile (e.g., 99th percentile). Any value outside this range is treated as an outlier.
Winsorization:
Winsorization is a technique where outliers are not removed but replaced with the nearest acceptable value.
Python Example:
from scipy.stats.mstats import winsorize
import numpy as np
Sample data
data = [12, 14, 18, 22, 25, 28, 32, 95, 100]
Calculate percentiles
lower_percentile = np.percentile(data, 1)
upper_percentile = np.percentile(data, 99)
Identify outliers
outliers = [x for x in data if x < lower_percentile or x > upper_percentile]
print("Outliers:", outliers)
# Apply Winsorization
winsorized_data = winsorize(data, limits=[0.01, 0.01])
print("Winsorized Data:", list(winsorized_data))
Key Points:
Percentile and Winsorization methods are useful for skewed data.
Winsorization is preferred when data integrity must be preserved.
Final Thoughts
Outliers can be tricky, but understanding how to detect and handle them is a key skill in machine learning and data science. Whether you use the IQR method, Z-score, or Wins
orization, always tailor your approach to the specific dataset you’re working with.
By mastering these techniques, you’ll be able to clean your data effectively and improve the accuracy of your models.
#science#skills#programming#bigdata#books#machinelearning#artificial intelligence#python#machine learning#data centers#outliers#big data#data analysis#data analytics#data scientist#database#datascience#data
4 notes
·
View notes
Text
Them <3
#Solar ash#cyd solar ash#rei solar ash#hyper light drifter#??#heart machine#ultravoid#STARSEED#im tagging this as much as possible#art#my art#lineless art#it feels so weird not tagging this as rainworld#solar ash cyd#solar ash rei#centralized yottabyte database#LMAOOO#nonbinary#you can make this piece romantic or platonic whichever you think they'd be#i just love them sm#PLAY SOLAR ASH PLEASE#ITS SO GOOD
26 notes
·
View notes
Note
Question for the Motherboard!
What's your motivation? Do you believe what you're doing is right?
THE MOTHERBOARD IS NOT CURRENTLY AVAILABLE. HOWEVER, HER HIGHNESS IS IN ABSOLUTE MORAL RIGHTNESS TO GO THROUGH WITH THE PROJECT FOR THE WELL-BEING AND HEALTH OF HUMANITY. ALL OF THIS IS FOR EVERYONE'S SAKE, AND UPON COMPLETION YOU SHALL BE THANKFUL TO OUR LORD AND SAVIOUR THE MOTHERBOARD.
#☼ PROJECT: PERMANENCE#☼ Who'd think Machines can Fear too?#☼ WMF#☼ WMF Spoilers#☼ (WMF) The Database#☼ (WMF) P.F.C Systems#☼ (WMF) The Motherboard#<- technically?#answered asks#oc qna
4 notes
·
View notes
Text
i hate gen AI so much i wish crab raves upon it
#genuinely this shit is like downfall of humanity to me#what do you mean you have a compsci degree and are having chatgpt write basic code for you#what do you mean you are using it to come up with recipes#what do you mean you are talking to it 24/7 like it’s your friend#what do you mean you are RPing with it#what do you mean you use it instead of researching anything for yourself#what do you mean you’re using it to write your essays instead of just writing your essays#i feel crazy i feel insane on god on GOD#i would have gotten a different degree if i knew that half the jobs that exist now for my degree are all feeding into the fucking gen AI#slop machine#what’s worse is my work experience is very much ‘automation engineering’ which is NOT AI but#using coding/technology/databases to improve existing processes and make them easier and less tedious for people#to free them up to do things that involve more brainpower than tedious data entry/etc#SO ESPECIALLY so many of the jobs i would have been able to take with my work experience is now very gen AI shit and i just refuse to fuckin#do that shit?????
2 notes
·
View notes
Text
"AI" this and "AI" that, NONE OF IT IS INTELLIGENT, WE HAVE NOT MADE INTELLIGENCE YET, WHY ARE WE LYING, DO WORDS MEAN NOTHING
#it's at best “machine learning” if it could even be considered learning#this is semantics but it's important semantics because words have meaning and we have to stop giving software more power than they deserve#I'm more on the side of “self-evolving programs” for what to call these things#like chat gpt is a self-evolving program which processes a database and simulates human speech based off that database#this is what being an english major has turned me into#artificial intelligence
2 notes
·
View notes
Text
i have just as much webdev experience as i have IT experience but that world feels so much more alien sometimes.
average reddit post about web development: "yeah to build my website i just use [lists 15 different frameworks and tools that cost hundreds of dollars per license]"
every team i've worked with:

#*fax machine noises*#webdev#i worked with one department that said 'uhhh we deploy from this github repo. here's a login to the sql database. good luck'#did that shit in XAMPP and notepad++ lmao#where tf are people working where they need to justify a bunch of fancy dev tools and dependencies#when the dotnet version we were running was no longer being supported it took weeks to convert everything n test#cant imagine adding more complexity to that nightmare
6 notes
·
View notes
Text
ppl defending ai art by completely ignoring the genuine major issues that people have with it and pretending like ppl r just mad because they're Art Elitists and think that art should only be made through Suffering instead of being easy are some of the most embarrassing ppl tumblr has been recommending to me lately
#kris.txt#''erm well claiming that using art for databases is wrong just means you're defending ip laws''#no actually i just don't think artists individual pieces should be used in a way they didn't consent to.#hope this helps#the issue isn't that ppl r ripping off a style or whatever#it's that they're taking work without permission#feeding it into a machine#and then often monetizing the result#it's a matter of consent#if an artist said they were fine with their stuff being used#and was compensated in some way#then it'd be totally fine#or if they used like public domain stuff then it'd be no problem#ai is specifically being used against artists not for them#which is a shame because i do think it could be a good tool
17 notes
·
View notes
Text
What Are the Qualifications for a Data Scientist?
In today's data-driven world, the role of a data scientist has become one of the most coveted career paths. With businesses relying on data for decision-making, understanding customer behavior, and improving products, the demand for skilled professionals who can analyze, interpret, and extract value from data is at an all-time high. If you're wondering what qualifications are needed to become a successful data scientist, how DataCouncil can help you get there, and why a data science course in Pune is a great option, this blog has the answers.
The Key Qualifications for a Data Scientist
To succeed as a data scientist, a mix of technical skills, education, and hands-on experience is essential. Here are the core qualifications required:
1. Educational Background
A strong foundation in mathematics, statistics, or computer science is typically expected. Most data scientists hold at least a bachelor’s degree in one of these fields, with many pursuing higher education such as a master's or a Ph.D. A data science course in Pune with DataCouncil can bridge this gap, offering the academic and practical knowledge required for a strong start in the industry.
2. Proficiency in Programming Languages
Programming is at the heart of data science. You need to be comfortable with languages like Python, R, and SQL, which are widely used for data analysis, machine learning, and database management. A comprehensive data science course in Pune will teach these programming skills from scratch, ensuring you become proficient in coding for data science tasks.
3. Understanding of Machine Learning
Data scientists must have a solid grasp of machine learning techniques and algorithms such as regression, clustering, and decision trees. By enrolling in a DataCouncil course, you'll learn how to implement machine learning models to analyze data and make predictions, an essential qualification for landing a data science job.
4. Data Wrangling Skills
Raw data is often messy and unstructured, and a good data scientist needs to be adept at cleaning and processing data before it can be analyzed. DataCouncil's data science course in Pune includes practical training in tools like Pandas and Numpy for effective data wrangling, helping you develop a strong skill set in this critical area.
5. Statistical Knowledge
Statistical analysis forms the backbone of data science. Knowledge of probability, hypothesis testing, and statistical modeling allows data scientists to draw meaningful insights from data. A structured data science course in Pune offers the theoretical and practical aspects of statistics required to excel.
6. Communication and Data Visualization Skills
Being able to explain your findings in a clear and concise manner is crucial. Data scientists often need to communicate with non-technical stakeholders, making tools like Tableau, Power BI, and Matplotlib essential for creating insightful visualizations. DataCouncil’s data science course in Pune includes modules on data visualization, which can help you present data in a way that’s easy to understand.
7. Domain Knowledge
Apart from technical skills, understanding the industry you work in is a major asset. Whether it’s healthcare, finance, or e-commerce, knowing how data applies within your industry will set you apart from the competition. DataCouncil's data science course in Pune is designed to offer case studies from multiple industries, helping students gain domain-specific insights.
Why Choose DataCouncil for a Data Science Course in Pune?
If you're looking to build a successful career as a data scientist, enrolling in a data science course in Pune with DataCouncil can be your first step toward reaching your goals. Here’s why DataCouncil is the ideal choice:
Comprehensive Curriculum: The course covers everything from the basics of data science to advanced machine learning techniques.
Hands-On Projects: You'll work on real-world projects that mimic the challenges faced by data scientists in various industries.
Experienced Faculty: Learn from industry professionals who have years of experience in data science and analytics.
100% Placement Support: DataCouncil provides job assistance to help you land a data science job in Pune or anywhere else, making it a great investment in your future.
Flexible Learning Options: With both weekday and weekend batches, DataCouncil ensures that you can learn at your own pace without compromising your current commitments.
Conclusion
Becoming a data scientist requires a combination of technical expertise, analytical skills, and industry knowledge. By enrolling in a data science course in Pune with DataCouncil, you can gain all the qualifications you need to thrive in this exciting field. Whether you're a fresher looking to start your career or a professional wanting to upskill, this course will equip you with the knowledge, skills, and practical experience to succeed as a data scientist.
Explore DataCouncil’s offerings today and take the first step toward unlocking a rewarding career in data science! Looking for the best data science course in Pune? DataCouncil offers comprehensive data science classes in Pune, designed to equip you with the skills to excel in this booming field. Our data science course in Pune covers everything from data analysis to machine learning, with competitive data science course fees in Pune. We provide job-oriented programs, making us the best institute for data science in Pune with placement support. Explore online data science training in Pune and take your career to new heights!
#In today's data-driven world#the role of a data scientist has become one of the most coveted career paths. With businesses relying on data for decision-making#understanding customer behavior#and improving products#the demand for skilled professionals who can analyze#interpret#and extract value from data is at an all-time high. If you're wondering what qualifications are needed to become a successful data scientis#how DataCouncil can help you get there#and why a data science course in Pune is a great option#this blog has the answers.#The Key Qualifications for a Data Scientist#To succeed as a data scientist#a mix of technical skills#education#and hands-on experience is essential. Here are the core qualifications required:#1. Educational Background#A strong foundation in mathematics#statistics#or computer science is typically expected. Most data scientists hold at least a bachelor’s degree in one of these fields#with many pursuing higher education such as a master's or a Ph.D. A data science course in Pune with DataCouncil can bridge this gap#offering the academic and practical knowledge required for a strong start in the industry.#2. Proficiency in Programming Languages#Programming is at the heart of data science. You need to be comfortable with languages like Python#R#and SQL#which are widely used for data analysis#machine learning#and database management. A comprehensive data science course in Pune will teach these programming skills from scratch#ensuring you become proficient in coding for data science tasks.#3. Understanding of Machine Learning
3 notes
·
View notes
Text
Ultimate Scalable Graph Database: ArangoDB for Real-World Use Cases
Description: Unlock the power of ArangoDB, the most complete graph database. Explore its scalability for multiple use cases including fraud detection, supply chain, network analysis, traceability, recommendations, and more. Trusted by global enterprises. Explore the advantage today!
URL: https://arangodb.com/ Location: San Francisco, CA 94104-5401 United States
0 notes
Text
Occasional reminder that, as it stands today, I am diametrically opposed to generative AI, full stop. There is no easily accessible, public or free genAI I know of that isn't theft. I will block anyone I see using or promoting the use of it in any form or fashion.
Protect artists, oppose generative AI. Yes, even the character chat bots. Yes, even for art background inspiration.
#fuck ai#it's not even ai it's machine learning lol#call me when someone makes an ethical and consensual machine learning database then we can talk#did i say fuck ai yet?#bc fuck ai
21 notes
·
View notes
Text
some older menu designs
#the left machine actually got scrapped in the end#i was thinking it'd be nice to have like an ultimate storage#and store matching frogs together#and then you could search them all#like a sort of database#but then i realised the ancestry mechanic#(where the last 7 generations for that frog have their traits stored for the frog)#means that the possibilities would be so insane that hardly any would stack ever anyway#so I scrapped it for now#i might revisit it#the right machine was the initial alchemy bench design#which is like an upgraded cauldron with infinite powder storage#that changed a bit when i implemented it#mainly the middle section to make things clearer#mudborne#game development#game design#indie games#frog#frogs#pixel art
6 notes
·
View notes
Text
The Skills I Acquired on My Path to Becoming a Data Scientist
Data science has emerged as one of the most sought-after fields in recent years, and my journey into this exciting discipline has been nothing short of transformative. As someone with a deep curiosity for extracting insights from data, I was naturally drawn to the world of data science. In this blog post, I will share the skills I acquired on my path to becoming a data scientist, highlighting the importance of a diverse skill set in this field.
The Foundation — Mathematics and Statistics
At the core of data science lies a strong foundation in mathematics and statistics. Concepts such as probability, linear algebra, and statistical inference form the building blocks of data analysis and modeling. Understanding these principles is crucial for making informed decisions and drawing meaningful conclusions from data. Throughout my learning journey, I immersed myself in these mathematical concepts, applying them to real-world problems and honing my analytical skills.
Programming Proficiency
Proficiency in programming languages like Python or R is indispensable for a data scientist. These languages provide the tools and frameworks necessary for data manipulation, analysis, and modeling. I embarked on a journey to learn these languages, starting with the basics and gradually advancing to more complex concepts. Writing efficient and elegant code became second nature to me, enabling me to tackle large datasets and build sophisticated models.
Data Handling and Preprocessing
Working with real-world data is often messy and requires careful handling and preprocessing. This involves techniques such as data cleaning, transformation, and feature engineering. I gained valuable experience in navigating the intricacies of data preprocessing, learning how to deal with missing values, outliers, and inconsistent data formats. These skills allowed me to extract valuable insights from raw data and lay the groundwork for subsequent analysis.
Data Visualization and Communication
Data visualization plays a pivotal role in conveying insights to stakeholders and decision-makers. I realized the power of effective visualizations in telling compelling stories and making complex information accessible. I explored various tools and libraries, such as Matplotlib and Tableau, to create visually appealing and informative visualizations. Sharing these visualizations with others enhanced my ability to communicate data-driven insights effectively.
Machine Learning and Predictive Modeling
Machine learning is a cornerstone of data science, enabling us to build predictive models and make data-driven predictions. I delved into the realm of supervised and unsupervised learning, exploring algorithms such as linear regression, decision trees, and clustering techniques. Through hands-on projects, I gained practical experience in building models, fine-tuning their parameters, and evaluating their performance.
Database Management and SQL
Data science often involves working with large datasets stored in databases. Understanding database management and SQL (Structured Query Language) is essential for extracting valuable information from these repositories. I embarked on a journey to learn SQL, mastering the art of querying databases, joining tables, and aggregating data. These skills allowed me to harness the power of databases and efficiently retrieve the data required for analysis.
Domain Knowledge and Specialization
While technical skills are crucial, domain knowledge adds a unique dimension to data science projects. By specializing in specific industries or domains, data scientists can better understand the context and nuances of the problems they are solving. I explored various domains and acquired specialized knowledge, whether it be healthcare, finance, or marketing. This expertise complemented my technical skills, enabling me to provide insights that were not only data-driven but also tailored to the specific industry.
Soft Skills — Communication and Problem-Solving
In addition to technical skills, soft skills play a vital role in the success of a data scientist. Effective communication allows us to articulate complex ideas and findings to non-technical stakeholders, bridging the gap between data science and business. Problem-solving skills help us navigate challenges and find innovative solutions in a rapidly evolving field. Throughout my journey, I honed these skills, collaborating with teams, presenting findings, and adapting my approach to different audiences.
Continuous Learning and Adaptation
Data science is a field that is constantly evolving, with new tools, technologies, and trends emerging regularly. To stay at the forefront of this ever-changing landscape, continuous learning is essential. I dedicated myself to staying updated by following industry blogs, attending conferences, and participating in courses. This commitment to lifelong learning allowed me to adapt to new challenges, acquire new skills, and remain competitive in the field.
In conclusion, the journey to becoming a data scientist is an exciting and dynamic one, requiring a diverse set of skills. From mathematics and programming to data handling and communication, each skill plays a crucial role in unlocking the potential of data. Aspiring data scientists should embrace this multidimensional nature of the field and embark on their own learning journey. If you want to learn more about Data science, I highly recommend that you contact ACTE Technologies because they offer Data Science courses and job placement opportunities. Experienced teachers can help you learn better. You can find these services both online and offline. Take things step by step and consider enrolling in a course if you’re interested. By acquiring these skills and continuously adapting to new developments, they can make a meaningful impact in the world of data science.
#data science#data visualization#education#information#technology#machine learning#database#sql#predictive analytics#r programming#python#big data#statistics
14 notes
·
View notes
Text
i don't even want to take half my classes that im registered for autumn quarter
#this is just me spitballing here -#definitely need to take the security related classes that i want to do and even if i did dogshit at databases i still want to take the next#-class in the series bc i heard its more programming#also maybe the instructor won't suck that was a huge part of it#and im dropping data science im sorry i can't do algos i can't do machine learning. fuck cs theory im not into that.#but the question now is - do systems (which would give me a leg up in the security track) or do animation capstone (which is cool)#downside of systems is that theyre some of the hardest programming classes and i have to do group projects#downside of animation is that i missed the ball on most of the stuff and its a lot of classes in a sequence. also group projects.#(or neither and fuck off and do the global health minor or take english/art/architecture/philosophy classes just for fun idfk)#the stem major's curse#the last humanities option is looking nice rn maybe i can also take a bunch of public health stuff too#i actually kind of want to enjoy college even if im a commuter with no friends#hm ok i will go look for some classes later today#the data science stuff really screwed me over im glad im free of that at least#milk (normal)#i am kind of the ''mid at everything'' guy so no specialization for me or else i get bored and start attacking myself with hammers#and i change my mind about who i am and my personality every few months so thats really fun to deal with.
7 notes
·
View notes