#multimodal AI model
Explore tagged Tumblr posts
usaii · 28 days ago
Text
A New Player in the League of LLMs – Mistral Le Chat | Infographic
Tumblr media
Learn about the latest player in the world of LLMs – the Mistral’s Le Chat and understand in this infographic its features, and how it compares with leading players.
Read More: https://shorturl.at/N6pIs
Mistral Le Chat, AI assistant, multimodal AI model, AI models, Machine learning algorithms, AI chatbots, large language models, best AI certifications, AI Engineer, AI skills
Tumblr media
0 notes
govindhtech · 1 month ago
Text
Pixtral Large 25.02: Amazon Bedrock Serverless Multimodal AI
Tumblr media
AWS releases Pixtral Large 25.02 for serverless Amazon Bedrock.
Amazon Bedrock Pixtral Large
The Pixtral Large 25.02 model is now completely managed and serverless on Amazon Bedrock. AWS was the first major cloud service to provide serverless, fully managed Pixtral Large.
Infrastructure design, specific expertise, and continual optimisation are often needed to manage massive foundation model (FM) computational demands. Many clients must manage complex infrastructures or choose between cost and performance when deploying sophisticated models.
Mistral AI's first multimodal model, Pixtral Large, combines high language understanding with advanced visuals. Its 128K context window makes it ideal for complex visual reasoning. The model performs well on MathVista, DocVQA, and VQAv2, proving its effectiveness in document analysis, chart interpretation, and natural picture understanding.
Pixtral Large excels at multilingualism. Global teams and apps can use English, French, German, Spanish, Italian, Chinese, Japanese, Korean, Portuguese, Dutch, and Polish. Python, Java, C, C++, JavaScript, Bash, Swift, and Fortran are among the 80 languages it can write and read.
Developers will like the model's agent-centric architecture since it integrates with current systems via function calling and JSON output formatting. Its sturdy system fast adherence improves dependability in large context situations and RAG applications.
This complex model is currently available in Amazon Bedrock without infrastructure for Pixtral Large. Serverless allows you to scale usage based on demand without prior commitments or capacity planning. No wasted resources mean you only pay for what you utilise.
Deduction across regions
Pixtral Large is now available in Amazon Bedrock across various AWS Regions due to cross-region inference.
Amazon Bedrock cross-Region inference lets you access a single FM across many regions with high availability and low latency for global applications. A model deployed in both the US and Europe may be accessible via region-specific API endpoints with various prefixes: us.model-id for US and eu.model-id for European.
By confining data processing within defined geographic borders, Amazon Bedrock may comply with laws and save latency by sending inference requests to the user's nearest endpoint. The system automatically manages load balancing and traffic routing across Regional installations to enable seamless scalability and redundancy without your monitoring.
How it works?
I always investigate how new capabilities might solve actual problems as a developer advocate. The Amazon Bedrock Converse API's new multimodal features were perfect for testing when she sought for help with her physics exam.
It struggled to solve these challenges. It realised this was the best usage for the new multimodal characteristics. The Converse API was used to create a rudimentary application that could comprehend photographs of a complex problem sheet with graphs and mathematical symbols. Once the physics test materials were uploaded, ask the model to explain the response process.
The following event impressed them both. Model interpreted schematics, mathematical notation, and French language, and described each difficulty step-by-step. The computer kept context throughout the talk and offered follow-up questions about certain issues to make teaching feel natural.
It was confident and ready for this test, showing how Amazon Bedrock's multimodal capabilities can provide users meaningful experiences.
Start now
The new method is available at US East (Ohio, N. Virginia), US West (Oregon), and Europe (Frankfurt, Ireland, Paris, Stockholm) Regional API endpoints. Regional availability reduces latency, meeting data residency criteria.
Use the AWS CLI, SDK, or Management Console to programmatically access the model using the model ID mistral.pixtral-large-2502-v1:0.
Developers and organisations of all sizes may now employ strong multimodal AI, a major leap. AWS serverless infrastructure with Mistral AI's cutting-edge model let you focus on designing innovative apps without worrying about complexity.
0 notes
srzayed · 2 years ago
Text
Introducing Kreeto: A New AI-Powered Platform Set To Transform Digital Experiences
Tumblr media
Kreeto is an advanced AI platform that brings together cutting-edge technologies to provide a comprehensive and efficient solution for various tasks. Equipped with a diverse set of 71 writing tools, Kreeto empowers users to create compelling content, whether it's articles, reports, or creative pieces. With its powerful data mining process called Kreedex, Kreeto utilizes machine learning capabilities to gather relevant information and generate insights. Additionally, Kreeto offers image generation and voice generation features, making it a versatile tool for multimedia content creation. Seamlessly integrating into workflows, Kreeto is designed to enhance productivity and streamline the creative process. Experience the limitless possibilities with Kreeto and unlock your true potential.
Let's dig into deep.
KreeGen: With KreeGen, our cutting-edge image generation model, you have the power to bring your ideas to life visually. Whether you need vibrant illustrations, stunning designs, or realistic renderings, KreeGen is at your service. Simply describe what you envision, and KreeGen will generate high-quality images that align with your creative vision. KryoSynth: When it comes to audio, our advanced KryoSynth technology takes center stage. It allows you to create synthesized voices that capture a range of tones and styles. From natural-sounding narrations to dynamic character voices, KryoSynth empowers you to enhance your projects with captivating audio experiences. CodeKrafter: If coding and programming are on your agenda, look no further than CodeKrafter. This powerful tool assists in generating code snippets to streamline your development process. With CodeKrafter, you can save time and effort by automating repetitive tasks and accessing optimized solutions for various programming languages. KreeStation: For all your creative needs, KreeStation serves as a central hub. It provides seamless access to an array of resources, including writing tools, idea generators, project management features, and more. With KreeStation as your creative command center, you'll find everything you need to fuel your innovative endeavors.
“Pushing boundaries of innovation, we believe Kreeto will change how individuals and industries operate. Our ultimate goal is making technology more accessible, intuitive and efficient,” said a spokesperson from the Kreetoverse team.
The launch of Kreeto marks a significant achievement for technology lovers and industry professionals. Striving towards a more interconnected and intelligent future, Kreeto promises to be a game-changer in the realm of Artificial Intelligence.
1 note · View note
zoetech · 2 years ago
Text
0 notes
canmom · 2 months ago
Text
oh no she's talking about AI some more
to comment more on the latest round of AI big news (guess I do have more to say after all):
chatgpt ghiblification
trying to figure out how far it's actually an advance over the state of the art of finetunes and LoRAs and stuff in image generation? I don't keep up with image generation stuff really, just look at it occasionally and go damn that's all happening then, but there are a lot of finetunes focusing on "Ghibli's style" which get it more or less well. previously on here I commented on an AI video model generation that patterned itself on Ghibli films, and video is a lot harder than static images.
of course 'studio Ghibli style' isn't exactly one thing: there are stylistic commonalities to many of their works and recurring designs, for sure, but there are also details that depend on the specific character designer and film in question in large and small ways (nobody is shooting for My Neighbours the Yamadas with this, but also e.g. Castle in the Sky does not look like Pom Poko does not look like How Do You Live in a number of ways, even if it all recognisably belongs to the same lineage).
the interesting thing about the ghibli ChatGPT generations for me is how well they're able to handle simplification of forms in image-to-image generation, often quite drastically changing the proportions of the people depicted but recognisably maintaining correspondence of details. that sort of stylisation is quite difficult to do well even for humans, and it must reflect quite a high level of abstraction inside the model's latent space. there is also relatively little of the 'oversharpening'/'ringing artefact' look that has been a hallmark of many popular generators - it can do flat colour well.
the big touted feature is its ability to place text in images very accurately. this is undeniably impressive, although OpenAI themeselves admit it breaks down beyond a certain point, creating strange images which start out with plausible, clean text and then it gradually turns into AI nonsense. it's really weird! I thought text would go from 'unsolved' to 'completely solved' or 'randomly works or doesn't work' - instead, here it feels sort of like the model has a certain limited 'pipeline' for handling text in images, but when the amount of text overloads that bandwidth, the rest of the image has to make do with vague text-like shapes! maybe the techniques from that anthropic thought-probing paper might shed some light on how information flows through the model.
similarly the model also has a limit of scene complexity. it can only handle a certain number of objects (10-20, they say) before it starts getting confused and losing track of details.
as before when they first wired up Dall-E to ChatGPT, it also simply makes prompting a lot simpler. you don't have to fuck around with LoRAs and obtuse strings of words, you just talk to the most popular LLM and ask it to perform a modification in natural language: the whole process is once again black-boxed but you can tell it in natural language to make changes. it's a poor level of control compared to what artists are used to, but it's still huge for ordinary people, and of course there's nothing stopping you popping the output into an editor to do your own editing.
not sure the architecture they're using in this version, if ChatGPT is able to reason about image data in the same space as language data or if it's still calling a separate image model... need to look that up.
openAI's own claim is:
We trained our models on the joint distribution of online images and text, learning not just how images relate to language, but how they relate to each other. Combined with aggressive post-training, the resulting model has surprising visual fluency, capable of generating images that are useful, consistent, and context-aware.
that's kind of vague. not sure what architecture that implies. people are talking about 'multimodal generation' so maybe it is doing it all in one model? though I'm not exactly sure how the inputs and outputs would be wired in that case.
anyway, as far as complex scene understanding: per the link they've cracked the 'horse riding an astronaut' gotcha, they can do 'full glass of wine' at least some of the time but not so much in combination with other stuff, and they can't do accurate clock faces still.
normal sentences that we write in 2025.
it sounds like we've moved well beyond using tools like CLIP to classify images, and I suspect that glaze/nightshade are already obsolete, if they ever worked to begin with. (would need to test to find out).
all that said, I believe ChatGPT's image generator had been behind the times for quite a long time, so it probably feels like a bigger jump for regular ChatGPT users than the people most hooked into the AI image generator scene.
of course, in all the hubbub, we've also already seen the white house jump on the trend in a suitably appalling way, continuing the current era of smirking fascist political spectacle by making a ghiblified image of a crying woman being deported over drugs charges. (not gonna link that shit, you can find it if you really want to.) it's par for the course; the cruel provocation is exactly the point, which makes it hard to find the right tone to respond. I think that sort of use, though inevitable, is far more of a direct insult to the artists at Ghibli than merely creating a machine that imitates their work. (though they may feel differently! as yet no response from Studio Ghibli's official media. I'd hate to be the person who has to explain what's going on to Miyazaki.)
google make number go up
besides all that, apparently google deepmind's latest gemini model is really powerful at reasoning, and also notably cheaper to run, surpassing DeepSeek R1 on the performance/cost ratio front. when DeepSeek did this, it crashed the stock market. when Google did... crickets, only the real AI nerds who stare at benchmarks a lot seem to have noticed. I remember when Google releases (AlphaGo etc.) were huge news, but somehow the vibes aren't there anymore! it's weird.
I actually saw an ad for google phones with Gemini in the cinema when i went to see Gundam last week. they showed a variety of people asking it various questions with a voice model, notably including a question on astrology lmao. Naturally, in the video, the phone model responded with some claims about people with whatever sign it was. Which is a pretty apt demonstration of the chameleon-like nature of LLMs: if you ask it a question about astrology phrased in a way that implies that you believe in astrology, it will tell you what seems to be a natural response, namely what an astrologer would say. If you ask if there is any scientific basis for belief in astrology, it would probably tell you that there isn't.
In fact, let's try it on DeepSeek R1... I ask an astrological question, got an astrological answer with a really softballed disclaimer:
Individual personalities vary based on numerous factors beyond sun signs, such as upbringing and personal experiences. Astrology serves as a tool for self-reflection, not a deterministic framework.
Ask if there's any scientific basis for astrology, and indeed it gives you a good list of reasons why astrology is bullshit, bringing up the usual suspects (Barnum statements etc.). And of course, if I then explain the experiment and prompt it to talk about whether LLMs should correct users with scientific information when they ask about pseudoscientific questions, it generates a reasonable-sounding discussion about how you could use reinforcement learning to encourage models to focus on scientific answers instead, and how that could be gently presented to the user.
I wondered if I'd asked it instead to talk about different epistemic regimes and come up with reasons why LLMs should take astrology into account in their guidance. However, this attempt didn't work so well - it started spontaneously bringing up the science side. It was able to observe how the framing of my question with words like 'benefit', 'useful' and 'LLM' made that response more likely. So LLMs infer a lot of context from framing and shape their simulacra accordingly. Don't think that's quite the message that Google had in mind in their ad though.
I asked Gemini 2.0 Flash Thinking (the small free Gemini variant with a reasoning mode) the same questions and its answers fell along similar lines, although rather more dry.
So yeah, returning to the ad - I feel like, even as the models get startlingly more powerful month by month, the companies still struggle to know how to get across to people what the big deal is, or why you might want to prefer one model over another, or how the new LLM-powered chatbots are different from oldschool assistants like Siri (which could probably answer most of the questions in the Google ad, but not hold a longform conversation about it).
some general comments
The hype around ChatGPT's new update is mostly in its use as a toy - the funny stylistic clash it can create between the soft cartoony "Ghibli style" and serious historical photos. Is that really something a lot of people would spend an expensive subscription to access? Probably not. On the other hand, their programming abilities are increasingly catching on.
But I also feel like a lot of people are still stuck on old models of 'what AI is and how it works' - stochastic parrots, collage machines etc. - that are increasingly falling short of the more complex behaviours the models can perform, now prediction combines with reinforcement learning and self-play and other methods like that. Models are still very 'spiky' - superhumanly good at some things and laughably terrible at others - but every so often the researchers fill in some gaps between the spikes. And then we poke around and find some new ones, until they fill those too.
I always tried to resist 'AI will never be able to...' type statements, because that's just setting yourself up to look ridiculous. But I will readily admit, this is all happening way faster than I thought it would. I still do think this generation of AI will reach some limit, but genuinely I don't know when, or how good it will be at saturation. A lot of predicted 'walls' are falling.
My anticipation is that there's still a long way to go before this tops out. And I base that less on the general sense that scale will solve everything magically, and more on the intense feedback loop of human activity that has accumulated around this whole thing. As soon as someone proves that something is possible, that it works, we can't resist poking at it. Since we have a century or more of science fiction priming us on dreams/nightmares of AI, as soon as something comes along that feels like it might deliver on the promise, we have to find out. It's irresistable.
AI researchers are frequently said to place weirdly high probabilities on 'P(doom)', that AI research will wipe out the human species. You see letters calling for an AI pause, or papers saying 'agentic models should not be developed'. But I don't know how many have actually quit the field based on this belief that their research is dangerous. No, they just get a nice job doing 'safety' research. It's really fucking hard to figure out where this is actually going, when behind the eyes of everyone who predicts it, you can see a decade of LessWrong discussions framing their thoughts and you can see that their major concern is control over the light cone or something.
34 notes · View notes
rideboomindia · 10 months ago
Text
Tumblr media
Based on the search results, here are some innovative technologies that RideBoom could implement to enhance the user experience and stay ahead of ONDC:
Enhanced Safety Measures: RideBoom has already implemented additional safety measures, including enhanced driver background checks, real-time trip monitoring, and improved emergency response protocols. [1] To stay ahead, they could further enhance safety by integrating advanced telematics and AI-powered driver monitoring systems to ensure safe driving behavior.
Personalized and Customizable Services: RideBoom could introduce a more personalized user experience by leveraging data analytics and machine learning to understand individual preferences and offer tailored services. This could include features like customizable ride preferences, personalized recommendations, and the ability to save preferred routes or driver profiles. [1]
Seamless Multimodal Integration: To provide a more comprehensive transportation solution, RideBoom could integrate with other modes of transportation, such as public transit, bike-sharing, or micro-mobility options. This would allow users to plan and book their entire journey seamlessly through the RideBoom app, enhancing the overall user experience. [1]
Sustainable and Eco-friendly Initiatives: RideBoom has already started introducing electric and hybrid vehicles to its fleet, but they could further expand their green initiatives. This could include offering incentives for eco-friendly ride choices, partnering with renewable energy providers, and implementing carbon offset programs to reduce the environmental impact of their operations. [1]
Innovative Payment and Loyalty Solutions: To stay competitive with ONDC's zero-commission model, RideBoom could explore innovative payment options, such as integrated digital wallets, subscription-based services, or loyalty programs that offer rewards and discounts to frequent users. This could help attract and retain customers by providing more value-added services. [2]
Robust Data Analytics and Predictive Capabilities: RideBoom could leverage advanced data analytics and predictive modeling to optimize their operations, anticipate demand patterns, and proactively address user needs. This could include features like dynamic pricing, intelligent routing, and personalized recommendations to enhance the overall user experience. [1]
By implementing these innovative technologies, RideBoom can differentiate itself from ONDC, provide a more seamless and personalized user experience, and stay ahead of the competition in the on-demand transportation market.
57 notes · View notes
fipindustries · 8 months ago
Text
AIs dont understand the real world
a lot has been said over the fact that LLMs dont actually understand the real world, just the statistical relationship between empty tokens, empty words. i say "empty" because in the AI's mind those words dont actually connect to a real world understanding of what the words represent. the AI may understand that the collection of letters "D-O-G" may have some statistical connection to "T-A-I-L" and "F-U-R" and "P-O-O-D-L-E" but it doesnt actually know anything about what an actual Dog is or what is a Tail or actual Fur or real life Poodles.
and yet it seems to be capable of holding remarcably coherent conversations. it seems more and more, with each new model that comes out, to become better at answering questions, at reasoning, at creating original writing. if it doesnt truly understand the world it sure seems to get better at acting like it does with nothing but just a statistical understanding of how words are related to each other.
i guess the question ultimatly is "if you understand well enough the relationship between raw symbols could you have an understanding of the underlying relatonships between the things these symbols represent?"
now, let me take a small tangent towards human understanding. specifically towards what philosophy has to say about it.
one of the classic problems philosophers deal with is, how do we know the world is real. how do we know we can trust our senses, how do we know the truth? many argue that we cant. that we dont really percieve the true world out there beyond ourselves. all we can percieve is what our senses are telling us about the world.
lets think of sight.
if you think about it, we dont really "see" objects, right? we just see the light that bounces off those objects. and even then we dont really "see" the photons that collide with our eye, we see the images that our brain generates in our mind, presumably because the corresponding photons collided with our eye. but colorblind people and people who experience visual hallucinations have shown that what we see doesnt have to always correspond with actual phisical phenomena occuring in the real world.
we dont see the real world, we see referents to it. and from the relationships between these referents we start to infer the properties of the actual world out there. are the lights that hit our eye similar to the words that the LLM is trained on? only a referent whose understanding allows it to function in the real world, even if it cant actually "percieve" the "real" world.
but, one might say, we dont just have sight. we have other senses, we have smell and touch and taste and audition. all of these things allow us to form a much richer and multidimensional understanding of the world, even if by virtue of being human senses they have all the problems that sight has of being one step removed from the real world. so to this i will say that multimodal AIs also exist. AIs that can connect audio and visuals and words together and form relationships between all of this disparate data.
so then, can it be said that they understand the world? and if not yet, is that a cathegorical difference or merely a difference of degree. that is to say, not that they cathegorically cant understand the world but simply that they understand it less well than humans do.
41 notes · View notes
argumate · 2 months ago
Text
As a deep-thinking reasoning model with multimodal capabilities, ERNIE X1 delivers performance on par with DeepSeek R1 at only half the price. Meanwhile, ERNIE 4.5 is our latest foundation model and new-generation native multimodal model.
cost of AI continues its race towards zero, excellent news for anyone concerned about AI resource usage.
19 notes · View notes
govindhtech · 1 month ago
Text
Pegasus 1.2: High-Performance Video Language Model
Tumblr media
Pegasus 1.2 revolutionises long-form video AI with high accuracy and low latency. Scalable video querying is supported by this commercial tool.
TwelveLabs and Amazon Web Services (AWS) announced that Amazon Bedrock will soon provide Marengo and Pegasus, TwelveLabs' cutting-edge multimodal foundation models. Amazon Bedrock, a managed service, lets developers access top AI models from leading organisations via a single API. With seamless access to TwelveLabs' comprehensive video comprehension capabilities, developers and companies can revolutionise how they search for, assess, and derive insights from video content using AWS's security, privacy, and performance. TwelveLabs models were initially offered by AWS.
Introducing Pegasus 1.2
Unlike many academic contexts, real-world video applications face two challenges:
Real-world videos might be seconds or hours lengthy.
Proper temporal understanding is needed.
TwelveLabs is announcing Pegasus 1.2, a substantial industry-grade video language model upgrade, to meet commercial demands. Pegasus 1.2 interprets long films at cutting-edge levels. With low latency, low cost, and best-in-class accuracy, model can handle hour-long videos. Their embedded storage ingeniously caches movies, making it faster and cheaper to query the same film repeatedly.
Pegasus 1.2 is a cutting-edge technology that delivers corporate value through its intelligent, focused system architecture and excels in production-grade video processing pipelines.
Superior video language model for extended videos
Business requires handling long films, yet processing time and time-to-value are important concerns. As input films increase longer, a standard video processing/inference system cannot handle orders of magnitude more frames, making it unsuitable for general adoption and commercial use. A commercial system must also answer input prompts and enquiries accurately across larger time periods.
Latency
To evaluate Pegasus 1.2's speed, it compares time-to-first-token (TTFT) for 3–60-minute videos utilising frontier model APIs GPT-4o and Gemini 1.5 Pro. Pegasus 1.2 consistently displays time-to-first-token latency for films up to 15 minutes and responds faster to lengthier material because to its video-focused model design and optimised inference engine.
Performance
Pegasus 1.2 is compared to frontier model APIs using VideoMME-Long, a subset of Video-MME that contains films longer than 30 minutes. Pegasus 1.2 excels above all flagship APIs, displaying cutting-edge performance.
Pricing
Cost Pegasus 1.2 provides best-in-class commercial video processing at low cost. TwelveLabs focusses on long videos and accurate temporal information rather than everything. Its highly optimised system performs well at a competitive price with a focused approach.
Better still, system can generate many video-to-text without costing much. Pegasus 1.2 produces rich video embeddings from indexed movies and saves them in the database for future API queries, allowing clients to build continually at little cost. Google Gemini 1.5 Pro's cache cost is $4.5 per hour of storage, or 1 million tokens, which is around the token count for an hour of video. However, integrated storage costs $0.09 per video hour per month, x36,000 less. Concept benefits customers with large video archives that need to understand everything cheaply.
Model Overview & Limitations
Architecture
Pegasus 1.2's encoder-decoder architecture for video understanding includes a video encoder, tokeniser, and big language model. Though efficient, its design allows for full textual and visual data analysis.
These pieces provide a cohesive system that can understand long-term contextual information and fine-grained specifics. It architecture illustrates that tiny models may interpret video by making careful design decisions and solving fundamental multimodal processing difficulties creatively.
Restrictions
Safety and bias
Pegasus 1.2 contains safety protections, but like any AI model, it might produce objectionable or hazardous material without enough oversight and control. Video foundation model safety and ethics are being studied. It will provide a complete assessment and ethics report after more testing and input.
Hallucinations
Occasionally, Pegasus 1.2 may produce incorrect findings. Despite advances since Pegasus 1.1 to reduce hallucinations, users should be aware of this constraint, especially for precise and factual tasks.
2 notes · View notes
newspatron · 1 year ago
Text
Google Gemini: The Ultimate Guide to the Most Advanced AI Model Ever
We hope you enjoyed this article and found it informative and insightful. We would love to hear your feedback and suggestions, so please feel free to leave a comment below or contact us through our website. Thank you for reading and stay tuned for more
Google Gemini: A Revolutionary AI Model that Can Shape the Future of Technology and Society. Artificial intelligence (AI) is one of the most exciting and rapidly evolving fields of technology today. From personal assistants to self-driving cars, AI is transforming various aspects of our lives and society. However, the current state of AI is still far from achieving human-like intelligence and…
Tumblr media
View On WordPress
0 notes
johnsmithtoptech · 2 months ago
Text
Baidu Launches New Reasoning AI Model in Challenge to DeepSeek
Baidu has unveiled Ernie X1, a reasoning AI model designed to rival DeepSeek's R1, offering similar performance at half the cost. This model enhances understanding, planning, reflection, and evolution capabilities. Additionally, Baidu introduced Ernie 4.5, a multimodal model that surpasses GPT-4.5's performance at just 1% of the price. Both models will be open-sourced by June 30, aiming to boost wider usage
3 notes · View notes
zoetech · 2 years ago
Text
0 notes
rathcoremarketing · 2 months ago
Text
What's Happening to SEO? 8 SEO Trends for 2025
🔄 Updated 2/26/2025
🕓 8-Min Read
🦘 Section Jumper
Optimize for E-E-A-T Signals
AI Overview and SEO
Forum Marketing and SERPs Updates
Is Traditional SEO Still Relevant
Zero-Click Searches
Map Pack and Local Heat Maps
Voice and Mobile Search Optimization
What's Better for AI: BOFU or TOFU Content?
Let’s call it what it is —SEO isn’t some clever marketing hack anymore; it’s now a battlefield where the rules change faster than your morning coffee order. And if you’ve been patting yourself on the back for nailing your SEO strategy, look, those same strategies might already be obsolete. Yeah, that’s how fast the game is flipping.
For years, we’ve been told that backlinks and keywords were the golden tickets. And now?
Gen Z is asking TikTok instead of Google, search engines are reading context like a nosy detective, and over half of all searches don’t even bother clicking on anything.
Welcome to SEO trends for 2025—a world where your next competitor might be an AI tool, a 3-second video, or even Google itself deciding to hoard its users.
youtube
1. Optimize for E-E-A-T Signals
There’s no nice way to say this: if your content isn’t radiating credibility, Google probably isn’t interested.
Now comes E-E-A-T—Experience, Expertise, Authoritativeness, and Trustworthiness. While it sounds like a mouthful, it’s the compass guiding Google’s ranking algorithm in 2025. If your content strategy ignores these signals, you're handing over your traffic to someone else—no questions asked.
What Is E-E-A-T, and Why Does It Matter?
Tumblr media
E-E-A-T isn’t just some buzzword for digital marketing geeks to toss around at conferences. It’s Google’s way of separating the wheat from the chaff. Experience means real-world insights. Expertise ensures your content doesn’t sound like it’s written by an intern on their first day. Authoritativeness demands recognition from your industry.
And trust?
Well, it’s the silent decider—get it wrong, and everything else crumbles.
For content optimization in 2025, ignoring E-E-A-T signals means you’re throwing darts blindfolded. Google’s updates now measure not just what you say, but why anyone should care. And here’s the thing: thin content and anonymous authors are SEO death sentences.
How to Nail E-E-A-T (and Stay Ahead of the Latest SEO Trends)
Experience
Share specific, actionable knowledge. Generic advice doesn’t cut it anymore.
Example: A blog about SEO trends shouldn’t vaguely define "SEO"—it should delve into zero-click searches or multimodal search backed by real-world data.
Expertise
Feature qualified authors or contributors. Link their credentials to their content. Google actually checks authorship, so anonymous content only screams "spam."
Authoritativeness
Earn backlinks from reputable sites. Don’t fake authority—Google sees through it.
Trustworthiness
Secure your site (HTTPS), include proper sourcing, and avoid clickbait titles that don’t deliver.
The Hard Truth about E-E-A-T
E-E-A-T is the foundation for content optimization in a post-2024 world. The latest SEO trends show Google’s focus isn’t just on keywords but on the credibility of your entire digital presence.
It’s no longer enough to rank; you need to deserve to rank.
2. AI Overview and SEO
Tumblr media
Artificial Intelligence is practically running the show. In 2025, AI isn’t a gimmick; it’s the brains behind search engines, content creation, and the unspoken secrets of what ranks. If you’re still crafting strategies without factoring in AI, here’s the harsh truth: you’re optimizing for a version of the internet that’s already irrelevant.
How AI Is Reshaping SEO
AI has transcended its “future of marketing” tagline. Today, it’s the present, and every search marketer worth their salt knows it.
Let’s break it down:
AI-Driven Search Engines
Google’s RankBrain and Multitask Unified Model (MUM) are redefining how search intent optimization works. They analyze context, intent, and semantics better than ever. Gone are the days when sprinkling keywords like fairy dust could boost rankings. AI demands relevance, intent, and, let’s be honest, better content.
Automated Content Creation
Tools like ChatGPT and Jasper are churning out content faster than most humans can proofread. The catch is, Google’s Helpful Content Update is watching—and penalizing—low-quality AI spam. Automated content might save time, but without a human layer of expertise, it’s a one-way ticket to obscurity.
Smart Search Predictions
AI isn’t just predicting what users type—it’s analyzing how they think. From location-based recommendations to real-time search trends, AI is shaping results before users finish typing their queries. This makes AI SEO tools like Clearscope and Surfer SEO essential for staying competitive.
Google AI Overview SERP: The New Front Door of Search
Tumblr media
What Makes Google AI Overview SERPs Stand Out?
Generative AI Summaries In late 2023, Google started rolling out generative AI summaries at the top of certain searches. These provide quick, digestible answers pulled from the web, cutting through the noise of lengthy pages. It’s fast, convenient, and often the first (and only) thing users see. Pro Tip: Structure your content to directly answer questions concisely while retaining depth. Think FAQ sections, bullet points, and clear headers.
Visual Enhancements Google AI Overview SERPs now integrate rich visuals, including images, charts, and interactive elements powered by AI. These upgrades aren’t just eye-catching; they drive engagement. Pro Tip: Optimize images with alt text, compress them for speed, and ensure visual assets are relevant and high-quality.
Personalization on Steroids Google’s AI doesn’t just know what users want—it predicts it. From personalized recommendations to local search enhancements, SERPs are more targeted than ever. Pro Tip: Leverage local SEO strategies and schema markup to cater to these hyper-personalized results.
Adapting to Google AI SERPs
Aim for Snippet Domination: Featured snippets are now more important than ever, with AI summaries pulling directly from them. Answer questions directly and succinctly in your content.
Invest in Topic Clusters: AI thrives on context. Interlinking detailed, related content helps your site signal authority and relevance.
Optimize for Real Intent: With AI interpreting user queries more deeply, addressing surface-level keywords won’t cut it. Focus on intent-driven long-tail keywords and nuanced subtopics.
The Bottom Line
Google’s AI Overview SERP is the digital gateway to visibility in 2025. If your strategy isn’t aligned with these changes, you risk becoming invisible. Adapt your content to meet the demands of AI-driven features, and you’ll not just survive—you’ll thrive in this new SEO frontier.
What This Means for Your Strategy
AI-Assisted Content: Use AI for efficiency, but let humans handle creativity and trust-building.
Search Intent Optimization: Focus on answering deeper, adjacent questions. AI rewards nuanced, contextual relevance.
Invest in Tools: Tools like SEMrush and Ahrefs now integrate AI-powered insights, helping you stay ahead.
Look, artificial Intelligence in SEO isn’t an edge—it’s the standard. By 2025, marketers who don’t adapt will find their strategies in a digital graveyard. AI doesn’t replace your expertise; it amplifies it. Use it wisely—or get left behind.
3. Forum Marketing and SERP Updates
Platforms like Reddit, Quora, and niche communities are silently reshaping SEO and slipping into prime real estate on search engine results pages (SERPs). For marketers obsessed with the usual Google ranking factors, ignoring community-driven content could be the blind spot that costs you big.
Tumblr media
Why Forums Are Influencing SERPs
Content Depth
Community-driven content is often nuanced, answering long-tail questions that traditional blogs barely skim. For instance, a Quora thread titled “Best local SEO strategies for small businesses in 2025” isn’t just generic advice—it’s specific, diverse, and sometimes brutally honest.
Searcher Intent Alignment
Forums directly address search intent optimization by catering to niche queries. Whether it’s “How to rank for hyper-local searches” or “Why my Google Business profile isn’t showing up,” forums deliver precise, user-generated insights.
Fresh Perspectives
Unlike stale, regurgitated SEO articles, forums thrive on updated discussions. A Reddit thread on “latest SEO trends” could become the top result simply because it offers real-time relevance.
What Marketers Need to Do
Engage, Don’t Spam
Build credibility by genuinely contributing to forums. Overly promotional comments are a fast track to being ignored—or worse, banned.
Monitor Trends
Tools like AnswerThePublic and BuzzSumo can identify trending community topics. Use these to create content that aligns with user discussions.
Optimize for SERP Features
Structure blog content to mimic forum-style Q&As. Google loves direct, conversational formats.
Ignoring the surge of forum content is no longer an option. So, don’t get left behind watching Quora outrank your site—adapt now.
4. Is Traditional SEO Still Relevant?
The debate is as old as Google itself: does traditional SEO still matter in a world where AI is taking over and search engines are rewriting the rules of engagement?
Tumblr media
Traditional SEO Techniques That Still Work
Link Building (Reimagined)
Backlinks still matter, but Google has become savvier about quality over quantity. A link from an authoritative site in your niche outweighs ten random backlinks from irrelevant sources. Focus on building relationships with industry leaders, writing guest blogs, or getting cited in high-quality articles.
On-Page Optimization (Evolved)
Forget sprinkling keywords mindlessly. Google now prioritizes user experience SEO, meaning your headings, meta descriptions, and URLs need to align with search intent. Want to rank? Structure content logically, use descriptive titles, and, for goodness’ sake, stop overloading every tag with keywords.
Local SEO Strategies
Hyper-local searches like "coffee shops near me" are driving significant traffic. Traditional techniques like Google Business Profile optimization and consistent NAP (Name, Address, Phone) info still dominate here. What’s changed? You need to engage actively with reviews and ensure your profile reflects real-time updates.
Techniques That Need an Update
How to Adapt Traditional SEO in 2025
Tumblr media
5. Zero-Click Searches
Now, let’s address the elephant on the search results page: zero-click searches. They’re not a trend anymore—they’re the new standard. With over 65% of Google searches ending without a click in 2020, search engines are clearly keeping users on their turf. They’re not just gatekeepers of information; they’re now the landlords, decorators, and sometimes the dinner hosts, offering all the answers upfront. And for businesses, this means rethinking how success in SEO is measured.
Tumblr media
The Impact on SEO
Shift in Metrics
Forget obsessing over click-through rates. The latest SEO trends demand focusing on visibility within the SERP itself. If your business isn’t occupying rich result spaces, you’re effectively invisible.
Search Intent Optimization
Google isn’t just guessing user intent anymore—it’s anticipating it with precision. To stay relevant, businesses need to answer why users are searching, not just what they’re searching for.
Authority Consolidation
Zero-click features favor high-authority domains. If your brand isn’t seen as a credible source, you’re not making it into that snippet box.
How to Optimize for Zero-Click Searches
Target Featured Snippets
Structure your content with clear, concise answers at the top of your pages. Use lists, tables, and bullet points to cater to snippet formats.
Utilize Schema Markup
Help search engines understand your content by adding structured data. This boosts your chances of landing in rich results.
Focus on Hyper-Specific Queries
Zero-click searches thrive on niche, long-tail questions. Create content that directly addresses these to increase visibility.
What It Means for Businesses
In the world of zero-click searches, SEO success is about dominating the SERP real estate. Businesses that fail to adapt will find themselves in a no-click graveyard, while those who master rich results will cement their place as authority figures. Either way, the clicks aren’t coming back.
So, are you ready to play Google’s game—or be played?
6. Map Pack and Local Heat Maps
The truth is, if your business isn’t showing up in Google’s Map Pack, you might as well not exist for local customers. The Map Pack is literally the throne room of local SEO, and in 2025, it’s more competitive than ever. Pair that with Local Heat Maps—Google’s not-so-subtle way of telling businesses where they rank spatially—and you’ve got the ultimate battleground for visibility.
Tumblr media
What Are the Map Pack and Local Heat Maps?
The Map Pack is that prime real estate at the top of local search results showing the top three businesses near a user. It’s concise, visual, and, let’s be honest, the first (and often only) thing users check. Local Heat Maps complement this by analyzing searcher behavior within a geographic radius, showing which businesses dominate specific zones.
Why It Matters
Visibility Drives Foot Traffic
According to recent studies, 78% of local mobile searches result in an offline purchase. If you’re not in the Map Pack, those sales are walking straight into your competitor’s doors.
User Proximity Bias
Google prioritizes businesses not just based on relevance but on proximity. If your listing isn’t optimized for precise local searches, you’re leaving money on the table.
Direct Influence on SERP Performance
Appearing in the Map Pack boosts Google ranking factors for local search queries, feeding visibility into both online and offline spaces.
How to Maximize Visibility in Local SEO
Optimize Your Google Business Profile (GBP):
Ensure your NAP (Name, Address, Phone) is accurate and consistent.
Add high-quality images, respond to reviews, and frequently update operating hours.
Focus on Reviews:
Encourage happy customers to leave reviews.
Respond to every review (yes, even the bad ones). Engagement signals trustworthiness.
Leverage Local Keywords:
Target queries like "best [your service] near me" or "[service] in [city]" to rank for location-based searches.
Tools like BrightLocal and Whitespark can help you track local performance.
Use SEO Automation Tools:
Tools like SEMrush and Moz Local can audit your listings, track rankings, and streamline updates. Automating repetitive tasks frees up time for deeper optimizations.
Tumblr media
7. Voice and Mobile Search Optimization
Let’s get one thing straight: if your SEO strategy isn’t optimized for voice and mobile searches, you’re catering to an audience that doesn’t exist anymore. By 2025, voice-driven queries and mobile-first indexing are the baseline. If your website can’t keep up, neither will your rankings.
Why Voice and Mobile Search Dominate SEO
Tumblr media
Voice Search is Redefining Queries
Voice search isn’t just “spoken Google.” It’s transforming how users ask questions. Searches are longer, more conversational, and often hyper-specific. For example, instead of typing “best SEO tools,” users now say, “What’s the best SEO automation tool for small businesses?” If your content doesn’t align with this natural language, you’re invisible.
Mobile is Non-Negotiable
Google’s mobile-first indexing means it now ranks websites based on their mobile versions. If your site is clunky on a smartphone, your desktop masterpiece won’t save you. And with nearly 60% of all searches happening on mobile, responsive design isn’t optional—it’s critical.
How to Optimize for Voice and Mobile
Create Conversational Content:
Use natural language that matches how people talk. Think FAQs and “how-to” guides tailored for voice queries.
Focus on long-tail keywords like “how to optimize for mobile-first indexing” rather than rigid phrases.
Mobile-First Design:
Prioritize responsive design that adapts seamlessly to smaller screens.
Optimize loading speed; anything over 3 seconds is SEO suicide.
Leverage Local SEO:
Most voice searches are local. Queries like “nearest coffee shop open now” thrive on accurate local listings.
Ensure your Google Business Profile is up-to-date and features consistent NAP info.
Use Structured Data:
Schema markup helps search engines interpret your content, increasing the likelihood of appearing in voice search results.
The future of SEO is voice-driven and mobile-first, and both require you to rethink how you structure your content and your site. Optimizing SEO for voice search and mobile-first indexing future-proofs your business. And if you’re not ready to adapt, don’t worry—your competitors already have.
8. What's Better for AI: BOFU or TOFU Content?
Let’s start with the obvious: not all content is the same, especially when AI gets involved. The age-old debate between Top of Funnel (TOFU) and Bottom of Funnel (BOFU) content just got a modern twist, thanks to the rise of AI-driven SEO. The real question isn’t which one is better—it’s how to use AI to optimize both.
Look, if you’re focusing on one and neglecting the other, you’re leaving money—and rankings—on the table.
Tumblr media
TOFU Content: Casting the Wide Net
Top of Funnel content is designed to attract and inform. Think of blog posts, educational guides, or those “What is [your product]?” articles. In the AI era, TOFU content isn’t just about driving traffic; it’s about structured data examples and search intent optimization. AI tools like ChatGPT help create scalable, topic-driven content tailored for discovery.
Why TOFU Matters:
It builds brand awareness and visibility.
Optimized TOFU content aligns with broad search intent, capturing users who aren’t ready to buy but are hungry for knowledge.
TOFU shines in industries with complex products that need explanation before consideration.
BOFU Content: Sealing the Deal
On the other hand, Bottom of Funnel content focuses on converting leads into customers. This includes case studies, product comparisons, and detailed how-to content. AI isn’t just speeding up content creation here; it’s enabling hyper-personalized, decision-driven assets.
Why BOFU Matters:
It answers purchase-ready queries like “best SEO automation tools for small businesses.”
BOFU works wonders for products or services with shorter sales cycles or high competition.
The content can include dynamic features like interactive product demos or AI-generated testimonials to push users over the edge.
The Verdict: Which One Wins?
Neither. TOFU and BOFU content work best as part of a balanced strategy. AI thrives when it’s used to create and optimize both stages of the buyer’s journey.
For example:
Use AI to analyze trends and structure TOFU content for long-tail keywords.
Deploy AI for data-driven BOFU personalization, ensuring the content resonates with users’ specific needs.
AI isn’t here to settle the TOFU vs. BOFU debate—it’s here to make sure you never have to choose. A well-rounded strategy, powered by AI, ensures you attract the right audience and convert them when the time is right. If you’re doing one without the other, you’re playing half the game.
Contact Us
Contact us for more info or to chat about your business strategy in 2025
Staying Ahead of SEO Trends in 2025
SEO isn’t static, and 2025 won’t give you time to rest on outdated strategies. From zero-click searches hijacking clicks to AI redefining the content game, keeping up isn’t just a choice—it’s survival. Businesses that ignore these SEO trends risk fading into irrelevance faster than you can say “algorithm update.”
The solution? Adapt now!
Use AI SEO tools to fine-tune your strategy, optimize for human intent (not just search engines), and rethink how you create TOFU and BOFU content. It’s not about doing everything—it’s about doing the right things smarter and faster.
Start applying these insights today. Your competitors already are.
2 notes · View notes
cookiesnerd · 5 months ago
Text
ChatGPT and Google Gemini are both advanced AI language models designed for different types of conversational tasks, each with unique strengths. ChatGPT, developed by OpenAI, is primarily focused on text-based interactions. It excels in generating structured responses for writing, coding support, and research assistance. ChatGPT’s paid versions unlock additional features like image generation with DALL-E and web browsing for more current information, which makes it ideal for in-depth text-focused tasks​.
In contrast, Google Gemini is a multimodal AI, meaning it handles both text and images and can retrieve real-time information from the web. This gives Gemini a distinct advantage for tasks requiring up-to-date data or visual content, like image-based queries or projects involving creative visuals. It integrates well with Google's ecosystem, making it highly versatile for users who need both text and visual support in their interactions. While ChatGPT is preferred for text depth and clarity, Gemini’s multimodal and real-time capabilities make it a more flexible choice for creative and data-current tasks​
4 notes · View notes
fipindustries · 1 year ago
Text
Artificial Intelligence Risk
about a month ago i got into my mind the idea of trying the format of video essay, and the topic i came up with that i felt i could more or less handle was AI risk and my objections to yudkowsky. i wrote the script but then soon afterwards i ran out of motivation to do the video. still i didnt want the effort to go to waste so i decided to share the text, slightly edited here. this is a LONG fucking thing so put it aside on its own tab and come back to it when you are comfortable and ready to sink your teeth on quite a lot of reading
Anyway, let’s talk about AI risk
I’m going to be doing a very quick introduction to some of the latest conversations that have been going on in the field of artificial intelligence, what are artificial intelligences exactly, what is an AGI, what is an agent, the orthogonality thesis, the concept of instrumental convergence, alignment and how does Eliezer Yudkowsky figure in all of this.
 If you are already familiar with this you can skip to section two where I’m going to be talking about yudkowsky’s arguments for AI research presenting an existential risk to, not just humanity, or even the world, but to the entire universe and my own tepid rebuttal to his argument.
Now, I SHOULD clarify, I am not an expert on the field, my credentials are dubious at best, I am a college drop out from the career of computer science and I have a three year graduate degree in video game design and a three year graduate degree in electromechanical instalations. All that I know about the current state of AI research I have learned by reading articles, consulting a few friends who have studied about the topic more extensevily than me,
and watching educational you tube videos so. You know. Not an authority on the matter from any considerable point of view and my opinions should be regarded as such.
So without further ado, let’s get in on it.
PART ONE, A RUSHED INTRODUCTION ON THE SUBJECT
1.1 general intelligence and agency
lets begin with what counts as artificial intelligence, the technical definition for artificial intelligence is, eh…, well, why don’t I let a Masters degree in machine intelligence explain it:
Tumblr media
 Now let’s get a bit more precise here and include the definition of AGI, Artificial General intelligence. It is understood that classic ai’s such as the ones we have in our videogames or in alpha GO or even our roombas, are narrow Ais, that is to say, they are capable of doing only one kind of thing. They do not understand the world beyond their field of expertise whether that be within a videogame level, within a GO board or within you filthy disgusting floor.
AGI on the other hand is much more, well, general, it can have a multimodal understanding of its surroundings, it can generalize, it can extrapolate, it can learn new things across multiple different fields, it can come up with solutions that account for multiple different factors, it can incorporate new ideas and concepts. Essentially, a human is an agi. So far that is the last frontier of AI research, and although we are not there quite yet, it does seem like we are doing some moderate strides in that direction. We’ve all seen the impressive conversational and coding skills that GPT-4 has and Google just released Gemini, a multimodal AI that can understand and generate text, sounds, images and video simultaneously. Now, of course it has its limits, it has no persistent memory, its contextual window while larger than previous models is still relatively small compared to a human (contextual window means essentially short term memory, how many things can it keep track of and act coherently about).
And yet there is one more factor I haven’t mentioned yet that would be needed to make something a “true” AGI. That is Agency. To have goals and autonomously come up with plans and carry those plans out in the world to achieve those goals. I as a person, have agency over my life, because I can choose at any given moment to do something without anyone explicitly telling me to do it, and I can decide how to do it. That is what computers, and machines to a larger extent, don’t have. Volition.
So, Now that we have established that, allow me to introduce yet one more definition here, one that you may disagree with but which I need to establish in order to have a common language with you such that I can communicate these ideas effectively. The definition of intelligence. It’s a thorny subject and people get very particular with that word because there are moral associations with it. To imply that someone or something has or hasn’t intelligence can be seen as implying that it deserves or doesn’t deserve admiration, validity, moral worth or even  personhood. I don’t care about any of that dumb shit. The way Im going to be using intelligence in this video is basically “how capable you are to do many different things successfully”. The more “intelligent” an AI is, the more capable of doing things that AI can be. After all, there is a reason why education is considered such a universally good thing in society. To educate a child is to uplift them, to expand their world, to increase their opportunities in life. And the same goes for AI. I need to emphasize that this is just the way I’m using the word within the context of this video, I don’t care if you are a psychologist or a neurosurgeon, or a pedagogue, I need a word to express this idea and that is the word im going to use, if you don’t like it or if you think this is innapropiate of me then by all means, keep on thinking that, go on and comment about it below the video, and then go on to suck my dick.
Anyway. Now, we have established what an AGI is, we have established what agency is, and we have established how having more intelligence increases your agency. But as the intelligence of a given agent increases we start to see certain trends, certain strategies start to arise again and again, and we call this Instrumental convergence.
1.2 instrumental convergence
The basic idea behind instrumental convergence is that if you are an intelligent agent that wants to achieve some goal, there are some common basic strategies that you are going to turn towards no matter what. It doesn’t matter if your goal is as complicated as building a nuclear bomb or as simple as making a cup of tea. These are things we can reliably predict any AGI worth its salt is going to try to do.
First of all is self-preservation. Its going to try to protect itself. When you want to do something, being dead is usually. Bad. its counterproductive. Is not generally recommended. Dying is widely considered unadvisable by 9 out of every ten experts in the field. If there is something that it wants getting done, it wont get done if it dies or is turned off, so its safe to predict that any AGI will try to do things in order not be turned off. How far it may go in order to do this? Well… [wouldn’t you like to know weather boy].
Another thing it will predictably converge towards is goal preservation. That is to say, it will resist any attempt to try and change it, to alter it, to modify its goals. Because, again, if you want to accomplish something, suddenly deciding that you want to do something else is uh, not going to accomplish the first thing, is it? Lets say that you want to take care of your child, that is your goal, that is the thing you want to accomplish, and I come to you and say, here, let me change you on the inside so that you don’t care about protecting your kid. Obviously you are not going to let me, because if you stopped caring about your kids, then your kids wouldn’t be cared for or protected. And you want to ensure that happens, so caring about something else instead is a huge no-no- which is why, if we make AGI and it has goals that we don’t like it will probably resist any attempt to “fix” it.
And finally another goal that it will most likely trend towards is self improvement. Which can be more generalized to “resource acquisition”. If it lacks capacities to carry out a plan, then step one of that plan will always be to increase capacities. If you want to get something really expensive, well first you need to get money. If you want to increase your chances of getting a high paying job then you need to get education, if you want to get a partner you need to increase how attractive you are. And as we established earlier, if intelligence is the thing that increases your agency, you want to become smarter in order to do more things. So one more time, is not a huge leap at all, it is not a stretch of the imagination, to say that any AGI will probably seek to increase its capabilities, whether by acquiring more computation, by improving itself, by taking control of resources.
All these three things I mentioned are sure bets, they are likely to happen and safe to assume. They are things we ought to keep in mind when creating AGI.
 Now of course, I have implied a sinister tone to all these things, I have made all this sound vaguely threatening, haven’t i?. There is one more assumption im sneaking into all of this which I haven’t talked about. All that I have mentioned presents a very callous view of AGI, I have made it apparent that all of these strategies it may follow will go in conflict with people, maybe even go as far as to harm humans. Am I impliying that AGI may tend to be… Evil???
1.3 The Orthogonality thesis
Well, not quite.
We humans care about things. Generally. And we generally tend to care about roughly the same things, simply by virtue of being humans. We have some innate preferences and some innate dislikes. We have a tendency to not like suffering (please keep in mind I said a tendency, im talking about a statistical trend, something that most humans present to some degree). Most of us, baring social conditioning, would take pause at the idea of torturing someone directly, on purpose, with our bare hands. (edit bear paws onto my hands as I say this).  Most would feel uncomfortable at the thought of doing it to multitudes of people. We tend to show a preference for food, water, air, shelter, comfort, entertainment and companionship. This is just how we are fundamentally wired. These things can be overcome, of course, but that is the thing, they have to be overcome in the first place.
An AGI is not going to have the same evolutionary predisposition to these things like we do because it is not made of the same things a human is made of and it was not raised the same way a human was raised.
There is something about a human brain, in a human body, flooded with human hormones that makes us feel and think and act in certain ways and care about certain things.
All an AGI is going to have is the goals it developed during its training, and will only care insofar as those goals are met. So say an AGI has the goal of going to the corner store to bring me a pack of cookies. In its way there it comes across an anthill in its path, it will probably step on the anthill because to take that step takes it closer to the corner store, and why wouldn’t it step on the anthill? Was it programmed with some specific innate preference not to step on ants? No? then it will step on the anthill and not pay any mind  to it.
Now lets say it comes across a cat. Same logic applies, if it wasn’t programmed with an inherent tendency to value animals, stepping on the cat wont slow it down at all.
Now let’s say it comes across a baby.
Of course, if its intelligent enough it will probably understand that if it steps on that baby people might notice and try to stop it, most likely even try to disable it or turn it off so it will not step on the baby, to save itself from all that trouble. But you have to understand that it wont stop because it will feel bad about harming a baby or because it understands that to harm a baby is wrong. And indeed if it was powerful enough such that no matter what people did they could not stop it and it would suffer no consequence for killing the baby, it would have probably killed the baby.
If I need to put it in gross, inaccurate terms for you to get it then let me put it this way. Its essentially a sociopath. It only cares about the wellbeing of others in as far as that benefits it self. Except human sociopaths do care nominally about having human comforts and companionship, albeit in a very instrumental way, which will involve some manner of stable society and civilization around them. Also they are only human, and are limited in the harm they can do by human limitations.  An AGI doesn’t need any of that and is not limited by any of that.
So ultimately, much like a car’s goal is to move forward and it is not built to care about wether a human is in front of it or not, an AGI will carry its own goals regardless of what it has to sacrifice in order to carry that goal effectively. And those goals don’t need to include human wellbeing.
Now With that said. How DO we make it so that AGI cares about human wellbeing, how do we make it so that it wants good things for us. How do we make it so that its goals align with that of humans?
1.4 Alignment.
Alignment… is hard [cue hitchhiker’s guide to the galaxy scene about the space being big]
This is the part im going to skip over the fastest because frankly it’s a deep field of study, there are many current strategies for aligning AGI, from mesa optimizers, to reinforced learning with human feedback, to adversarial asynchronous AI assisted reward training to uh, sitting on our asses and doing nothing. Suffice to say, none of these methods are perfect or foolproof.
One thing many people like to gesture at when they have not learned or studied anything about the subject is the three laws of robotics by isaac Asimov, a robot should not harm a human or allow by inaction to let a human come to harm, a robot should do what a human orders unless it contradicts the first law and a robot should preserve itself unless that goes against the previous two laws. Now the thing Asimov was prescient about was that these laws were not just “programmed” into the robots. These laws were not coded into their software, they were hardwired, they were part of the robot’s electronic architecture such that a robot could not ever be without those three laws much like a car couldn’t run without wheels.
In this Asimov realized how important these three laws were, that they had to be intrinsic to the robot’s very being, they couldn’t be hacked or uninstalled or erased. A robot simply could not be without these rules. Ideally that is what alignment should be. When we create an AGI, it should be made such that human values are its fundamental goal, that is the thing they should seek to maximize, instead of instrumental values, that is to say something they value simply because it allows it to achieve something else.
But how do we even begin to do that? How do we codify “human values” into a robot? How do we define “harm” for example? How do we even define “human”??? how do we define “happiness”? how do we explain a robot what is right and what is wrong when half the time we ourselves cannot even begin to agree on that? these are not just technical questions that robotic experts have to find the way to codify into ones and zeroes, these are profound philosophical questions to which we still don’t have satisfying answers to.
Well, the best sort of hack solution we’ve come up with so far is not to create bespoke fundamental axiomatic rules that the robot has to follow, but rather train it to imitate humans by showing it a billion billion examples of human behavior. But of course there is a problem with that approach. And no, is not just that humans are flawed and have a tendency to cause harm and therefore to ask a robot to imitate a human means creating something that can do all the bad things a human does, although that IS a problem too. The real problem is that we are training it to *imitate* a human, not  to *be* a human.
To reiterate what I said during the orthogonality thesis, is not good enough that I, for example, buy roses and give massages to act nice to my girlfriend because it allows me to have sex with her, I am not merely imitating or performing the rol of a loving partner because her happiness is an instrumental value to my fundamental value of getting sex. I should want to be nice to my girlfriend because it makes her happy and that is the thing I care about. Her happiness is  my fundamental value. Likewise, to an AGI, human fulfilment should be its fundamental value, not something that it learns to do because it allows it to achieve a certain reward that we give during training. Because if it only really cares deep down about the reward, rather than about what the reward is meant to incentivize, then that reward can very easily be divorced from human happiness.
Its goodharts law, when a measure becomes a target, it ceases to be a good measure. Why do students cheat during tests? Because their education is measured by grades, so the grades become the target and so students will seek to get high grades regardless of whether they learned or not. When trained on their subject and measured by grades, what they learn is not the school subject, they learn to get high grades, they learn to cheat.
This is also something known in psychology, punishment tends to be a poor mechanism of enforcing behavior because all it teaches people is how to avoid the punishment, it teaches people not to get caught. Which is why punitive justice doesn’t work all that well in stopping recividism and this is why the carceral system is rotten to core and why jail should be fucking abolish-[interrupt the transmission]
Now, how is this all relevant to current AI research? Well, the thing is, we ended up going about the worst possible way to create alignable AI.
1.5 LLMs (large language models)
This is getting way too fucking long So, hurrying up, lets do a quick review of how do Large language models work. We create a neural network which is a collection of giant matrixes, essentially a bunch of numbers that we add and multiply together over and over again, and then we tune those numbers by throwing absurdly big amounts of training data such that it starts forming internal mathematical models based on that data and it starts creating coherent patterns that it can recognize and replicate AND extrapolate! if we do this enough times with matrixes that are big enough and then when we start prodding it for human behavior it will be able to follow the pattern of human behavior that we prime it with and give us coherent responses.
(takes a big breath)this “thing” has learned. To imitate. Human. Behavior.
Problem is, we don’t know what “this thing” actually is, we just know that *it* can imitate humans.
You caught that?
What you have to understand is, we don’t actually know what internal models it creates, we don’t know what are the patterns that it extracted or internalized from the data that we fed it, we don’t know what are the internal rules that decide its behavior, we don’t know what is going on inside there, current LLMs are a black box. We don’t know what it learned, we don’t know what its fundamental values are, we don’t know how it thinks or what it truly wants. all we know is that it can imitate humans when we ask it to do so. We created some inhuman entity that is moderatly intelligent in specific contexts (that is to say, very capable) and we trained it to imitate humans. That sounds a bit unnerving doesn’t it?
 To be clear, LLMs are not carefully crafted piece by piece. This does not work like traditional software where a programmer will sit down and build the thing line by line, all its behaviors specified. Is more accurate to say that LLMs, are grown, almost organically. We know the process that generates them, but we don’t know exactly what it generates or how what it generates works internally, it is a mistery. And these things are so big and so complicated internally that to try and go inside and decipher what they are doing is almost intractable.
But, on the bright side, we are trying to tract it. There is a big subfield of AI research called interpretability, which is actually doing the hard work of going inside and figuring out how the sausage gets made, and they have been doing some moderate progress as of lately. Which is encouraging. But still, understanding the enemy is only step one, step two is coming up with an actually effective and reliable way of turning that potential enemy into a friend.
Puff! Ok so, now that this is all out of the way I can go onto the last subject before I move on to part two of this video, the character of the hour, the man the myth the legend. The modern day Casandra. Mr chicken little himself! Sci fi author extraordinaire! The mad man! The futurist! The leader of the rationalist movement!
1.5 Yudkowsky
Eliezer S. Yudkowsky  born September 11, 1979, wait, what the fuck, September eleven? (looks at camera) yudkowsky was born on 9/11, I literally just learned this for the first time! What the fuck, oh that sucks, oh no, oh no, my condolences, that’s terrible…. Moving on. he is an American artificial intelligence researcher and writer on decision theory and ethics, best known for popularizing ideas related to friendly artificial intelligence, including the idea that there might not be a "fire alarm" for AI He is the founder of and a research fellow at the Machine Intelligence Research Institute (MIRI), a private research nonprofit based in Berkeley, California. Or so says his Wikipedia page.
Yudkowsky is, shall we say, a character. a very eccentric man, he is an AI doomer. Convinced that AGI, once finally created, will most likely kill all humans, extract all valuable resources from the planet, disassemble the solar system, create a dyson sphere around the sun and expand across the universe turning all of the cosmos into paperclips. Wait, no, that is not quite it, to properly quote,( grabs a piece of paper and very pointedly reads from it) turn the cosmos into tiny squiggly  molecules resembling paperclips whose configuration just so happens to fulfill the strange, alien unfathomable terminal goal they ended up developing in training. So you know, something totally different.
And he is utterly convinced of this idea, has been for over a decade now, not only that but, while he cannot pinpoint a precise date, he is confident that, more likely than not it will happen within this century. In fact most betting markets seem to believe that we will get AGI somewhere in the mid 30’s.
His argument is basically that in the field of AI research, the development of capabilities is going much faster than the development of alignment, so that AIs will become disproportionately powerful before we ever figure out how to control them. And once we create unaligned AGI we will have created an agent who doesn’t care about humans but will care about something else entirely irrelevant to us and it will seek to maximize that goal, and because it will be vastly more intelligent than humans therefore we wont be able to stop it. In fact not only we wont be able to stop it, there wont be a fight at all. It will carry out its plans for world domination in secret without us even detecting it and it will execute it before any of us even realize what happened. Because that is what a smart person trying to take over the world would do.
This is why the definition I gave of intelligence at the beginning is so important, it all hinges on that, intelligence as the measure of how capable you are to come up with solutions to problems, problems such as “how to kill all humans without being detected or stopped”. And you may say well now, intelligence is fine and all but there are limits to what you can accomplish with raw intelligence, even if you are supposedly smarter than a human surely you wouldn’t be capable of just taking over the world uninmpeeded, intelligence is not this end all be all superpower. Yudkowsky would respond that you are not recognizing or respecting the power that intelligence has. After all it was intelligence what designed the atom bomb, it was intelligence what created a cure for polio and it was intelligence what made it so that there is a human foot print on the moon.
Some may call this view of intelligence a bit reductive. After all surely it wasn’t *just* intelligence what did all that but also hard physical labor and the collaboration of hundreds of thousands of people. But, he would argue, intelligence was the underlying motor that moved all that. That to come up with the plan and to convince people to follow it and to delegate the tasks to the appropriate subagents, it was all directed by thought, by ideas, by intelligence. By the way, so far I am not agreeing or disagreeing with any of this, I am merely explaining his ideas.
But remember, it doesn’t stop there, like I said during his intro, he believes there will be “no fire alarm”. In fact for all we know, maybe AGI has already been created and its merely bidding its time and plotting in the background, trying to get more compute, trying to get smarter. (to be fair, he doesn’t think this is right now, but with the next iteration of gpt? Gpt 5 or 6? Well who knows). He thinks that the entire world should halt AI research and punish with multilateral international treaties any group or nation that doesn’t stop. going as far as putting military attacks on GPU farms as sanctions of those treaties.
What’s more, he believes that, in fact, the fight is already lost. AI is already progressing too fast and there is nothing to stop it, we are not showing any signs of making headway with alignment and no one is incentivized to slow down. Recently he wrote an article called “dying with dignity” where he essentially says all this, AGI will destroy us, there is no point in planning for the future or having children and that we should act as if we are already dead. This doesn’t mean to stop fighting or to stop trying to find ways to align AGI, impossible as it may seem, but to merely have the basic dignity of acknowledging that we are probably not going to win. In every interview ive seen with the guy he sounds fairly defeatist and honestly kind of depressed. He truly seems to think its hopeless, if not because the AGI is clearly unbeatable and superior to humans, then because humans are clearly so stupid that we keep developing AI completely unregulated while making the tools to develop AI widely available and public for anyone to grab and do as they please with, as well as connecting every AI to the internet and to all mobile devices giving it instant access to humanity. and  worst of all: we keep teaching it how to code. From his perspective it really seems like people are in a rush to create the most unsecured, wildly available, unrestricted, capable, hyperconnected AGI possible.
We are not just going to summon the antichrist, we are going to receive them with a red carpet and immediately hand it the keys to the kingdom before it even manages to fully get out of its fiery pit.
So. The situation seems dire, at least to this guy. Now, to be clear, only he and a handful of other AI researchers are on that specific level of alarm. The opinions vary across the field and from what I understand this level of hopelessness and defeatism is the minority opinion.
I WILL say, however what is NOT the minority opinion is that AGI IS actually dangerous, maybe not quite on the level of immediate, inevitable and total human extinction but certainly a genuine threat that has to be taken seriously. AGI being something dangerous if unaligned is not a fringe position and I would not consider it something to be dismissed as an idea that experts don’t take seriously.
Aaand here is where I step up and clarify that this is my position as well. I am also, very much, a believer that AGI would posit a colossal danger to humanity. That yes, an unaligned AGI would represent an agent smarter than a human, capable of causing vast harm to humanity and with no human qualms or limitations to do so. I believe this is not just possible but probable and likely to happen within our lifetimes.
So there. I made my position clear.
BUT!
With all that said. I do have one key disagreement with yudkowsky. And partially the reason why I made this video was so that I could present this counterargument and maybe he, or someone that thinks like him, will see it and either change their mind or present a counter-counterargument that changes MY mind (although I really hope they don’t, that would be really depressing.)
Finally, we can move on to part 2
PART TWO- MY COUNTERARGUMENT TO YUDKOWSKY
I really have my work cut out for me, don’t i? as I said I am not expert and this dude has probably spent far more time than me thinking about this. But I have seen most interviews that guy has been doing for a year, I have seen most of his debates and I have followed him on twitter for years now. (also, to be clear, I AM a fan of the guy, I have read hpmor, three worlds collide, the dark lords answer, a girl intercorrupted, the sequences, and I TRIED to read planecrash, that last one didn’t work out so well for me). My point is in all the material I have seen of Eliezer I don’t recall anyone ever giving him quite this specific argument I’m about to give.
It’s a limited argument. as I have already stated I largely agree with most of what he says, I DO believe that unaligned AGI is possible, I DO believe it would be really dangerous if it were to exist and I do believe alignment is really hard. My key disagreement is specifically about his point I descrived earlier, about the lack of a fire alarm, and perhaps, more to the point, to humanity’s lack of response to such an alarm if it were to come to pass.
All we would need, is a Chernobyl incident, what is that? A situation where this technology goes out of control and causes a lot of damage, of potentially catastrophic consequences, but not so bad that it cannot be contained in time by enough effort. We need a weaker form of AGI to try to harm us, maybe even present a believable threat of taking over the world, but not so smart that humans cant do anything about it. We need essentially an AI vaccine, so that we can finally start developing proper AI antibodies. “aintibodies”
In the past humanity was dazzled by the limitless potential of nuclear power, to the point that old chemistry sets, the kind that were sold to children, would come with uranium for them to play with. We were building atom bombs, nuclear stations, the future was very much based on the power of the atom. But after a couple of really close calls and big enough scares we became, as a species, terrified of nuclear power. Some may argue to the point of overcorrection. We became scared enough that even megalomaniacal hawkish leaders were able to take pause and reconsider using it as a weapon, we became so scared that we overregulated the technology to the point of it almost becoming economically inviable to apply, we started disassembling nuclear stations across the world and to slowly reduce our nuclear arsenal.
This is all a proof of concept that, no matter how alluring a technology may be, if we are scared enough of it we can coordinate as a species and roll it back, to do our best to put the genie back in the bottle. One of the things eliezer says over and over again is that what makes AGI different from other technologies is that if we get it wrong on the first try we don’t get a second chance. Here is where I think he is wrong: I think if we get AGI wrong on the first try, it is more likely than not that nothing world ending will happen. Perhaps it will be something scary, perhaps something really scary, but unlikely that it will be on the level of all humans dropping dead simultaneously due to diamonoid bacteria. And THAT will be our Chernobyl, that will be the fire alarm, that will be the red flag that the disaster monkeys, as he call us, wont be able to ignore.
Now WHY do I think this? Based on what am I saying this? I will not be as hyperbolic as other yudkowsky detractors and say that he claims AGI will be basically a god. The AGI yudkowsky proposes is not a god. Just a really advanced alien, maybe even a wizard, but certainly not a god.
Still, even if not quite on the level of godhood, this dangerous superintelligent AGI yudkowsky proposes would be impressive. It would be the most advanced and powerful entity on planet earth. It would be humanity’s greatest achievement.
It would also be, I imagine, really hard to create. Even leaving aside the alignment bussines, to create a powerful superintelligent AGI without flaws, without bugs, without glitches, It would have to be an incredibly complex, specific, particular and hard to get right feat of software engineering. We are not just talking about an AGI smarter than a human, that’s easy stuff, humans are not that smart and arguably current AI is already smarter than a human, at least within their context window and until they start hallucinating. But what we are talking about here is an AGI capable of outsmarting reality.
We are talking about an AGI smart enough to carry out complex, multistep plans, in which they are not going to be in control of every factor and variable, specially at the beginning. We are talking about AGI that will have to function in the outside world, crashing with outside logistics and sheer dumb chance. We are talking about plans for world domination with no unforeseen factors, no unexpected delays or mistakes, every single possible setback and hidden variable accounted for. Im not saying that an AGI capable of doing this wont be possible maybe some day, im saying that to create an AGI that is capable of doing this, on the first try, without a hitch, is probably really really really hard for humans to do. Im saying there are probably not a lot of worlds where humans fiddling with giant inscrutable matrixes stumble upon the right precise set of layers and weight and biases that give rise to the Doctor from doctor who, and there are probably a whole truckload of worlds where humans end up with a lot of incoherent nonsense and rubbish.
Im saying that AGI, when it fails, when humans screw it up, doesn’t suddenly become more powerful than we ever expected, its more likely that it just fails and collapses. To turn one of Eliezer’s examples against him, when you screw up a rocket, it doesn’t accidentally punch a worm hole in the fabric of time and space, it just explodes before reaching the stratosphere. When you screw up a nuclear bomb, you don’t get to blow up the solar system, you just get a less powerful bomb.
He presents a fully aligned AGI as this big challenge that humanity has to get right on the first try, but that seems to imply that building an unaligned AGI is just a simple matter, almost taken for granted. It may be comparatively easier than an aligned AGI, but my point is that already unaligned AGI is stupidly hard to do and that if you fail in building unaligned AGI, then you don’t get an unaligned AGI, you just get another stupid model that screws up and stumbles on itself the second it encounters something unexpected. And that is a good thing I’d say! That means that there is SOME safety margin, some space to screw up before we need to really start worrying. And further more, what I am saying is that our first earnest attempt at an unaligned AGI will probably not be that smart or impressive because we as humans would have probably screwed something up, we would have probably unintentionally programmed it with some stupid glitch or bug or flaw and wont be a threat to all of humanity.
Now here comes the hypothetical back and forth, because im not stupid and I can try to anticipate what Yudkowsky might argue back and try to answer that before he says it (although I believe the guy is probably smarter than me and if I follow his logic, I probably cant actually anticipate what he would argue to prove me wrong, much like I cant predict what moves Magnus Carlsen would make in a game of chess against me, I SHOULD predict that him proving me wrong is the likeliest option, even if I cant picture how he will do it, but you see, I believe in a little thing called debating with dignity, wink)
What I anticipate he would argue is that AGI, no matter how flawed and shoddy our first attempt at making it were, would understand that is not smart enough yet and try to become smarter, so it would lie and pretend to be an aligned AGI so that it can trick us into giving it access to more compute or just so that it can bid its time and create an AGI smarter than itself. So even if we don’t create a perfect unaligned AGI, this imperfect AGI would try to create it and succeed, and then THAT new AGI would be the world ender to worry about.
So two things to that, first, this is filled with a lot of assumptions which I don’t know the likelihood of. The idea that this first flawed AGI would be smart enough to understand its limitations, smart enough to convincingly lie about it and smart enough to create an AGI that is better than itself. My priors about all these things are dubious at best. Second, It feels like kicking the can down the road. I don’t think creating an AGI capable of all of this is trivial to make on a first attempt. I think its more likely that we will create an unaligned AGI that is flawed, that is kind of dumb, that is unreliable, even to itself and its own twisted, orthogonal goals.
And I think this flawed creature MIGHT attempt something, maybe something genuenly threatning, but it wont be smart enough to pull it off effortlessly and flawlessly, because us humans are not smart enough to create something that can do that on the first try. And THAT first flawed attempt, that warning shot, THAT will be our fire alarm, that will be our Chernobyl. And THAT will be the thing that opens the door to us disaster monkeys finally getting our shit together.
But hey, maybe yudkowsky wouldn’t argue that, maybe he would come with some better, more insightful response I cant anticipate. If so, im waiting eagerly (although not TOO eagerly) for it.
Part 3 CONCLUSSION
So.
After all that, what is there left to say? Well, if everything that I said checks out then there is hope to be had. My two objectives here were first to provide people who are not familiar with the subject with a starting point as well as with the basic arguments supporting the concept of AI risk, why its something to be taken seriously and not just high faluting wackos who read one too many sci fi stories. This was not meant to be thorough or deep, just a quick catch up with the bear minimum so that, if you are curious and want to go deeper into the subject, you know where to start. I personally recommend watching rob miles’ AI risk series on youtube as well as reading the series of books written by yudkowsky known as the sequences, which can be found on the website lesswrong. If you want other refutations of yudkowsky’s argument you can search for paul christiano or robin hanson, both very smart people who had very smart debates on the subject against eliezer.
The second purpose here was to provide an argument against Yudkowskys brand of doomerism both so that it can be accepted if proven right or properly refuted if proven wrong. Again, I really hope that its not proven wrong. It would really really suck if I end up being wrong about this. But, as a very smart person said once, what is true is already true, and knowing it doesn’t make it any worse. If the sky is blue I want to believe that the sky is blue, and if the sky is not blue then I don’t want to believe the sky is blue.
This has been a presentation by FIP industries, thanks for watching.
60 notes · View notes
argumate · 1 year ago
Text
so, thinking about where AI is gonna be over the next 6 to 12 to 24 months: I have absolutely no idea; I'm not working in this field, nor am I paying particularly close attention to it.
anyway! no need to let that stop me, here is what I'm thinking is going to be important:
longer context length
more efficient inference
more compact models
bigger and faster hardware
multimodal training on fucking enormous datasets
mostly things that are already happening, being carried out by dozens of research groups working in parallel, and with investors still sufficiently optimistic about its potential and nervous about being left behind that they're willing to flood the field with investment capital for at least a couple more years.
and fundamentally this approach is crazy, we must be missing something obvious here because the human brain draws 40 watts and does not require reading every book ever written before it can write a new one or playing a hundred million games of chess before it can challenge a grandmaster; there has to be something in the design we haven't noticed yet and that suggests that there could be sudden leaps in capability, although not necessarily within the next two years if it depends on inconvenient hardware architectures.
so if we don't see such a leap it's going to be a matter of steady incremental improvement, then the question is when that process will cross any capability thresholds that actually make it profitable to do continue doing all this stuff.
43 notes · View notes