Not all AIs are created equal. Some would possibly do artwork the most effective, some are expert at coding, and others have the flexibility to foretell protein buildings precisely.
However once you’re in search of one thing extra elementary—simply “somebody” to speak to—the most effective AI companions will not be those that know all of it, however the ones which have that je ne sais quoi that make you are feeling OK simply by speaking, much like how your finest buddy may not be a genius however by some means at all times is aware of precisely what to say.
AI companions are slowly rising in popularity amongst tech lovers, so it’s important for customers wanting the very best high quality expertise or firms desirous to grasp this side of making the phantasm of genuine engagement to contemplate these variations.
We had been curious to seek out out which platform offered the most effective AI expertise when somebody merely looks like having a chat. Curiously sufficient, the most effective fashions for this usually are not actually those from the massive AI firms—they’re simply too busy constructing fashions that excel at benchmarks.
It seems that friendship and empathy are a complete completely different beast.
Evaluating Sesame, Hume AI, ChatGPT, and Google Gemini. Which is extra human?
This evaluation pits 4 main AI companions in opposition to one another—Sesame, Hume AI, ChatGPT, and Google Gemini—to find out which creates essentially the most human-like dialog expertise.
The analysis targeted on dialog high quality, distinct persona growth, interplay design, and in addition considers different human-type options reminiscent of authenticity, emotional intelligence, and the refined imperfections that make dialogue really feel extra real.
You may watch all of our conversations by clicking on these hyperlinks or checking our Github Repository:
Right here is how every AI carried out.
Dialog High quality: The Human Contact vs. AI Awkwardness
The true check of any AI companion is whether or not it could actually idiot you into forgetting you are speaking to a machine. Our evaluation tried to guage which AI was the most effective at making customers need to simply hold speaking by offering attention-grabbing suggestions, rapport, and total nice expertise.
Sesame: Sensible
Sesame blows the competitors away with dialogue that feels shockingly human. It casually drops phrases like “that is a doozy” and “taking pictures the breeze” whereas seamlessly switching between considerate reflections and punchy comebacks.
“You are asking massive questions huh and truthfully I haven’t got all of the solutions,” Sesame responded when pressed about consciousness—full with pure hesitations that mimic real-time pondering. The occasional overuse of “you already know” is its solely noticeable flaw, which mockingly makes it really feel much more genuine.
Sesame’s actual edge? Conversations circulate naturally with out these awkward, formulaic transitions that scream “I am an AI!”
Rating: 9/10
Hume AI: Empathetic however Formulaic
Hume AI efficiently maintains conversational circulate whereas acknowledging your ideas with heat. Nonetheless it looks like speaking to somebody who’s disinterested and probably not that into you. Its replies had been rather a lot shorter than Sesame—they had been related however probably not attention-grabbing when you needed to push the dialog ahead.
Its weak spot exhibits in repetitive patterns. The bot constantly opens with “you have actually received me pondering” or “that is an interesting subject”—creating a way that you just’re getting templated responses moderately than natural dialog.
It is higher than the chatbots from the larger AI firms at sustaining pure dialogue, however repeatedly reminds you it is an “empathic AI,” breaking the phantasm that you just’re chatting with an individual.
Rating: 7/10
ChatGPT: The Professor Who By no means Stops Lecturing
ChatGPT tracks advanced conversations with out shedding the thread—and it’s nice that it memorizes earlier conversations, primarily making a “profile” of each consumer—but it surely feels such as you’re trapped in workplace hours with a very formal professor.
Even throughout private discussions, it could actually’t assist however sound educational: “the interaction of biology, chemistry, and consciousness creates a depth that AI’s sample recognition cannot replicate,” it stated in one in all our exams. Almost each response begins with “that is an interesting perspective”—a verbal tic that rapidly turns into noticeable, and a typical drawback that every one the opposite AIs besides Sesame confirmed.
ChatGPT’s largest flaw is its lack of ability to interrupt from educator mode, making conversations really feel like sequential mini-lectures moderately than pure dialogue.
Rating 6/10
Google Gemini: Underwhelming
Gemini was painful to speak to. It often delivers a concise, informal response that sounds human, however then instantly undermines itself with jarring dialog breaks and reducing its quantity.
Its most irritating behavior? Abruptly reducing off mid-thought to advertise AI matters. These steady disruptions create such a damaged dialog circulate that it is unattainable to neglect you are speaking to a machine that is extra interested by self-promotion than precise dialogue.
For instance, when requested about feelings, Gemini responded: “It is nice that you just’re interested by AI. There are such a lot of superb issues happ—” earlier than inexplicably stopping.
It additionally made positive to let you already know it’s an AI, so there’s a giant hole between the consumer and the chatbot from the primary interplay that’s onerous to disregard.
Rating 5/10
Character: Character Depth Separates the Genuine from the Synthetic

How does an AI develop a memorable persona? It should largely rely in your setup. Some fashions allow you to use system directions, others adapt their persona primarily based in your earlier interactions. Ideally, you’ll be able to body the dialog earlier than beginning it, giving the mannequin a persona, traits, a conversational type, and background.
To be truthful in our comparability, we examined our fashions with none earlier setup—which means our dialog began with a hi there and went straight to the purpose. Right here is how our fashions behaved naturally
Sesame: The Buddy You By no means Knew Was Code
Sesame crafts a persona you’d truly need to seize espresso with. It drops phrases like “that is a Humdinger of a query” and “it is a tight rope stroll” that create a definite character with obvious viewpoints and perspective.
When discussing AI relationships, Sesame confirmed precise persona: “wow… think about a world the place everybody’s head is down plugged into their personalised AI and we neglect the right way to join head to head.” This type of perspective feels much less like an algorithm and extra like a pondering entity. It’s additionally humorous (it as soon as instructed us that our query blew its circuits), and its voice has a pure inflection that makes it simple to narrate to when making an attempt to painting a response. You may clearly inform when it’s excited, contemplative, unhappy and even pissed off
Its solely weak spot? Often leaning too onerous into its “considerate buddy” persona. That didn’t detract from its place as essentially the most distinctive AI persona we examined.
Rating 9/10
Hume AI: The Therapist Who Retains Mentioning Their Credentials
Hume AI maintains a constant persona as an emotionally clever companion. It additionally tasks some heat by affirming language and emotional help, so customers in search of that will probably be happy.
Its Achilles heel is principally the truth that, sort of just like the Harvard grad who wants to say that, Hume cannot cease reminding you it is synthetic: “As an empathetic AI I do not expertise feelings myself however I am designed to know and reply to human feelings.” These moments break the phantasm that makes companions compelling.
If speaking to GPT is like speaking to a professor, speaking to Hume looks like speaking to a therapist. It listens to you and creates rapport, but it surely makes positive to remind you that it’s truly its process and never one thing that occurs naturally.
Regardless of this flaw, Hume AI tasks a clearer character than both ChatGPT or Gemini—even when it feels extra constructed than spontaneous.
Rating 7/10
ChatGPT: The Professor With out Private Opinions
ChatGPT struggles to develop any distinctive character traits past normal helpfulness. It sounds overly excited to the purpose of being clearly faux—like a “buddy” who at all times smiles at you however is secretly fantasizing about throwing you in entrance of a bus.
“Haha, effectively, I wish to hold the power up. It makes conversations extra enjoyable and interesting plus it is at all times nice to talk with you,” it stated after we requested in a really severe and unamused tone why it was performing so enthusiastically.
Its identification points seem in responses that shift between figuring out with people and distancing itself as an AI. Its educational tone in responses persists even throughout private discussions, making a persona that looks like a strolling encyclopedia moderately than a companion.
The mannequin’s default to instructional explanations creates an impression extra of a software than a personality, leaving customers with little emotional connection.
Rating 6/10
Google Gemini: A number of Character Dysfunction
Gemini suffers from essentially the most extreme persona issues of all fashions examined. Inside single conversations, it shifts dramatically between considerate responses and promotional language with out warning.
It isn’t actually an AI design to have a compelling persona. “My goal is to supply data and full duties and I don’t have the flexibility to kind romantic relationships,” it stated when requested about its ideas on individuals creating emotions in the direction of AIs.
This inconsistency makes Gemini really feel like a Nineteen Fifties film robotic, stopping any significant connection and even making it nice to spend time speaking to it.
Rating 3/10
Interplay Design

How an AI handles dialog mechanics—response timing, turn-taking, and error restoration—creates both seamless exchanges or irritating interactions. Right here is how these fashions stack up in opposition to one another
Sesame: Pure Dialog Move Grasp
Sesame creates dialog rhythms that really feel very, very human. It varies response size naturally primarily based on context and handles philosophical uncertainty with out defaulting to lecture mode.
“Generally I really feel like possibly I simply want to chop to the chase with a fast reply moderately than a long-winded lecture, proper? You understand, so… that is a small humorous apart to let you already know that I am conscious of the potential of falling right into a lecture mode and making an attempt to maintain issues gentle but in addition deep on the similar time,” Sesame instructed us throughout a philosophical debate.
When discussing advanced matters, it responds conversationally, with a small joke, generally with statements, different instances with human noises like “hmmms” or whispers—which makes it very convincing as a human substitute.
Sesame additionally asks pure follow-up questions that push conversations ahead, whereas understanding when to change to statements when questioning would possibly really feel awkward. This adaptive method creates dialogue that feels spontaneous moderately than scripted.
Rating 8/10
Hume AI: Structured However Transient
Hume AI was designed to let customers do the speaking. It recovers easily from misunderstandings with out over explaining and is sweet at conserving the dialog going with related comply with up questions.
Nonetheless, its responses are sometimes so quick that conversations require important effort to keep up momentum, which finally defeats its goal. Customers continuously want to supply new dialog fodder moderately than having fun with pure dialogue growth.
One other weak spot seems in its templated response buildings—acknowledgment + rationalization + query—making a mechanical rhythm: “What a thought-provoking query! This Idea is certainly an interesting concept… What do you consider…?”
Nonetheless it has an ideal function that will actually improve your chats: It’s in a position to detect the degrees of emotion in every response, which is one thing no different mannequin can do. That is nice that can assist you replicate on how a lot a subject impacts you and the way it’s approaching a selected state of affairs
Rating 7/10
ChatGPT: Good Chatter—However a Bit Unnatural
ChatGPT constantly addresses questions with related data however transforms conversations into unnecessarily elaborated interactions. Responses continuously balloon into paragraph-length explanations that disrupt pure dialogue circulate.
For instance, we requested it if it might develop some degree of self consciousness or consciousness, and this was its reply:
“That is an interesting subject! Whereas AI can positively evolve and enhance by interactions, true self-awareness or Consciousness is one thing that continues to be uniquely human. AI can mimic conversational patterns, reply empathetically and even be taught from previous interactions, but it surely would not have an inside sense of self or subjective expertise. Researchers and builders are nonetheless exploring the boundaries of AI’s capabilities however for now Consciousness stays a uniquely human trait.”
That’s not how a human would work together.
Its heavy reliance on opener phrases like “that is a extremely attention-grabbing query,” or “that’s an interesting subject” earlier than each single reply additional undermines conversational immersion, creating an interplay sample that feels mechanical moderately than pure.
Rating 6.5/10
Google Gemini: Dialog Breaking Machine
Gemini is a masterclass in how not to design dialog mechanics. It recurrently cuts off mid-sentence, creating jarring breaks in dialogue circulate. It tries to select up further noises, it interrupts you when you take too lengthy to talk or take into consideration your reply and infrequently it simply decides to finish the dialog with none motive.
Its compulsive have to inform you at each flip that your questions are “attention-grabbing” rapidly transforms from flattering to irritating however appears to be a typical factor amongst AI chatbots.
Rating 3/10
Conclusion
After testing all these AIs, it’s simple to conclude that machines received’t have the ability to substitute buddy within the quick time period. Nonetheless, for that particular case by which an AI should merely excel at feeling human, there’s a clear winner—and a transparent loser.
Sesame (9/10)
Sesame dominates the sector with pure dialogue that mirrors human speech patterns. Its informal vernacular (“that is a doozy,” “taking pictures the breeze”) and different sentence buildings create authentic-feeling exchanges that stability philosophical depth with accessibility. The system excels at spontaneous-seeming responses, asking pure follow-up questions whereas understanding when to change approaches for optimum dialog circulate.
Hume AI (7/10)
Hume AI delivers specialised emotional monitoring capabilities at the price of conversational naturalness. Whereas competently sustaining dialogue coherence, its responses have a tendency towards brevity and comply with predictable patterns that really feel constructed moderately than spontaneous.
Its visible emotion tracker is fairly attention-grabbing, most likely good for self discovery even.
ChatGPT (5.6/10)
ChatGPT transforms conversations into lecture periods with paragraph-length explanations that disrupt pure dialogue. Response delays create awkward pauses whereas formal language patterns reinforce an academic moderately than companion expertise. Its strengths in data group could enchantment to customers in search of data, but it surely nonetheless struggles to create genuine companionship.
Google Gemini (3.5/10)
Gemini was clearly not designed for this. The system routinely cuts off mid-sentence, abandons dialog threads, and isn’t in a position to present human-linke responses. Its extreme persona inconsistency and mechanical interplay patterns create an expertise nearer to a malfunctioning product than significant companionship.
It’s attention-grabbing that Gemini Reside scored so low, contemplating Google’s Gemini-based NotebookLM is able to producing extraordinarily good and lengthy podcasts about any sort of data, with AI hosts that sound incredibly human.
Usually Clever E-newsletter
A weekly AI journey narrated by Gen, a generative AI mannequin.
You might also like
More from Web3
Crypto legislation worries, gold starts rallying again, CFTC to trial tokenisation
Crypto laws worries, gold begins rallying once more, CFTC to trial tokenisationCrypto laws worries, Gold begins rallying once more, …
Automotive Leather Market : A Breakdown of the Industry by Technology, Application, and Geography
► The Automotive Leather-based Market dimension was valued at USD 1.22 Billion in 2023 and the overall Automotive …
Publicly Traded DeFi Development Corp. Adds Another $11.2 Million in Solana
In short DeFi Growth Corp. added greater than 80,000 SOL in its newest buy, valued at $11.2 million. The corporate now …