The biggest / funniest Google disconnect that I know about: I just ran across a site that runs any still photo you submit through the Google Vision API, which uses image recognition and/or Gemini (the API has a zillion options, and it's hard to tell which of them the site is using) to infer emotion, setting, context, likely income, likely politics, and the best marketing products and angles.
Some friends and I tried it, and it was surprisingly good and accurate.
The thing I found most interesting about it was that in a decade-plus of using Google products, I've basically never seen a relevant ad.
So more than ten years of emails, documents, video meetings, spreadsheets, social graph analysis, and whatever else got them nothing, but a single still photo run through Vision / Gemini produced a much better segmentation in one shot?? Crazy.
But if you're interested in trying it too, here's the link (a rough sketch of the kind of API call involved follows below). No idea what they do with your photo and the inferred data; I assume they collect it in a database for marketing, or something even more nefarious.
https://theyseeyourphotos.com/
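For the technically curious: the inference step is probably nothing more exotic than a single multimodal call. A minimal sketch, assuming the google-generativeai Python SDK; the model name, prompt, and filename are my guesses, not what the site actually runs:

```python
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder key
model = genai.GenerativeModel("gemini-1.5-flash")  # assumed model choice

photo = Image.open("portrait.jpg")  # any still photo
prompt = (
    "Describe this photo's subject: apparent emotion, setting, context, "
    "and plausible marketing segments. Answer as JSON."
)

# One photo plus one prompt, sent in a single multimodal request.
response = model.generate_content([prompt, photo])
print(response.text)  # single-shot structured guesses about the subject
```

Point being: one photo, one prompt, and you get a segmentation that Google's ad stack apparently never managed from a decade of data.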
Great case in point. A perfectly lovely little product, flashes of genius, left to collect dust and inevitably get sunset at some point ...
Right, and not just left to collect dust, but it's strictly better *at their core competency,* by some absurd factor (5x - 100x), and is hidden away in some tiny corner of "API services" space not being used by anyone.
That's just "Google in a nutshell," to me.
Bureaucracy is a way bigger brake on LLM development than most people realize. Particularly when it’s political.
I did chuckle to myself when I wrote big bureaucracy smell
Part of the issue for a while was how bespoke and finicky all their APIs were with Vertex/Gemini, so they got basically NO developer adopters touting them. But Gemini 2 has been great for me: it's more than smart enough to handle the agentic system I'm building and the choices it has to make to accommodate user requests, and it's cheap enough that I can use it without thinking as much about conversation length as I do with Claude.
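For anyone curious what that looks like in practice, the SDK will even run the tool loop for you. A rough sketch, assuming the google-generativeai package and Gemini 2.0 Flash; the tools here are hypothetical stubs, not my actual system:

```python
import google.generativeai as genai

# Hypothetical tool stubs the model can choose between to serve a request.
def search_orders(customer_id: str) -> str:
    """Look up a customer's recent orders (stubbed for this sketch)."""
    return f"orders for customer {customer_id}: [order_17, order_42]"

def issue_refund(order_id: str) -> str:
    """Issue a refund for a given order (stubbed for this sketch)."""
    return f"refund issued for {order_id}"

genai.configure(api_key="YOUR_API_KEY")  # placeholder
model = genai.GenerativeModel(
    "gemini-2.0-flash",
    tools=[search_orders, issue_refund],  # SDK derives declarations from signatures
)

# With automatic function calling, the SDK runs the agent loop itself:
# the model picks a tool, the SDK executes it and feeds the result back,
# repeating until the model produces a final text answer.
chat = model.start_chat(enable_automatic_function_calling=True)
reply = chat.send_message("Please refund the latest order for customer 42.")
print(reply.text)
```

The automatic mode keeps the loop out of your code; if you need finer control over each decision, you can disable it and handle the function-call turns yourself.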
They're good models!
Yes! I have a premium model, but I largely use it for images -- almost all the images on my Substack over the last few months have been Gemini-created. The OpenAI image models are bizarre and overly dramatic. Interestingly, in an environment where humanists like me and @HenryOliver are having conversations about AI taste, there isn't yet enough attention paid to visual capabilities. I can usually tell which model created which image. Yes, it took me many back-and-forths to get the image of Sisyphus working on his laptop while sitting on his rock; yes, it took equally many to get the image of a chip covering the sun in an eclipse from the perspective of a university, but they are both excellent images and worth the time spent.
Yeah, Imagen is very good. I had a preview, IIRC. Just, again, it doesn't get nearly the fanfare. I think even I forgot it existed!
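If you want to poke at it directly, image generation is exposed through the newer google-genai SDK. A minimal sketch, assuming an Imagen 3 model id; the prompt and filename are illustrative only:

```python
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

# Ask Imagen for one image; the model id is an assumption, check what's current.
result = client.models.generate_images(
    model="imagen-3.0-generate-002",
    prompt="Sisyphus on his boulder, working on a laptop, dramatic oil-painting style",
    config=types.GenerateImagesConfig(number_of_images=1),
)

# Write the returned image bytes straight to disk.
with open("sisyphus.png", "wb") as f:
    f.write(result.generated_images[0].image.image_bytes)
```

Bumping number_of_images gives you several candidates per round, which cuts down on the back-and-forth.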
If/when Gemini can put together a whole product from beginning to end, you're getting close to not even needing that product, you just talk to Gemini.
Maybe. But I can chat with Anthropic and get Artifacts... I can chat with GPT and use Code Interpreter and images. Just make it all work together!
Yes, that's the way it's going; for the better, one hopes.
Excellent points. I also found it interesting that Gemini is never at the top of benchmarks the way OpenAI's and Anthropic's models are. I do wonder if it's because Google's customer base is so large that a good model integrated into their ecosystem is a higher priority than having the "best" model on the market.
Gemini kept coming out on top occasionally, but it gets shrugged off, a couple of months pass, and it's forgotten. The ones that have broken through are NotebookLM and Flash's cost efficiency...