Hacker News new | past | comments | ask | show | jobs | submit login
Three areas where Google Search lags behind competitors: code, cooking, travel (surgehq.ai)
509 points by swethmandava 1 day ago | hide | past | favorite | 343 comments





There are some sites that are so useless and they have never ever helped me once, yet somehow they are always in the top 10 results

1. w3schools

2. pinterest

3. microsoft answers and all microsoft websites actually (it always looks like the person asking the question is asking exactly what i want, but unlike stackexchange, there are seldom any useful answers)

4. all the code clones for SO

5. all alternative to / review sites like capterra, g2, alternativeto, etc. They might have some good suggestions but they always hide the link to the software/site and instead link it to their spammy page. So you have to select part of the link and then re-search it on Google. Doing this for OSS projects can sometimes lead to a whole new rabbit hole.

6. The best of lists. Google for the love of god, please ban them.. they are always always SEO spam and product placements. Often the blog post itself says "to put your product on this list pay us a $1000 and we will include them in our list."

7. Quora and similar answer sites.. okay it's a mixed bag but you have to be very careful on these sites as most often the answers are just spam. I never read any answer with a link in it. But I think it's more Quora's problem than google. But if google is strict with them they may do a better job at moderating I guess. Also now quora hides answers and asks for a payment. Did they learn nothing from experts-exchange!


MSDN was really helpful once. I don't even know how they could destroy it relatively quickly. The current forums are indeed useless for the most part. I don't even know where to get deep technical information about MS tech stacks anymore.

> "please mark this issue as resolved so I can please my incompetent overlords with a point in a metric they are interested in because of the mentioned incompetence".

I agree that users tend to ask the right questions, but the lack of any coherent answer seems to be systematic.

Pinterest is spam if you don't have an account, which I would recommend to nobody because of such policies in the first place. Twitter did the same recently but at least it isn't as prominently featured in search results.

I don't know about w3schools. It did help me a few times. I guess because the content might be out of date? I am no web developer so I sometimes use it as a reference.

I think the worst aspect is that Google seems to favor news sites. Perhaps it is their SEO, but if a term you are looking up accidentally is part of a news article that was copied by tens of news outlets, you have to visit the dark net, page 2+ of Google results.


As many have mentioned, w3schools ranks high because it is actually used a lot more than MDN.

People love it because they don't have to read. Just copy and paste and you can get back to work. And that's fine. It's a nice thought that programmers should understand their code, but remember that article about 99% developers. They get work done, ideals only stand in the way.

The actual issue with w3schools is that the code that's available for copy-pasting is terrible. A lot of it looks like it either hasn't been updated since 90s or 00s, or at least whoever wrote it hasn't done web development since then. That's not necessarily bad, but for example JavaScript was much more of a pain back then, and you can now write much more maintainable code. Same for how HTML has progressed.

I wouldn't replace w3schools with some bleeding edge frontend practices, or even with the newest web standards, because those are even worse than using spaghetti code from the 90s. The former is way too complex for just copy-pasting, and the latter requires you to read about browser compatibility and other caveats, so it is harmful to just copy-paste it.

What I wish would replace w3schools, is a site that takes into account both why people love using w3schools, and still sets them on the right path to write maintainable code with at least some best practices considered.


I would add Stackoverflow for a very significant amount of queries. In particular in the domain of web development extremely outdated answers are heavily outranking far better answers. It shows how domain/link age is a strong factor in ranking, which is also why w3schools ranks highly.

That's bad, but I can somewhat sympathize with it as these sites are not really purposefully gaming SEO. It's rather Google's poor handling or relevancy and recenctness.

That's an entirely different situation from scammers like pinterest. They clone content without permission and then rank higher than the original. They broadcast content to be open to search engines yet when you visit, it's behind a login.

The behavior of duplicating content will normally tank your ranking. Faking content to not be behind a login where it really is behind a login, may get you fully delisted.

But not pinterest, they seem to get a free pass for anything.


Shameless plug for the Let's Block It project: we have a filter template to selectively remove domains from search results: https://letsblock.it/filters/search-results

For the Stackoverflow / Github clones, we partner with https://github.com/quenhus/uBlock-Origin-dev-filter and their lists are available as presets at the bottom of the filter configuration.


That's awesome and exactly what we need. Never any shame in plugging an open source and free project. Would love to see this as a standalone browser plugin some day. Maybe even making it easier to create and share custom lists

Thanks for the feedback! The site currently allows you to create one personal list that you can sync across all browsers, but I have plans to allow creating several lists and sharing some of them publicly

The main complexity will actually be in the frontend: how to make it easy to edit several instances, and check in what list you'll add a new filter?


> alternativeto

I actually find alternativeto so useful that I usually just go directly to it instead of my search engine when looking for FOSS alternatives. Having to go to the alternative page before clicking the official site link is a small hassle compared to finding what's the official from name search, but it used to be better when the description page and the alternative page were the same. If I were to guesa, they probably changed because it would be confusing for people searching for alternatives and finding the software description.


It's my go to place for alternatives, so if it shows up odds are I wanted that result.

I’d say it is that it is Microsoft’s faults that their answers sucks.

Most commonly asked questions you will be getting one of the three responses: 1. assume the user is at fault 2. it’s a “feature”, get over it 3. the official helper doesn’t know anyway so he/she just pasted a link to some random tutorial they found online which doesn’t work anyway

There really isn’t much that Google unless they kinda remove Microsoft at all.


4. Talk utter bollocks about something else.

I don’t understand the hate for w3schools. I found this site super useful when I was first learning web development.

pinterest is really the worst website ever, especially when you're looking for images.

don't they just steal content and then host it? pinterest is the worst when trying to prevent users to use their website when they're not logged in.


W3schools is wonderful, it's got great primers on a whole lot of languages and libraries with concise, clear explanations and examples.

It used to teach people bad coding practices and it had outdated information. It has improved but I am not ready to recommend it to anyone I like.

It has terribly outdated info today. MDN does everything it does but better and always up to date yet MDN almost always shows up after w3schools in my results

w3schools is a pretty good resource for very basic stuff. If you're a professional developer you're probably not going to find it useful but if you're recommending a website for an absolute beginner there are worse sites.

Why no just MDN then? From my experience it’s much better in 95%+ of cases.

Until you realize as a beginner that for some reason that is not enough and you are even more stuck

Maybe they have never been helpful to you, but w3schools, Microsoft answers, alternative websites, best of lists and Quora have all been helpful to me at some time.

You included MS Answers, but left out discussions.apple.com, which I find to be an even greater disappointment. This pair of $2T+ enterprises have failed so badly at something that should be table stakes.

How hard would it be to pay a small team to go through this tangled mess and clean it up and keep it that way? How little do they care about the attitudes of their users??


Google regresses to the mean - what do most people want to see on the top? just from comments on this thread, people both love and hate w3schools.

So I think there's a need for personalized search engines where one can pick their sources - we designed You.com keeping that in mind.

fwiw i hate w3schools too, if anyone's counting


Quora is the first site that I ended up blacklisting by DNS, even before rpz and pi-hole were a thing. I felt dirty every time I inadvertently ended up on that site.

Thanks for the idea.

> 5. all alternative to / review sites like capterra, g2, alternativeto, etc. They might have some good suggestions but they always hide the link to the software/site and instead link it to their spammy page. So you have to select part of the link and then re-search it on Google. Doing this for OSS projects can sometimes lead to a whole new rabbit hole.

Wow I've never had this pop up for me tbh. I wish alternativeto would come up more often, it's a really great resource and it's all crowd-sourced so it's very valuable information. Definitely wouldn't classify it as seo spam



Gah, why do we need another blocking addon? Because previously blocking addons have worked out really well. Why can't this have just been a uBlock Origin list?


Thanks! I saw this in a leaf comment of my original comment. Much better.

w3schools is good.

MDN is usually a much better resource.

It depends.

MDN has a better overall information and is more in depth. And more up-to-date.

On the other hand, w3schools has that one three-line CSS snippet that you need to copy-paste to center your DIV vertically or whatever, without 5 paragraphs of intro.


MDN is much more complicated. I almost always get my answer faster and easier to understand with w3schools. Plus quick little things like browser support and try it (which mdn also has now)

MDN is generally too technical.

As it should be. It's a resource for professional developers, not people who are learning how to code for the first time.

Professional developers enjoy simple and concise explanations just as much as people who are coding for the first time.

I think the problem is that w3schools _was_ terrible. It certainly seems to have improved, but it's reputation - amongst the older coders that I know, at least - is in the toilet

w3schools was useful when it first came out years ago. It's probably riding on its initial utility. I remember when the w3fools site was released later (https://www.w3fools.com/)

I remember W3schools being an excellent resource closer to the beginning of my career (a decade ago?), but now I barely ever enter. But I always thought it's me who outgrew it not that it became worse...

I feel like their site hasn't been overhauled in about 8 years.

Would add cplusplus.com to this list. Cppreference is just a better resource in every way.

g2 is actually used quite a lot by businesses looking for alternative service providers.

w3schools isnt perfect but they are god for html and css syntax.

Alternativeto is quite good in my experience.

pinterest completely destroyed google images and is now starting to infect duckduckgo

I know google search is probably not the same as google flights, but I spent a few hours recently (over a few days) using google flights and it was a delight to use compared to the Expedia children.

We had a bereavement in the family and had to book multiple independent tickets because we could not travel together. This was just after the Ukraine war started so prices had gone through the roof. Exact number of days was not that important as compared to the price and the duration and using google flights UI to slice and dice the data was such a joy. Want to freeze the airline and look at the alternatives - which include from and to dates, number of days of trips, or freeze any other parameter and analyze others, the response was sub-second. Did not eventually book through them since I did not want to get into the google payment system (they offer booking through others that I did not explore).

On the opposite side was Expedia children where they would show a price of 2800 and when you click the price invariably it has gone up to 3600. Again. And again. And again. Not sure if that problem existed with google although I paid the exact same price as the airline as shown by google, it could just be a coincidence.


Google flight's true genius is being able to search multiple city-pairs.

Let's say you want to get from the West Coast to Europe on a business class flight, but want to save some money.

Realistically, it's cheap and easy to get from West Coast airport to another, and similarly cheap and easy to get form one European airport to another.

Google flights will let you search for the best combination of flights that depart from any combination of up to 6 airports, and arrive at up to 6 airports. The technology they purchased (ITA) will let you do this as well, but limits you to a single country. Google flights? No problem with destination airports in multiple countries.

So, you could search for a flight that originates in some combination of, say Seattle, Portland, San Francisco, Los Angeles and Vancouver BC, and lands in some combination of London, Amsterdam, Paris, Madrid, Milan or Frankfurt. (Or any other 6 large European airports).

Google Flights will also show fares for a specific trip length (say, 14-days) over an entire month.

Want to filter by a maximum flight duration? No problem. Number of connections? Done. Specific airline or alliance? Easy.

In a traditional search engine, I'd have to run close to 400 individual searches to get the data that Google Flights gives me in a single screen.

It works well for domestic flights as well. I recently helped someone who had to attend a wedding across the country find a flight that was less than $180/person, when they thought they were going to have to pay over $800/person. Just by using Google Flight's tools for about 10 minutes.

There's plenty to complain about in the Google ecosystem, but Google Flights is amazing.


You can do the same thing in momondo. Being in Europe, what I like about momondo is that they include the cheap carriers - like ryanair

The product was called ITA Matrix, Google bought them a while back

I find https://www.kiwi.com/en/ works well for doing all of this too.

Google bought the main provider of flight information in 2010 (ITA). ITA had such a monopoly that Google was required to continue licensing their software out for years afterwards, though I think it expired in ~2016.

ITA Matrix is still available and is my preferred flight search tool. Still find the old interface much better than the new:

https://oldmatrix.itasoftware.com/


I wonder how much of the old Lisp codebase is left.

Also, it looks like the Matrix might go extinct soon:

> This interface runs on a deprecated web platform. At some point in the near future, we will be forced to shut it down. We unfortunately do not have a timeline on when this will happen. We welcome feedback about features missing from the new interface, we read all feedback and open bugs accordingly.


it's pretty horrible, I just search my vacation destination and it doesn't have LCC Wizzair I'm flying with in Europe = useless search tool

At least on my searches with Google Flights, Wizzair shows up as a carrier for EU flights. Good search engine for low cost EU carriers is https://www.azair.eu/

Azair doesn't really work either anymore, they will show you non-existent flights or wrong prices, for rough idea is OK, but data is wrong

That's on the LCC, not the aggregator. They don't want to pay commission, and that's fine. If you want to fly on them you have to actually go on their website.

I don't really care who pays who, but search engine which can't find me cheapest flights it's useless

Well, you're getting it free: if you don't get any use from it, don't use it. Try doing an iterated search of every possible route on every possible airline website and let us know how you get on!

1. just because something is free it's not excuse to provide poor service

2. I don't use it, I point out for others facts about service OP recommends so they don't waste their time or you prefer everyone wasting their time?


Here is the latest version of ITA Matrix as a google hack https://partnerdash.google.com/apps/matrix/search

Minor note: they announced it in 2010, but the acquisition closed on April 12, 2011. [1]

[1] https://googleblog.blogspot.com/2011/04/ita-software-acquisi...


ITA wasn’t in any way to e main provider of flight information in 2010. It’s a relatively minor player, but with great features

ITA powered the backend flight search functionality for virtually every OTA, even if the ITA Matrix website wasn’t popular among end users.

I personally like to use Momondo. Have had pretty good experience with Momondo over the years so far. I haven’t tried Expedia, so I can’t directly compare it. But I recommend giving Momondo a try anyways.

I will also plug Kayak as being the only other decent flight search site.

The difference between Kayak/Google and the others is that Expedia and friends are online travel agencies. Kayak and Google are really just search engines. It makes a world of difference.


Can you explain what is the difference? I suppose the flight search engines don't book the flights.

The travel agent wants to sell you a flight and pick up a commission. The search engine just tells you what's out there.

On top of querying the travel agency APIs, Kayak will also query each airline directly. So you also get fares and airlines that aren’t published to agencies

I miss Hipmunk, and curse the day Concur bought them.

Same! Their “agony” sorting (basically sorted flights by a combination of # of stops, total hours, and price) was a breakthrough at the time. It seems to be factored into most flight searches today, even though it’s not labeled as such.

Google flights is delightfully easy to use

The one thing I wish that Google offered would be the ability to blacklist sites for a period (coud be fixed - say 6 months).

So damn annoying when the top search results all lead to shitty SEO-optimised sites that use a whole page to blather on and on, leading to a tiny information nugget at the end. No value, just excellent SEO scamming.

As these scam artists get better and better at this, Google gets less and less useful.

When I see a site like that, I can be quite sure there is no value to me from that site. I want to blacklist it - not forever (though I'd settle for that) but so I don't see it in search results again.

The crazy thing is that this could even be a benefit for Google themselves. They could aggregate these signals and use them to identify SEO scammers, since their algorithms clearly can't. I'm sure that Google aren't happy with the lacklustre performance of their search in modern times.


> Blocks sites you specify from appearing in Google search results This extension prevents the sites you specify from appearing in Google search results.

https://github.com/iorate/ublacklist

https://chrome.google.com/webstore/detail/ublacklist/pncfbmi...


I use uBlacklist in Safari - highly recommended. In Chrome I have a custom search string that adds "-pinterest -quora" because result on those spammy sites are behind a login wall

Ironically I tried to find a blacklist extension or similar through Google search past week, and couldn't find it due to blogspam talking about how it used to be. So added reddit and clicked in past year, but reddit now doesn't show accurate dates on posts so that didn't work.

Thanks, will add this!


> The one thing I wish that Google offered would be the ability to blacklist sites for a period (coud be fixed - say 6 months).

I wish I could do that at Hacker News too.

I really just don't want anything from medium.com.


That's a weird thought. Medium might be irritating as an interface or as a business but there are still some really interesting things written on it, no?

There may be good content on medium, but more often than not I'm frustrated by long-winded, less accurate retellings of official documentation. I think it might be because of the monetization. People have an incentive to write shallow blog posts/tutorials on things like "throwing an exception with Python." If I'm out of options I will skim through a Medium post, but usually I avoid them at first because they tend to be less helpful than non-Medium blogs, official docs, etc.

This is completely anecdotal but most medium (and similar platforms) that I end up on from HN are basically an A4 page of what is essentially nothingness.

This is not to say it’s not interesting, because it is, but what typically happens is that I end up in the “want to know more” state, but then don’t actually get to know more. This is not really on the medium authors, it’s more just my social media consumption taking me back to HN and then never looking into it again, but that also means that my time on the medium article was sort of wasted doesn’t it?

I can’t for the life of me remember a single medium article that had any sort of impact on me or anything I do tech wise.


I've found the Medium posts that make it to Hacker News to be drivel: Uninteresting and inaccurate. The titles themselves are interesting, but the content itself isn't worth reading.

It's happened so much that I just won't click on anything that's on medium.com.


I've noticed the same thing, but (like n=2, so take it with a grain of salt) even for authors whose other writings I enjoy; I wonder if there's something weird with how they structure the incentives for writing there such that this is what you get

Seems like you could do that in a reasonably thorough way on HN with an ad block filter.

Quick and dirty version for uBlock Origin:

  news.ycombinator.com##tr.athing:has(a[href*="medium.com"])
  news.ycombinator.com##tr.athing:has(a[href*="medium.com"]) + tr
  news.ycombinator.com##tr.athing:has(a[href*="medium.com"]) + tr + tr.spacer
If somebody can improve this, please share.

There is a programmable search feature [0] that lets you limit search to a defined list of sites. Someone did a ShowHN a few months ago where they had built a programmable search with 200ish common sites that a stereotype HN reader might like (software documentation, wikipedia, reddit, some news and other media, etc), and it was actually pretty good.

I've said before, google is now basically what I'd call a "smart" portal site. For most stuff, you already know the handful of sites you might want to look at, and google just sort of brings you there from a relatively clean interface, as opposed to a traditional portal that would have lots of categorized nested links to traverse. In most cases you're not searching for a random site that you wouldn't know existed if it wasn't indexed, like in 1998. So the whitelist approach actually works pretty well.

[0] https://developers.google.com/custom-search/


You're looking for https://notrashsearch.github.io/

HN comments: [Show HN: No Trash Search](https://news.ycombinator.com/item?id=29774456)


Current Google search doesn't add features, it removes them (see end of + sign and other stuff). This is sadly the logical behavior of a "mature product". When you own the market, your rational question is "how much can I extract from the customers who already must come to me" and that means becoming more and more directive towards the customer. Letting the customers customize only stands against this.

This used to be a feature of Google Search. It then moved from the Google Account to Chrome. I'm not sure if it still exists in the Chrome browser. It can be done with addons but of course that doesn't feed back to Google.

It does surprise me that Google wouldn't want to capture this signal. Maybe it is too susceptible to abuse?


> Maybe it is too susceptible to abuse?

The incentive to block one's competitors from 10,000 different accounts is likely why they no longer offer this function?


At the end of the day, being able to detect bots (and the abuse from them) is key to maintaining, well, anything.

It surprises me too. Perhaps they are snow blind from having had something like it in the past and then removing it.

But in the modern era, I feel that being able to use human signals like that - on top of the fancy algorithms - could well be a killer network effect for them.


The SEO dreck drives more ad impressions than real content. Why would they improve the results when they have a captive audience who won't switch to other search providers.

This is not only a problem with Google search. DDG is now also flooded with auto (AI?) generated content. For example there are now loads of websites about programming topics. They look real but always lack the main point of the article which makes you navigate to other articles on the same website.

>As these scam artists get better and better at this, Google gets less and less useful.

Any ranking manoeuvrability that Google offers can and will be used against them.

The SEO scammers out there would just automate millions of proxy IP addresses to blacklist all of their competitors sites.


But the blacklist would be personal, so applying only to me.

This is a great idea. On YouTube I use the 'Don't recommend channel' button a lot and consequently the algorithm has learned to filter out a lot of the nonsense.

This button doesn’t really work for me. Often I need to click it multiple times before a video dissapears from my recommendations. Same with the ‘Not interested’ button.

I’m guessing YT works with a system that is ‘eventually consistent’, but having to click the button 2-3 times before a video disappears doesn’t seem particularly consistent to me.


I always assumed that they were getting paid.

I can’t think of any reason why a spam Pinterest link has any value, yet it’s ranked high.


Pinterest has a surprisingly large userbase of people who actually like the content there.

There are a lot of real world people I meet who speak highly of all the great creative ideas they get from there.


I don’t care, they are a blight on the internet that deserves to end. I have NEVER wanted to see a Pinterest board in my Google search results image or otherwise and their SEO optimisation serves only to parasitically drive traffic in order to feed their conversion funnel.

If your Pinterest board is public and you let others see it that’s nice, but the Pinterest developers should never have let that show up in my search results, your board is not and will never be more interesting than the original content you have pinned from other places on the internet, shitty aggregate pin pages hurt original content producers by ranking better and stealing page views and preventing users from navigating to the original content without jumping through hoops in the Pinterest user interface.


It's still completely useless for people who are not on Pinterest.

Kagi.com allows such blacklisting.

> So damn annoying when the top search results all lead to shitty SEO-optimised sites ...

The problem is, since these sites dominate the SERPS, there is little incentive for anyone to offer the result you want. As a consequence, the web page that would satisfy your request probably doesn't exist.


I'd say it depends on the money they make. I'm not sure how, but it must be profitable enough for them to let it persist.

I've a good experience with uBlacklist, an extension that blocks results by domain in the search page results.

I believe -site:facebook will remove that site from your queries. Annoying but it is functional.

I’m sure someone is going to point this out but guess what happens when scammers find out their site can be black listed?

They get crowds of people and/or scripts to blacklist all other sites.


Aw man... that would destroy pinterest.

kagi.com allows you to adjust from the results page how often you want to see a page in the results or just block it entirely. I don't think there is a way to temporarily ban though.

ublacklist is a browser extension that does what you want. it works well and i don't find myself having to blacklist new sites very often

A HOSTS file for search would be most welcome.

Google used to allow you to block individual domains right from the search UI. It was discontinued in 2011.

There are numerous extensions to block domains, like uBlacklist: https://chrome.google.com/webstore/detail/ublacklist/pncfbmi... (not affiliated)


This will create Perverse incentive, SEO "ninjas" will just mass-block competitors.

The problem, as always, is credibility. If they're not careful about the credibility of the aggregate blocking signal, this would become another tool in the SEO scammer toolbox as they use sockpuppet accounts to downmoderate their competitors.

I started adding "reddit" to all my searches in google, cause 99.9% of time all google links point to seo rubbish.

It is absolutely impossible to find anything anymore, I gave up on google for all intents and purposes and use it exclusively as reddit indexer.


You.com lets you prefer reddit and see it in your search

An interesting observation is that over the years, I've sort of found that if there are not many good results, maybe I'm not asking the right questions, or I'm not thinking about the problem the right way. For example, you come up with some weird programming idea to solve a problem and no results are found -- I'm almost always headed down the wrong path. This has been proved to me over and over again as sort of a canary in the search coal mine.

Google still solves all my questions, if I'm asking the right questions, I guess is what I'm getting at.


>"if there are not many good results, maybe I'm not asking the right questions"

I used to think this way too, but I've come to realize that Google has turned from a search engine that returns results based on my input to an answer engine that actively tries to reframe whatever I enter into some other more generic query - usually with the intent of selling me something or returning SEO spam. I've also found that the old google-fu techniques are now so unreliable that they must have been deprecated. I can't tell you how often I use quotes in my query and see results that don't contain that text at all.


Back when I started at Google, they at least said they actively targeted "long-tail" queries to gain market share. The thinking was that people would use a search engine until they hit some query that their usual search engine failed on. Then they'd give another search engine (perhaps Google) a try, so if you really targeted those tough queries and were at least decent at the common queries, you'd gain market share.

Though, even when I was doing indexing changes at Google, the common practice was to do A/B testing with both the most common queries and a uniformly random sample (see reservoir sampling) of queries in order to justify a go-live of indexing changes. The former explicitly over-weights common queries, and the latter still optimizes for the common case. (In case you're wondering, the worst query I had to manually check in A/B testing was [flesh hook suspension].)

Google used to turn off some of the query re-writing logic (that tries to fix your query) if you used a query operator. (It has been a while, but I think maybe even their "Kansas" user info database kept track of the last time you used an operator, and would turn off some of the cleverness if you had recently used a search operator, as it was a good signal that you were a power user capable of optimizing your own queries.) My understanding is that they don't disable any of the too-clever bits for power users any more, and that everything uses all of the cleverness of learn-to-rank all the time.

I suspect it has gotten even worse with learn-to-rank, as it must be incredibly difficult to intentionally under-weight the uncommon/difficult queries.

They did keep track of when users re-issued similar queries in a short period, as a signal that the ranking algorithm wasn't doing well. I think an optimal system would use learn-to-rank for the first query in a related sequence of queries, and then switch to turning some of the smarts off, and finally switching to a learn-to-rank algorithm trained only on later queries in these related query sequences. That way, they can avoid the secondary learn-to-rank instance from over-fitting the median/easy queries.


They taught us the search engine operators in elementary school computer class. It’s a shame that they’ve gone away now, and the internet is so much worse than it was then.

Nowadays you should google using what in 2005 would be considered "the wrong way to search". Back then it was all about keywords, quotes and so on. For example if you wanted to know if dogs could eat apples you'd search:

> dog apple dangerous

Today you should search:

> can dogs eat apples

And you get way better results with the second form. I've noticed that people who are stuck thinking that the right way to google is still the former overlap a lot with the crowd that keeps complaining google is worse now than it was before.


I’d argue that it’s not a “better way”, but rather a regression: I assume Google genuinely wants to understand my query, and I don’t see any reason why it shouldn’t still be obvious to Google what kind of results it should return.

Otherwise, it there truly is a “right way of asking Google questions”, why doesn’t Google release and promote a guide about it so people can be more successful in their search?


Okay, so answer me this... I'm trying to find out about the extra controls in the old space suits. I believe they used a chin switch or chin toggle (resistive). If I google "did astronauts control things with their chin" I see: 'It's All Different for Women in Space | Marie Claire' and 'Pillsbury Space Food Sticks were sweet Slim Jims in space' as the first 2 answers... nothing about switches. If I google 'space suit chin switch' I get links to resistivity toggle switches.

heck I was putting in exact model numbers today like MS3015S20-27SZ and google was returning me OTHER models such as MS3015S20-27PZ which is the male or female version of a part (PZ vs SZ), or changing the20-27 to a 29-4 which are all completely different parts. This is with verbatim on.

I could very well be biased, but the first example still seems more useful than the second.

https://www.google.com/search?hl=en&q=dog%20apple%20dangerou...

https://www.google.com/search?hl=en&q=can%20dogs%20eat%20app...

Also worth comparing (seems to focus more on diabetes than cyanoglycosides. Diabetes is the bigger chronic problem, and cyanoglycosides are the bigger acute problem, so which is a bigger danger largely depends on how disciplined the owner is.):

https://www.google.com/search?q=what+is+dangerous+about+feed...


Another thing for us on HN to note. This is how most people ask questions. It is not just that the other way is wrong - it is also less common.

Ya of course, but my natural inclination is not to ask a question to a web page

Right - I think this is part of the "scaling" of the internet. Those of us (relatively small set of earlier heavy internet users) are forced to change as the product is built for the wider audience.

I'm in the awkward middle area I think where I'm almost a digital native, and what you said maps perfectly to my experience. In middle school and high school I got pretty good at the first type of search you mentioned. I was young enough that I was almost learning how to "speak" google and it came easily, I was fluent. Throughout the end of high school and into college I had more and more issues finding what I wanted and one day after I had failed to search several times I thought to myself "Fuck it. I'm just going to type in exactly what I want in plain English, as if I was a moron, and see what happens." And it worked, and that day a lightbulb went off in my head.

I think this also indicates why Google seems to have gotten worse at programming questions.

If “you’re doing it wrong” didn’t fly with Apple. It doesn’t fly with Google.

I hate it as much as any other geek, but it did pretty much fly with apple. Even when it was as bad as "You're holding your phone wrong" bad.

I've noticed the same, and it's an incredibly hard habit to break!

"you're thinking about it wrong" isn't really that great a defense of Google, you know? I'm not being flippant. The two examples you gave are different searches, with different goals.

>answer engine that actively tries to reframe whatever I enter into some other more generic query

Google has come full circle and turned into Ask Jeeves.


Be careful. Maybe it's a useful rule of thumb, but be careful of the "searching for your keys under the light pole" problem.

I worked at Google on rich content indexing for 4 years, more than a decade ago now. Google is pretty good. It used to really cater to "long tail" searches, but a combination of SEOs getting better (and specifically targeting Google) and "learn to rank" over-optimizing for the median query, means that lots of long-tail queries don't do very well on Google.


This is a great way to describe it. Google still gives me good results for the common stuff, but really fails me when I'm looking for something in the "long tail".

I just wish I could find obscure things again.


I wish there was a way to get rid of all sellers from results. Of late I have had to dig 2-3 pages in to get to anything which wasn't an online store.

There is, use a different search engine. I've even found something as new as Andisearch good for finding research papers and Qwant good for general searching. There are many more and they keep up with Google pretty well now without the trash.

What is a common query? facebook.com? Some very impressive search capabilities needed for that query there.

More than a decade ago, Google had some special-case logic for these "navigational queries". Lee Kai-Fu (head of Google China at the time) said that navigational queries were particularly prevalent in Chinese search traffic.

Though, there's also a use case for people trying to find third-party information about various websites, particularly in trying to figure out if the website itself is a scam. You really want to have the navigational result up at the top unless you have a very high degree of confidence that the site is a scam, but in all cases you want high quality reviews of the site to follow up the navigational link.


Probably something like: "food delivery near me"

I wish I could find forums with my answer instead of a multitude of websites and blogs that do not answer my query. I guess I could add 'forum' or something but when I am already searching a specific query and particular piece of hardware I don't need the first page to show me retailers of said hardware.

Tried it, blog spam mentions finding or not finding stuff in forums way too much and it basically didn’t help unless I started getting into weird Google fu like including strings from the copyright footer of various forums in an advanced search using the OR operator and multiple quoted sections … at which point I was like “why am I working this hard on making Google better when they don’t fucking pay me”

> For example, you come up with some (...) idea (...) and no results are found -- I'm almost always headed down the wrong path.

And by that you mean, Google is telling you how to think, and that you should think in some way and not in some other way. I'm quite sure I am not comfortable with that.


And what do you do about this discomfort? Asking for a friend who is also uncomfortable with that "guidance".

This resonated a lot with me. Often if I'm learning a language and I see no one is doing what I'm querying for, I take a step back and try to ask a more basic question

For code maybe that's true. It's hard to see how I might be wrong about the place I want to travel to or the recipe I want to cook though.

I googled a phone number recently. It was the first result, got my answer. The rest of the results were websites for physical locations I had been recently, including a hospital, none of which had any relationship to the phone number.

Google is _not_ a good guideline for asking the right questions. It is trying to make as much money as possible and it will scramble anything it can think of to do that. Don't use that as a guideline for how you are thinking about questions.


1. Show visited locations in search results

2. ???

3. Make as much money as possible


> 2. ???

oh Haha, funny old meme. Step 2 though is very clearly to show more ads and more content, to show more 'full results pages', etc, rather than just show me the 1 thing I needed and be on my way.

I should include that Google said next to each result ('"555-555-555" is not on this page') and then show the page. Totally knows that the results are unrelated to my query, but shows them anyway. Why?


So was there even a single ad on that search result page? From your original description it certainly didn't sound so. If not, how did it contribute to you being shown more ads?

it's not really that Google lags, but rather SEOers have optimized for Google. The problem is intractable. When people talk about the 'good ol days' or times when Google was better, it was simply because there was less SEO, less spam and generally fewer pages on the internet.

Google could be better than it is now, but there's no incentive to do so, unfortunately. Say Google allowed you to blacklist entire sites from the results - inevitably those sites that have the most ads would be the most likely to be blocked, resulting in lower revenue for Google.


The fact that recipes on on this list suggests this is the case.

Recipe sites are notorious for SEO tactics. They all follow the same highly optimized format with the stupid story about the author's grandma and how they just couldn't get enough of these cookies, and how the recipe was lost for 90 years until recently their great great uncle Lou found a copy of the recipe in an old donut.

Google has all of the tools to solve recipes. Make Google Recipe with a standard template and a way to link in and out of YouTube. People who contribute popular recipes get ad revenue. People with recipes and YT videos get even more. Adding ways to find similar recipes would be a killer feature. Who hasn't found a recipe that was almost what they were looking for, but was missing that je ne sais quoi.


The stories also serve the purpose of providing copyright. A recipe alone doesn't have copyright, but the story mixed throughout the recipe does have copyright.

Nah, the stories are there so you have to scroll 6000 pages to finally find the actual recipe. Why do they want you to scroll? Because every 5th word is an advertisement. More scrolling, more ads, more revenue.

All of these recipe sites (that I've seen) will drop the ingredients + directions on the bottom of the page.


No, it's so the pages follow thr Google blessed "ideal" page format. X number of words, 2-3 pictures, iser stays on page for longer than Y time.

Last one is the killer, a site that quickly gives you the info you are looking (or quickly shows you it doesn't have the info) for is punished by google search ranking.


I thought it was so the site has longer to mine monero using your CPU while you scroll? Ever notice how incredibly slow any device gets when looking at recipes?

It's possible that the scrolling is giving the remote server time to use your CPU/GPU to mine some space cash, but it's also possible that attaching listeners to events that happen many times per second (such as scrolling, mouse movement) is common, especially if 3rd party ads are allowed to interact with the page, making it nearly impossible to know what those ads' scripts are doing.

Isn't that similar to what they tried for things like shopping for example, and they got sued for? The problem is that if they become an aggregator for a specific type of content, like let's say travel or lyrics, then other aggregator websites start suing.

Oh, I was not suggesting they be an aggregator. They should get people to produce original content and pay them per view like YouTube.

Shout out to https://onlyrecipe.app which allows you to extract just the recipe from SEO-corrupted ad-riddled websites.

"Recipe sites are notorious for SEO tactics. They all follow the same highly optimized format with the stupid story about the author's grandma and how they just couldn't get enough of these cookies, and how the recipe was lost for 90 years until recently their great great uncle Lou found a copy of the recipe in an old donut."

I LOL'ed. Thank you for that.


You don't even have to blacklist... just look at user behaviour. Did the user come back to google and select the next result after two seconds on that site? Did many users do that? Something has to be wrong with that site, rank it down.

I'm pretty sure i'm not the only one who clicks on pinterest results in eg. image search, and immediately click back to find another image somewhere else.


> Did the user come back to google and select the next result after two seconds on that site?

It usually takes me at least a minute to recognize a search-optimized text if it's a topic I am unfamiliar with. Let's say I am googling something about windproofing underfloor insulation. The article starts with some basics about underfloor insulation in general, so I skim through the introduction, start hunting for the part of the article where it actually starts talking about windproofing and realize it's been cobbled together out of six random introductions or generated by GPT-3.


No, that SEO shit works. The whole adbased internet is optimized to waste your time.

Did I spend a minute or two trying to read a 1000 word article, because I was looking for how many pixels there is in a 4k monitor? Or did I spend 2 seconds visiting a chart?


It has been a constant battle of SEO improving and Google improving the page ranking algorithm. For a long time Google was winning that battle, but more recently they haven't seen as interested. The economic argument you're making may be the reason, but it's short termist if people slowly get driven away from Google by poor quality search results.

Five to ten years ago no other engine was close - the fact that duckduckgo was even try was comical. Google had an effectively monopoly. That might not be true at all five to ten years from now if the trend continues.


those sites would just clone themselves with a different domain

I am sure it’s plenty more than these three.

Yesterday, I searched "best after-sale service of AC". What I was shown was SEO'd pure junk. Absolute junk as the first result.

Next few were the same, but more focused on affiliate programs rather than providing genuine info.

Down the line was Quora, where _sales rep of AC companies_ wrote answers that _theirs_ had the best service.

I wad very disappointed.

You.com showed me better result right away.

My Kagi and You use is now on par with my Google use. They mights surpass Google soon.

I still find Google to be the best for programming answers btw.

I would also say that Google's ad business is in direct conflict of interest with its search business.


Am I the only ones who has a really hard time with Quora. I can never tell which answer is real or fake. And even the "real" ones rarely ever answer the question. Quora is basically the Pinterest of answer results.

I never find google to be good for programming queries. I use you.com for all programming queries, and recently learned to use their Coding Complete app, it's actually great. This article doesn't do it a proper justice

I much prefer Kagi to Google (and DuckDuckGo et al). Often the first page of Google consists solely of SEO rubbish. Being able to block and rank domains is also useful.

For straightforward answers and navigating to websites, I often use what is the default in browser that I happen to use at that time.

I use DDG, FF, and Bromite on my smartphone. Sometimes Brave, too. I have Brave as my secondary browser (FF as the main, LibreWolf for personal stuff) on my daily driver as well.

I used Brave Search and DDG a lot. Nothing can go wrong with them if all I am asking is the capital of Belgium or trying to navigate to a subreddit.

With special, not so straightforward searches, I rely on You and Kagi.

I really appreciate You's different kinds of search results and grouping them. They are clear that they use some kind of AI model to tailor search results, and I have an account there.

I really like Kagi for showing me sites that I wasn’t even aware of. I can boost or shove down sites. I turned off Quora and Pinterest right away. No extensions, no scripts. Love that.

As I said, Google’s adbiz is in direct conflict with its Search division. Search should show the best result for the Searcher. Ad div would want the pages to be shown that are crawling, infesting with Google Adsense ad. When more of these pages are visited, they have better metrics to lure more people to Adsense thus making $$$.

This is clear conflict of interest.


you.com, the search engine you control, huh?

http://shitmyself.com/thumb/thumb_800_5e84542c26a8815f33d767...


Google is not great for code search, but I dislike this "rich snippet" thing.

These engines are stealing the sites traffic. The whole point was to be a search engine, not an encyclopedia. If you want to be the latter, produce your own content.

It's my opinion. I don't use those engines because of that. They jeopardize their sources. It's unsustainable.


Yeah I think coders should give up on search engines to meet their needs and instead rely on coder-specific search engines like

- https://searchcode.com/

- http://symbolhound.com/

- https://publicwww.com/

etc


IMHO, the options you listed still miss the point.

When I do a code-related search, most often I'm looking for a documentation entry.

So, for example, if I search for "python string replace", Google returns me a bunch of crappy pages.

I can fix it with "site:python.org string replace".

SearchCode does a terrible job [1]. So does Public WWW [2].

Now, Google nails with the first result, pointing to Python's built-in types page. But it would be perfect if it could just add a #str.replace fragment to the URL [3] and save me some scrolling. Instead, it sends me to the top of the page...

[1] https://searchcode.com/?q=python+string+replace

[2] https://publicwww.com/websites/python+string+replace/

[3] https://docs.python.org/3/library/stdtypes.html#str.replace


symbol hound looks dead

searchcode seems to not let you say things like "in the main api", or in libraries, it's all usages, it's also actually the code itself. so "boto3 list_buckets" a common query I do (that google actually does well).

publicwww seems to be searching google?


For Python programming I would say 100% of the time you should look the answer up in the official manual for a well-defined problem (delete a file) because the manual is correct, well-written, etc. It's astonishing how often Google and Bing snatch defeat from the jaws of victory on queries like this.

If you go looking in splogs, spam overflow and other spam sites at best you are going to get wrong answers, at worse you will get answers that "aren't even wrong".


Searching for "Delete a file" in the python manual, will take you to a page about the configuration parser.

https://docs.python.org/3/search.html?q=delete+a+file&check_...

Top result: https://docs.python.org/3/library/configparser.html?highligh...

Top 4 result is "Miscellaneous operating system interfaces", which does hold the answer, but it is not obvious, and browsing through that page is quite a chore before you finally get to `os.remove`, which says that it deletes "a path", which even I, a seasoned developer need to look twice to make sure that path removing a path and a file is the same thing. https://docs.python.org/3/library/os.html?highlight=delete%2...


To be fair I have read the Python manual over and over again, I can usually find things by clicking on the appropriate section.

I get so annoyed that geeks for geeks is the default thing for so many things I search for in python. Really, it would be far more useful to just bring up the python docs most of the time.

I agree yet in the article google is criticised for having the official docs as search result 1.

I guess I just want different results to the same query than the author


Of the three code related searches in the original blog post, only one Google example listed the official Python documentation as the first link. In the other two examples, the sites shown where:

- W3 Schools

- StackOverflow

I will definitely agree with you that the author seems to want very different things from their search results than we might though. I will always always prefer the official docs (which I can pick through) when I make as vague a search as "<language> throw exception". If I wanted to know "how do I <verb> <noun> in <language>", then that's what I'd Google.


I wish the python manual was the default search for function references too...

Realpython is a third party site that I'll admit is acceptable for these results.

Mostly yes, you just search efficient documentation.


I haven't coded python in a while, but does the python manual follow the man pages style of listing every possible parameter including the weird deprecated options that haven't been used since 1998 before actually providing information and examples of typical usage and instructions on how to handle the common use-cases?

Because that's the software equivalent to recipe spam.


>because the manual is correct, well-written, etc.

It's also verbose and requires effort and working memory to parse, when I could get a trivial one line answer or code snippet from a stackoverflow post specific to my question. Yes, in an ideal world we would all read the manual, but unfortunately manuals are inconvenient. And in my experience the vast majority of stackoverflow answers are correct.


I think it takes a lot of effort and working memory to ignore the ads and correct the mistakes on sites like spam overflow, spam3school, etc.

If it was was Clojure or some other language that has an awful online manual it is one thing but the Python manual is good.


Spam overflow? Is that stack overflow?

And... I'm guessing w3school?


Question regarding the recipe example. Does Google deduct quality points the more ads that are put on the website? A long time ago, I recall hearing how Google was able to beat Yahoo by focusing on quality of ads & a higher click rate. Has Google defeated it's rivals & switched to their tactics?

Side note, I prefer DDG as my search but only because of the bang operators. For recipes !b added to the search lets me use Bing. As the article points out, Bing is really awesome for searching for recipes.

Looks like I need to start trying out Neeva & You.com. They had some nice features in this article.

Neeva seems to be stealing all the important content from the recipe website which is a highly discussed issue. Bing tries to walk this line by making you still go to the website to read the instructions. Obviously people have trashed Google for doing this same thing on other kinds of websites. Though blogger recipe websites have somewhat encouraged this behavior due to their insane amount of ads & life stories they're well known for.


A very high percentage of ads on sites now use the Google ad network. So it may actually be in their interest to promote sites with more ads. They certainly promote a lot of garbage new sites on their Google news feed app and the product that they include on the pixel phones.


Go look at the horrible chumbox infested AMP sites that Google promotes on Google Assistant. You can tell where their priorities lie.

you.com has all the bangs that DDG has too now.

It is especially frustrating to use when Google while coding. I've noticed an increase in SEO sites on Google that seem to just scrape Q&As from the internet and regurgitate them[0]. I've recently started trying other search engines like DuckDuckGo and You.com and thankfully haven't had any issues with these sorts of sites popping up as results. It makes debugging 10x faster not having to sift through so many fake answer sites.

[0] https://quick-adviser.com/how-do-i-use-google-calendar-in-dj...


Some of these are so bad it's hilarious. My favorite so far is https://quick-adviser.com/how-do-i-turn-my-android-into-a-be...

> Is this page any better than what you’d have gotten in 2010? What if it looked like this instead?

I can't take seriously an example that still puts w3schools as the first search result. If I were searching for a simple answer about a language feature, I would want the search engine to give me a page from the definitive authoritative source on that language. w3schools isn't that.


Web dev is my bread and butter for many years, and for realistic simple and functional samples (which work beyond the latest builds, since I try to accomodate all 100% of my visitors) no one even comes close to w3schools.

I wish them many years of success and prosperity.


One Google feature that I miss terribly is hard filtering.

Used to be if you included "term" or -"term" you'd only get results that did/n't include those terms. But it seems Google has gone all in on the "I don't think you really meant that" approach [], and the hard filters have become suggestions at best.

--

[] Ok, I know it's probably because they're switching more and more to semantic search and ML, but they could retain the hard filters on top.


Similar to the date filtering, you used to be able to see results between two dates, this doesn't seem to work at all anymore.

As an example, if I search for 'gilbert gottfried' and set the date filter 1 apr -> 3 apr, I still see stories about his death.


Do you have any examples of searches that don't respect hard filters? The only time I've seen it ignoring quotes is when there are 0 results and it puts up a little notice at the top "No results found for "…". Results for … (without quotes):"

Next time that happens, open up the page source and search again, and you will likely find the term amongst a huge huge list of SEO terms.

It's extremely frustrating how much "content" is displayed on the page versus how much is hidden in the page source.


100% agree. This used to be my secret weapon for finding things. -"term" along with "other terms you were looking for".

>[] Ok, I know it's probably because they're switching more and more to semantic search and ML, but they could retain the hard filters on top

The issue is that "power users" that are even aware of quotes and and hard filtering are now the long tail that google is no longer optimizing for. They'd much rather focus on the 99% of searches by, for lack of a better term, normies, and as a consequence, rather than expecting users to learn to think, their search features and performance are regressing toward a totally dumbed down mean. And I think society is worse for it.


Applications are open for YC Summer 2022

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: