Scobleizer: The race to time-based and blog search

  • Kenneth Chu · 4 years ago
    That's so true. When you search for something on Technorati, it doesn't reflect the latest thing. Technorati seems to be suffering from information overload and does not give very good search results. By the way, the new template is so much better.
  • Farooq · 4 years ago
    actually i liked your radio blog template better...it forced you to focus on the text and it made for an easy reading directly from the site...

    plus mudpit opened up in a new window...so i didn't have to click once more to see all your entries...

    hmmm...anyway your choice...we'll have to keep up :)

    btw Scoble...when are you putting up your Xbox 360 video online...the one that talks about its web-integration features...you still under NDA? I'm hoping there's at least one more big announcement before the launch cause I hear that Sony's going to debut the Playstation Online service to parry the Xbox 360 launch...
  • Robert Scoble · 4 years ago
    October 18. Yeah, this template needs a lot of work.
  • Farooq · 4 years ago
    a couple of more comments:

    - you're dead on about the Apple thing...Steve Jobs has said periodically that Media Centers and video ipods make no sense...and well both have been set up by MS/partners before...

    - Xbox 360 is about to launch...I think Microsoft needs to partner with someone for a handheld gaming system...you guys have great IP in this area...BUT don't go out to make your own handheld...partner with someone who's already got something out there...i think it's important cause that's the only thing missing from the MS portfolio in the gaming market...

    and I wouldn't have written this if I didn't have a partner in mind :)

    N-Gage! it has not done so well (thanks to shoddy design) but the concept is just plain cool...a good integrated device and you could tout its connectivity features with Xbox 360...heck you can call it the Xage (pronounced: Zage)...
  • Danny Sullivan · 4 years ago
    "But, before I dive into the state of time-based search today, let’s look at Yahoo, Google, and MSN first so you can see just how bad those three are if you want to find something that was added to the Web yesterday."

    Let's qualify. You mean how bad they are if you only look at the web search results and ignore the onebox/shortcut displays they have.

    In other words, do [video ipod] on Google or Yahoo, and at the top of the pages, they show you plenty of news results. They aren't behind in gathering fresh data. They're simply segregating it into the news area and giving you a heads-up that it is there.

    You're either missing it or ignoring it because those top of the page segments don't feel "normal" to you. All I can say is that the search engines are aware of that issue.

    If you look at my article it talks about how at some point, the search engines need to automatically push the right button or tab or link for you, to give you 10 news results for queries that obviously are news related. Or you do a shopping search and you get all shopping results automatically.

    The problem is the search engines are frightened about making such a change. If they get it wrong, they may lose people. So they are slowly letting vertical listings creep in this way.

    Remember, web search is NOT a time-based activity. Honestly. Think about it. The last time you did a web search for something new, you weren't looking for the best overall site on the subject, were you? No, you wanted the latest, timely information. You wanted news. They give you excellent news through news search engines. And Yahoo, among the majors, as you know just started incorporating blogs as a news source, as well.

    Overall, Robert, I think the posts you are doing on search are great in raising the issues out there and helping push for further UI changes that need to happen. But I think it would also help to point out some of the features that do exactly what you want, when they exist. IE -- everyone, you want timely info?
    news.google.com, news.yahoo.com are great places to go.

    As for your blog search problem, yeah, I know that well. It's why I don't depend on blog search much. I get timely, but I also get all the crud. PubSub tries to solve this by picking the most authoritative blogs, but I haven't found that's really solved the problem much.

    Ultimately, it will probably come down to blog search further refining this, letting you search by default against a set of hand selected or some other method filtered blogs, to cut out all the spam -- and you can go further across all the blogs if you want. But when there are simply so many blogs out there, a good chunk of them splogs and so on, you've got to have some filtering. THAT's why news search works so well, because the vertical sites allowed in there are reviewed.
  • Jeff Schiller · 4 years ago
    Danny's comment has an open element with no closing , it's screwing up your comments.

    Time/Blog-based searching does currently suck, but it will get better. I agree that people need to get these results from the main search page, not from a separate page (and I assume once BlogSearch makes its way out of Beta that it will be integrated somehow into google's main search). I think it would be good to see time-based entries in a separate column next to regular search entries. If you look at google, their regular search has the heading "Web" and their Blog Search has the heading "Blog Search". They should combine these on one page (in two columns) but they would likely have to shrink their "sponsored links" div. Each column could be leafed through its pages without affecting the other column (ie use some Ajax).

    I think a relevancy figure should be determinable from a heuristic that combines text relevancy with recency with # of inbounds with # of comments/trackbacks (for that entry only), but of course that's open to spam attacks.
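
    The heuristic described above could be sketched roughly as follows. This is only an illustration: the `Entry` fields, the weights, the 24-hour half-life, and the log damping are all assumptions, not anything a real search engine is known to use. The damping on link and comment counts is one simple way to make bulk spam pay off less, though it does not remove the problem.

```python
import math
from dataclasses import dataclass


@dataclass
class Entry:
    """Hypothetical per-entry signals; field names are illustrative."""
    text_score: float    # text relevancy in [0, 1] from some text ranker
    age_hours: float     # hours since the entry was posted
    inbound_links: int   # inbound links to this entry
    responses: int       # comments + trackbacks for this entry only


def damp(count: int) -> float:
    """Map a raw count to [0, 1) with diminishing returns."""
    x = math.log1p(count)
    return x / (1.0 + x)


def relevancy(entry: Entry,
              half_life_hours: float = 24.0,
              weights: tuple = (0.5, 0.3, 0.1, 0.1)) -> float:
    """Weighted blend of text relevancy, recency, inbounds, and responses.

    Weights and half-life are assumed values chosen for the example.
    """
    w_text, w_recency, w_inbound, w_responses = weights
    # Recency halves every `half_life_hours`.
    recency = 0.5 ** (entry.age_hours / half_life_hours)
    return (w_text * entry.text_score
            + w_recency * recency
            + w_inbound * damp(entry.inbound_links)
            + w_responses * damp(entry.responses))


fresh = Entry(text_score=0.7, age_hours=2, inbound_links=3, responses=5)
stale = Entry(text_score=0.7, age_hours=240, inbound_links=3, responses=5)
ranked = sorted([stale, fresh], key=relevancy, reverse=True)
```

    With equal text scores and link counts, the two-hour-old entry sorts ahead of the ten-day-old one, which is the time-based behaviour being asked for.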
  • Jeff Schiller · 4 years ago
    Ugh, that was supposed to be " element without a closing element"...c'mon WordPress, time for a "Preview Comment" option!
  • John Furrier · 4 years ago
    Everyone knows the web 1.0 algorithmic search doesn't work in the real time environment. Historical stuff - great and awesome but real time stuff I agree with you Scob 100%. It's going to be an algorithm that no one sees coming and I think it will be social based .... keep your eye out for some new stuff...
  • Leo F · 4 years ago
    I still think we should not divide search engines by content type unless it is clearly divided. Yeah, blogs are a type of content, but there's a fine line between what is a blog and what is a "site". I think we should just search from the standard Google website, and let the search engine figure out what's what. Don't put that burden on the user. I want to be able to get for example the most relevant posts about the new Xbox.... are blogs more relevant ? Or content websites ? Are newer posts better (or more relevant) than older posts ? Relevancy is very hard to measure as you know.

    regards,
    Leo
  • Lorelle VanFossen · 4 years ago
    Part of the problem is that time-based searching is dependent upon the blog post being found in a "timely manner". This means that web crawlers have to be on the alert 24/7 and happen to cross THAT post at THAT moment or soon after its release. Imagine the size and speed of that crawler!

    So that leaves us with two options. Either the burden is upon the user to update tagging and search services, or pings and trackbacks will have to merge and grow into a new form of "I got a secret!".

    I see a form of ping and trackback services sending out an excerpt of the post to search engines and directories at the moment of posting. Immediately, time-based information is delivered, literally, to your door.

    Google, Yahoo, and MSN are not the end-all, but they are the beginning. I see tagging as part of the baby steps of information gathering on the Internet.

    The first to come up with this new form of ping and trackback service, with checks and balances thrown in, will get all my attention, and it should get yours.
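
    A ping that carries an excerpt, as described above, might look something like the sketch below. The existing `weblogUpdates.ping` XML-RPC call carries only a blog name and URL; the method name `example.pingWithExcerpt`, the struct fields, and the 200-character limit here are all hypothetical, made up for illustration. The code only builds and parses the request body and does not contact any service.

```python
import xmlrpc.client

# Hypothetical method name; no real ping service is known to expose it.
PING_METHOD = "example.pingWithExcerpt"


def build_ping(blog_name: str, post_url: str, excerpt: str,
               max_excerpt_chars: int = 200) -> str:
    """Return an XML-RPC request body announcing a new post with an excerpt."""
    payload = {
        "name": blog_name,
        "url": post_url,
        # Truncate so the ping stays small; the limit is an assumed value.
        "excerpt": excerpt[:max_excerpt_chars],
    }
    return xmlrpc.client.dumps((payload,), methodname=PING_METHOD)


body = build_ping(
    "Example Blog",
    "http://example.com/post-1",
    "Time-based search depends on posts being found quickly. " * 10,
)

# A receiving service would parse the body back into parameters.
params, method = xmlrpc.client.loads(body)
```

    The checks and balances mentioned above (for example, verifying that the URL really serves the excerpt) would sit on the receiving side and are not shown.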
  • PierreS · 4 years ago
    Well you should try Atiki.com because it *really* works :)
  • theroxylandr · 4 years ago
    So where is your link to Paris Hilton video?
  • PierreS · 4 years ago
    What ?
  • mary hodder · 4 years ago
    thanks Robert, very kind of you to note my work. i wish someone would fix these problems now too. i think there is a real need to make blogs accessible to people who aren't in the blog community, and who want some comprehensible way (no geekiness allowed) to understand them, and who is doing the writing.

    it would help people quite a bit.

    mary
  • Jeremy Pepper · 4 years ago
    What is it with you and scrapbooking?
  • scobleizer · 4 years ago
    Jeremy: it's just a subculture I'm not that familiar with that's very large. Translation: it is a good one for geeks like me to study and see if we can do better in serving.
  • David Sifry · 4 years ago
    Robert, have you tried our Blog Finder?

    http://www.technorati.com/blogs/scrapbooking

    More to come, thanks for your feedback.

    Dave
  • None · 4 years ago
    Have you seen these search engine indexing speed benchmarks? http://www.mackmo.com/nick/blog/tech/?permalink=Search_Engine_Indexing_Speed.txt
  • Jeremy Pepper · 4 years ago
    Oh, you should have mentioned it in NYC - I love that community, and have worked with it. And, I think it makes too much money to be a subculture. :)

    And, you are right - there's a lot we can learn from scrapbooking about grassroots efforts, word-of-mouth activities, and building lasting communities.
  • Christopher Coulter · 4 years ago
    "The best stuff" is very subjective, the world tis not all geek posers.

    PS - Gawd are these comments impossible to read. For all the blog blahger and overhype, you'd think someone could ever come up with a decent comment system, no blog engines ever seem to work.
  • Martin Sigaard · 4 years ago
    I came to this site via Blogniscient. Being a librarian, I always prefer some sort of classification :)
    Try and see if this is fresh enough for you:
    http://technology.blogniscient.com/main_fs.html
  • Andrew Sears · 4 years ago
    Blogniscient looks interesting.

    Sounds like you are looking for a tail -f INTERNET | grep 'October 16, 2005' command somewhere.

    Findforward has some interesting ways of using the Google api..

    http://www.findforward.com/?q=microsoft&t=chat

    My problem isn't finding stuff on the internet, it's reading it all. Robert, you have way too many blogs going on right now! :) Way too much stuff happening with Microsoft. Also with China... (see here http://travelcostarica.blogspot.com/)

    The name is a bit misleading, but I use this blog for documenting places I am going to travel to. I am going to China this week to see if it works from over there...

    Try searching for China on Gadda.be to get some interesting results of what is happening in the world today...
  • Don Singleton · 4 years ago
    There are a lot of different sites blogs can ping when they post, and if a search engine is not pinged, it cannot include that post in a search.
  • webbeleah · 3 years ago
    beautiful online information center. greatest work... thanks
  • oxley · 3 years ago
    Great job guys...