Berkeley CSUA MOTD:Entry 16692
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2025/05/24 [General] UID:1000 Activity:popular
5/24    

1999/10/11-13 [Computer/SW/Unix] UID:16692 Activity:high
10/11   Does anyone know the secret algorithm behind http://www.google.com It is
        quite good. I'm very impressed. I want to know HOW they did it.
        \_ a bunch of shell scripts -- awk, sed, grep, and pipe.
        \_ My understanding is it's a popularity based results engine.  The
           more people who choose a particular result for a particular query
           the  higher that result will be displayed for similar future
           queries.  I don't work at google but we did a similar thing at the
           search engine company I did work for.
           \-maybe PCI. there are a lot of ways to do this kind of thing.
            there are linear algebra approaches and statistical/baysian
            approaches, depending on problem size and nature of prob. --psb
           \_ I've read a couple of articles that have mentioned it, which
              said that they base their scoring on how many other pages
              link to a particular page, rather than number of times it's
              chosen on their site.  http://www.google.com/why_use.html
              seems to support this.  -niloc
        \_ number of links to a page determine importance. they sort according
           to relevance and important. i know what i'm talking about. -ali
            \_ relevance is only determined by what people actually choose
              out of the links returned from the search. the second factor
              is accuracy, which is the "drift" from relevance. accuracy is
              the perennial problem, since almost all search engines start
              suffering around 5 to 10% of the first links offered.
              \_ Ali is correct.  The stanford prof of the grad students who
                 developed google comercially came to Soda a few weeks ago
                 and said exactly that - it's click throughs and links to
                 that determine ranking -jones
            \_ It has to be more complex than this or Yahoo would show
               up as the #1 hit for every query.  #2 would be Microsoft.
               They *must* take into account the query itself (seems
               obvious, no?) in some way before doing a most-linked sort
               on the results.  So, no, I don't think you know what you're
               talking about.  Are there any CSUA'ers on their architecture
               design, engineering or database staff?  If so, please come
               forward.  Ali having had coffee with someone's secretary at
                                        \_ was this after the mindblasting sex?
               google doesn't impress.
               \_ Eat your words, blasphemer!  The only person I trust more
                  than ali is bh.
        \_ I think they use a variant of the clusterfuck algorithm.
        \_ Algorithm, Heuristic, BAH! They don't interest me and are trolls.
           \_ You are a faggoty bitch.
2025/05/24 [General] UID:1000 Activity:popular
5/24    

You may also be interested in these entries...
2012/8/30-11/7 [Computer/SW/Apps, Computer/SW/Unix] UID:54470 Activity:nil
8/30    Is wall just dead? The wallall command dies for me, muttering
        something about /var/wall/ttys not existing.
        \_ its seen a great drop in usage, though it seems mostly functional.
            -ERic
        \_ Couldn't open wall log!: Bad file descriptor
           Could not open wall subscription directory /var/wall/ttys: No such file or directory
	...
2012/9/20-11/7 [Computer/SW/Unix, Finance/Investment] UID:54482 Activity:nil
9/20    How do I change my shell? chsh says "Cannot change ID to root."
        \_ /usr/bin/chsh does not have the SUID permission set. Without
           being set, it does not successfully change a user's shell.
           Typical newbie sys admin (on soda)
           \_ Actually, it does: -rwsr-xr-x 1 root root 37552 Feb 15  2011 /usr/bin/chsh
	...
2012/9/24-11/7 [Computer/SW/Languages, Computer/SW/Unix] UID:54484 Activity:nil
9/24    How come changing my shell using ldapmodify (chsh doesn't work) doesn't
        work either? ldapsearch and getent show the new shell but I still get
        the old shell on login.
        \_ Scratch that, it magically took my new shell now. WTF?
           \_ probably nscd(8)
	...
2012/4/27-6/4 [Computer/SW/Languages/Misc, Computer/SW/Unix] UID:54372 Activity:nil
4/27    I wrote a little shell script to collect iostat data:
        #!/bin/bash
        DATE=`date +%m%d`
        DATADIR=/var/tmp/user
        OUTPUTFILE=$DATADIR/$DATE.out
        while true
	...
2011/11/20-2012/2/6 [Computer/Companies/Apple, Computer/SW/Unix] UID:54237 Activity:nil
11/20   Are there tools that can justify a chunk of plain ASCII text by
        replacing words with words of similar meaning and inserting/removing
        commas into the text?  I received a 40-line plain text mail where
        all the lines are justified on left and right.  Every word and comma
        is followed by only one space, and every period is followed by two
        spaces.  The guy is my kid's karate instructor which I don't think is
	...
2011/10/26-12/6 [Computer/SW/Unix] UID:54202 Activity:nil
10/24  What's an easy way to see if say column 3 of a file matches a list of
       expressions in a file? Basically I want to combine "grep -f <file>"
       to store the patterns and awk's $3 ~ /(AAA|BBB|CCC)/ ... I realize
       I can do this with "egrep -f " and use regexp instead of strings, but
       was wondering if there was some magic way to do this.
       \_ UNIX has no magic. Make a shell script to produce the ask or egrep
	...
Cache (199 bytes)
www.google.com -> www.google.com/
Web Images Groups News Froogle^ New! Google Search I'm Feeling Lucky Advanced Search Preferences Language Tools Advertising Programs - Business Solutions - About Google Graduating? Come work with us.