Berkeley CSUA MOTD:Entry 38465
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2025/04/03 [General] UID:1000 Activity:popular
4/3     

2005/7/7-9 [Computer/SW/Languages/C_Cplusplus, Computer/Theory] UID:38465 Activity:nil
7/7     I never took AI but I'd like to learn something about Bayesian
        belief, inference, etc. Something like a 10 page intro PDF or
        a URL would be useful. Recommendation?
        \_ http://www.cs.ubc.ca/~murphyk/Bayes/bayes.html
           Kevin is a former Cal grad student, btw.  Very smart guy. -- ilyas
                 \- how do you know kevin?
                    http://home.lbl.gov:8080/~psb/ASSOCIATES/Mike+Kevin81.jpg
                    He actually fits into this story:
                      *Boredcast Message from 'psb': Mon Aug  4 15:22:19 2003
                      ||let's see: A is hosting B for a few days ... B
                      ||who used to date C who is marrying D who was
                      ||interested in A and was told by B to go out with
                      ||A. D was the roomate of the office mate of E who
                      ||used to date X and later dated F who got
                      ||together with B.
                      ||A = psb
                      ||X = rob pike
                    I note in passing B also works on graphical models and
                    was formerly trafficking with various UCLA and Harvard
                    and Berkeley Bayesians.
                    \_ I worked on Kevin's toolbox a long long time ago.
                       We took some classes together also.  Kevin interviewed
                       at UCLA for a faculty position, but didn't take it.
                         -- ilyas
           \_ spasibo ilya! That is exactly what I was looking for. You are
              a very cool guy. Why do people make fun of you?
              \_ Probably because I am considered the motd retard. -- ilyas
2025/04/03 [General] UID:1000 Activity:popular
4/3     

You may also be interested in these entries...
2014/1/14-2/5 [Computer/SW/Languages/C_Cplusplus] UID:54763 Activity:nil
1/14    Why is NULL defined to be "0" in C++ instead of "((void *) 0)" like in
        C?  I have some overloaded functtions where one takes an integer
        parameter and the other a pointer parameter.  When I call it with
        "NULL", the compiler matches it with the integer version instead of
        the pointer version which is a problem.  Other funny effect is that
        sizeof(NULL) is different from sizeof(myPtr).  Thanks.
	...
2013/4/9-5/18 [Computer/SW/Languages/C_Cplusplus, Computer/SW/Apps, Computer/SW/Languages/Perl] UID:54650 Activity:nil
4/04    Is there a good way to diff 2 files that consist of columns of
        floating point numbers, such that it only tells me if there's a
        difference if the numbers on a given line differ by at least a given
        ratio?  Say, 1%?
        \_ Use Excel.
           1. Open foo.txt in Excel.  It should convert all numbers to cells in
	...
2013/4/29-5/18 [Computer/SW/Languages/C_Cplusplus, Computer/SW/Compilers] UID:54665 Activity:nil
4/29    Why were C and Java designed to require "break;" statements for a
        "case" section to terminate rather than falling-through to the next
        section?  99% of the time poeple want a "case" section to terminate.
        In fact some compilers issue warning if there is no "break;" statement
        in a "case" section.  Why not just design the languages to have
        termination as the default behavior, and provide a "fallthru;"
	...
2012/7/19-11/7 [Computer/SW/Languages/C_Cplusplus] UID:54439 Activity:nil
7/19    In C or C++, how do I write the code of a function with variable
        number of parameters in order to pass the variable parameters to
        another function that also has variable number of parameters?  Thanks.
        \_ The usual way (works on gcc 3.0+, Visual Studio 2005+):
               #define foo(fmt, ...) printf(fmt, ##__VA_ARGS__)
           The cool new way (works on gcc 4.3+):
	...
2011/3/7-4/20 [Computer/SW/Languages/C_Cplusplus] UID:54056 Activity:nil
3/7     I have a C question.  I have the following source code in two identical
        files t.c and t.cpp:
                #include <stdlib.h>
                int main(int argc, char *argv[]) {
                  const char * const * p1;
                  const char * * p2;
	...
2011/2/5-19 [Computer/SW/Languages/C_Cplusplus] UID:54027 Activity:nil
2/4     random C programming/linker fu question.  If I have
        int main() { printf("%s is at this adddr %p\n", "strlen", strlen); }
        and soda's /proc/sys/kernel/randomize_va_space is 2 (eg; on)
        why is strlen (or any other libc fn) at the same address every time?
        \_ I don't pretend to actually know the right answer to this, but
           could it have something to do with shared libraries?
	...
2010/2/12-3/9 [Computer/SW/Languages/C_Cplusplus] UID:53708 Activity:nil
2/12    I need a way to make a really big C++ executable (~200MBs) that does
        nothing.  No static initialization either.  Any ideas?
        \_ static link in lots of libraries?
        \_ #define a   i=0; i=0; i=0; i=0; i=0; i=0; i=0; i=0; i=0; i=0;
           #define b   a a a a a a a a a a
           #define c   b b b b b b b b b b
	...
2009/9/28-10/8 [Computer/SW/Languages/C_Cplusplus] UID:53409 Activity:nil
9/28    http://www.tiobe.com/index.php/content/paperinfo/tpci/index.html
        Java is #1!!! Followed by C, PHP, C++, Visual Basic, Perl,
        C#, Python, Javascript, then finally Ruby. The good news is
        Pascal is going waaaay back up!
        \_ C is still more popular than C++?  I feel much better about myself
           now.
	...
2009/8/7-14 [Computer/SW/Languages/C_Cplusplus, Computer/SW/Languages/Java] UID:53252 Activity:high
8/6     In C one can do "typedef int my_index_t;".  What's the equivalent in
        C#?  Thanks.
        \_ C#? Are you serious? Is this what the class of 2009 learn?
           \_ No.  I have to learn .NET code at work.  I am Class of '93.
           \_ python is what 2009 learns, see the motd thread about recent
              cal courses and languages
	...
2009/7/21-24 [Computer/SW/Languages/Java] UID:53168 Activity:moderate
7/20    For those who care btw, it looks like eclipse is now A Standard Tool
        at UCB ugrad cs, probably replaced emacs.  Furthermore, people get
        angry at seeing Makefiles, (since eclispe takes care of that).  I
        guess it's just a sign of the times.
        \_ The more people at my work use eclipse the less the code is
           managable in emacs.  I'm not sure which application's fault
	...
2009/9/23-10/5 [Computer/SW/Database] UID:53392 Activity:nil
9/23    I never took CS188, is there a good book that's an intro to formal
        database theory, normalization, etc.?  I've got experience with SQL
        (MySQL & MSSQL), and understand tables, etc.
        \_ You mean CS186?
           \_ Oops, yah.  188 is AI or something?
              \_ That's right.
	...
2004/4/6-7 [Computer/Companies/Google, Computer/Theory] UID:13045 Activity:nil
4/6     Looking for introductory material on SVM, C4., Ripper, and Naive
        Bayes. Most of the junk on google are too deep for comprehension.
        I just need introductory material. ok thx
        \_ http://paulgraham.com / google "A Plan For Spam"
        \_ where did you hear about all of these terms? did you learn about
           them with regards to spam filtering? naive bayes is a specific
	...
2002/11/7-8 [Computer/Theory] UID:26462 Activity:kinda low
11/7    Anyone have a good intro (i'm in math54 atm) to bayesian analysis
        that a college sophomore could understand? i read that the new
        spamassassin uses it (like ifile) but cant find a good intro text on
        it other than:
        http://www.paulgraham.com/spam.html
        (which was actually pretty good)
	...
2001/11/8-9 [Computer/Theory] UID:22971 Activity:very high
11/7    I hate math 55. If CS is an extension of math 55, I'm gonna drop out.
        \_ if you don't like math 55 you are better off doing something else
           with your life.  You will not be happy as a programmer.
           \_ I don't think math55 is a good measure of what a career in cs
              is like.
              \_ career in CS or a career in programming?  they are not the
	...
2001/6/17-19 [Computer/Theory] UID:21552 Activity:high 86%like:21549
6/17    [REPOST] Anyone know of a good introduction to the Gell-Mann
        8 Fold Way that can be understood by a engineering grad (7
        series plus one ud phyics class and one ud math class)?
        \_ forget it.
        \_ forget it. There's so much physics out there that is accesible
           to someone with the background you describe, that will actually
	...
2000/10/20 [Computer/Theory] UID:19531 Activity:very high
10/20   Except for Graphics, it seems like high level math doesn't seem to
        be needed in computer science.
        \_ heh heh.  heh heh heh.  heh.
           \_ yes.
        \_ except for graphics and for anything that involves fucking
           modeling. -ali
	...
2000/8/14-15 [Computer/Theory] UID:18977 Activity:high
8/13    Is there such a thing as an algorithm that can output another
        algorithm? Kind of like self tuning, self evolving algorithm?
        \_ Yes. Not as glamorous as it sounds; see also: genetic algorithms,
           Remez algorithms, data-directed programming
        \_ bison, yacc, (f)lex, and many many more. not self tuning though.
        \_ There are algorithmns that can learn from data.  It's really not
	...
2000/5/9-10 [Computer/SW/Editors/Vi] UID:18218 Activity:nil
5/9     Try this: In vi, type in a paragraph that has its first line
        tab-indented. Then, go to the top of the paragraph, and !}fmt
        (Pipe the paragraph through fmt). It'll tab indent the first
        TWO lines of the paragraph. Here's where it gets wierd: Go
        down and delete the second tab. Go back up and !}fmt again.
        This time it doesn't add the second indent, it does what I
	...
2000/5/3-4 [Computer/Theory] UID:18162 Activity:very high
4/32    Stat 200a vs Stat 101 vs Stat 134 -- any comments? Experience with
        any would be appreciated too. (Assuming that purpose is applications
        in an arbitrary area of theoretical CS)
        \-sigh, i guess i will wade in: if you let me know what topics you
        \_ Can you describe the subject matter please?  I am too lazy to
           look it up.
	...
Cache (8192 bytes)
www.cs.ubc.ca/~murphyk/Bayes/bayes.html
A Brief Introduction to Graphical Models and Bayesian Networks By Kevin Murphy, 1998. "Graphical models are a marriage between probability theory and graph the ory. They provide a natural tool for dealing with two problems that occu r throughout applied mathematics and engineering -- uncertainty and comp lexity -- and in particular they are playing an increasingly important r ole in the design and analysis of machine learning algorithms. Fundament al to the idea of a graphical model is the notion of modularity -- a com plex system is built by combining simpler parts. Probability theory prov ides the glue whereby the parts are combined, ensuring that the system a s a whole is consistent, and providing ways to interface models to data. The graph theoretic side of graphical models provides both an intuitive ly appealing interface by which humans can model highly-interacting sets of variables as well as a data structure that lends itself naturally to the design of efficient general-purpose algorithms. Many of the classical multivariate probabalistic systems studied in field s such as statistics, systems engineering, information theory, pattern r ecognition and statistical mechanics are special cases of the general gr aphical model formalism -- examples include mixture models, factor analy sis, hidden Markov models, Kalman filters and Ising models. The graphica l model framework provides a way to view all of these systems as instanc es of a common underlying formalism. This view has many advantages -- in particular, specialized techniques that have been developed in one fiel d can be transferred between research communities and exploited more wid ely. Moreover, the graphical model formalism provides a natural framewor k for the design of new systems." New York Times article (4/28/01) about Bayesian statistics. Note: Despite the name, Bayesian networks do not necessarily imply a comm itment to Bayesian statistics. List of other Bayes net tutorials Representation Probabilistic graphical models are graphs in which nodes represent random variables, and the (lack of) arcs represent conditional independence assumptions. Hence they provide a compact representation of joint probability distributions. Undirected graphical models, also called Markov Random Fields(MRFs) or Markov networks, have a simple definition of independence: two (sets of) nodes A and B are conditionally independent given a third set, C, if allpaths between the nodes in A and B are separated by a node in C By contrast, directed graphical models also called Bayesian Networks or Belief Networks (BNs), have a more complicated notion of independence, which takes into account the directionality of the arcs, as we explain below. In addition, directed models can en code deterministic relationships, and are easier to learn (fit to data). In the rest of this tutorial, we will only discuss directed graphical m odels, ie, Bayesian networks. In addition to the graph structure, it is necessary to specify the parame ters of the model. For a directed model, we must specify the Conditional Probability Distribution (CPD) at each node. If the variables are discr ete, this can be represented as a table (CPT), which lists the probabili ty that the child node takes on each of its different values for each co mbination of values of its parents. Consider the following example, in w hich all nodes are binary, ie, have two possible values, which we will denote by T (true) and F (false). We see that the event "grass is wet" (W=true) has two possible causes: ei ther the water sprinker is on (S=true) or it is raining (R=true). The st rength of this relationship is shown in the table. For example, we see t hat Pr(W=true | S=true, R=false) = 09 (second row), and hence, Pr(W=fal se | S=true, R=false) = 1 - 09 = 01, since each row must sum to one. S ince the C node has no parents, its CPT specifies the prior probability that it is cloudy (in this case, 05). The simplest conditional independence relationship encoded in a Bayesian network can be stated as follows: a node is independent of its ancestors given its parents, where the ancestor/parent relationship is with respe ct to some fixed topological ordering of the nodes. By the chain rule of probability, the joint probability of all the nodes in the graph above is P(C, S, R, W) = P * P(S|C) * P(R|C,S) * P(W|C,S,R) By using conditional independence relationships, we can rewrite this as P(C, S, R, W) = P * P(S|C) * P(R|C) * P(W|S,R) where we were allowed to simplify the third term because R is independent of S given its parent C, and the last term because W is independent of C given its parents S and R We can see that the conditional independence relationships allow us to re present the joint more compactly. Here the savings are minimal, but in g eneral, if we had n binary nodes, the full joint would require O(2^n) sp ace to represent, but the factored form would require O(n 2^k) space to represent, where k is the maximum fan-in of a node. Inference The most common task we wish to solve using Bayesian networks is probabil istic inference. For example, consider the water sprinkler network, and suppose we observe the fact that the grass is wet. There are two possibl e causes for this: either it is raining, or the sprinkler is on. We can use Bayes' rule to compute the posterior probabili ty of each explanation (where 0==false and 1==true). is a normalizing constant, equal to the probability (likelihood) of the d ata. Explaining away In the above example, notice that the two causes "compete" to "explain" the observed data. Hence S and R become conditionally dependent given that their common child, W, is observed, even though they are marginally independent. For example, suppose the grass is wet, but that we also know that itis raining. Then the posterior probability that the sprinkler is on goes down: Pr(S=1|W=1,R=1) = 01945 This is called "explaining away". In statistics, this is known as Berkson's paradox, or "selection bias". Let C denote the event that someone is admitted to college, which is made true if they are either brainy or sporty . Suppose in the general population, Band S are independent. We can model our conditional independence assumptions using a graph which is a V structure, with arrows pointing down: B S \ / v C Now look at a population of college students (those for which C is observ ed to be true). It will be found that being brainy makes you less likely to be sporty and vice versa, because either property alone is sufficien t to explain the evidence on C (ie, P(S=1 | C=1, B=1) <= P(S=1 | C=1)) . This is called diagnostic, or "bott om up", reasoning, since it goes from effects to causes; Bayes nets can also be used for causal, or "top down", reasoning. For example, we can compute the probability that the g rass will be wet given that it is cloudy. Hence Bayes nets are often cal led "generative" models, because they specify how causes generate effect s Causality One of the most exciting things about Bayes nets is that they can beused to put discussions about causality on a solid mathematical basis. One very interesting question is: can we distinguish causation from mere correlation? The answer is "sometimes", but you need to measure the relationships between at least three variables; the intution is that one of the variables acts as a"virtual control" for the relationship between the other two, so we don't always need to do experiments to infer causality. "Cause and Correlation in Biology", Bill Shipley, 2000, Cambridge University Press. Conditional independence in Bayes Nets In general, the conditional independence relationships encoded by a Bayes Net are best be explained by means of the "Bayes Ball" algorithm (due t o Ross Shachter), which is as follows: Two (sets of) nodes A and B are c onditionally independent (d-separated) given a set C if and only if ther e is no way for a ball to get from A to B in the graph, where the allowa ble movements of the ball are shown below. Hidden nodes are nodes whose values are not known, and are depicted as unshaded; The most interesting case is the first column, when we have two arrows co nverging on a nod...