Berkeley CSUA MOTD:Entry 15036
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2025/04/03 [General] UID:1000 Activity:popular
4/3     

1998/11/28-30 [Computer/HW/Memory] UID:15036 Activity:high
11/28   Let's talk about virtual memory:
\__ if you'r edealing with virtual memory which
   far exceeds physical memory, you've already lost.
        \_ if you'r edealing with virtual memory which far exceeds
        \_ First of all, the guy is talking about
           number crunching, not image processing.
           It is likely that he's going to be
           addressing all of the memory he's
           crunching with.  And second, no one said
           it was impossible--it just is painfully
           slow.  disk is like 6 orders of magnitude
           slower than RAM.
           \_ uhh. "number crunching" applications
              usually exhibit greater locality
              than almost any other app, if
              optimized properly. -nick
                \_ which will help not at all if
                   it's using more than physical RAM.
                   \_ What part of "locality" don't
                      you understand, twink?  Nick knows
                      what he's talking about.
                   \_ Whether you can get away with crunching >> than RAM
                      depends strongly on the precise task.  A blocked
                      matrix multiply might perform well, especially if
                      you somehow anticipate the data needed next and
                      stream it into RAM beforehand so you don't have to
                      eat the large latency while it is paged in.....
                      Having a RAID array (expensive) also helps.
                      A non-sparse matrix-vector multiply, however, requires
                      1 memory reference for every 2 flops (not counting the
                      memory reference for the vector element).
                      memory reference for the vector element or result).
                      Assuming .5 flop per tick on a 400MHz P-II, we'd need
                      floats from the matrix at 100MHz, or 400MB/sec.
                      SDRAM might sustain that, but if the matrix were
                      SDRAM might sustain that, but if the matrix were much
                      larger than memory....  Performance would drop 10x
                      at least.  --PeterM
                      \_ I don't think ILP is heavily influenced by how
                         much a process' virtual memory compares to the
                         physical memory.  Virtual memory pages are usually
                         on the order of 64kb.  Compare that to, say, a Cray
                         vector register file which is a 32x32 64bit matrix.
                         That's 8kb and it takes several clock cycles for
                         a number crunching program to process the data in
                         the 64kb page anyway.  But this guy is talking about
                         a Pentium and a K6 to do number crunching.  I don't
                         think he's going to benefit from that kind of ILP
                         and even if he did he would still benefit from
                         the spatial and temporal locality of the program.
           physical memory, you've already lost.
           (\_ This has pretty much been my experience--PeterM )
               \_ First of all, the guy is talking about number
                  crunching, not image processing.  It is likely that
                  he's going to be addressing all of the memory he's
                  crunching with.  And second, no one said it was
                  impossible--it just is painfully slow.  disk is like 6
                  orders of magnitude slower than RAM.
                  \_ uhh. "number crunching" applications usually
                     exhibit greater locality than almost any other app,
                     if optimized properly. -nick
                     \_ which will help not at all if it's using more
                        than physical RAM.
                        \_ What part of "locality" don't you understand,
                           twink?  Nick knows what he's talking about.
                           \_ Whether you can get away with crunching >>
                              than RAM depends strongly on the precise
                              task.  A blocked matrix multiply might
                              perform well, especially if you somehow
                              anticipate the data needed next and stream
                              it into RAM beforehand so you don't have
                              to eat the large latency while it is paged
                              in.....  Having a RAID array (expensive)
                              also helps.  A non-sparse matrix-vector
                              multiply, however, requires 1 memory
                              reference for every 2 flops (not counting
                              the memory reference for the vector
                              element or result).  Assuming .5 flop per
                              tick on a 400MHz P-II, we'd need floats
                              from the matrix at 100MHz, or 400MB/sec.
                              SDRAM might sustain that, but if the
                              matrix were much larger than memory....
                              Performance would drop 10x at least.
                              --PeterM
                              \_ I don't think ILP is heavily influenced by
                                \_ ILP ==> "Instruction Level Parallelism"
                                   I don't understand why you mention it
                                   here.  --PeterM
                                   \_ You mentioned vector ops which is a
                                      form of instruction level parallelism.
                              how much a process' virtual memory
                              compares to the physical memory.  Virtual
                              memory pages are usually on the order of
                              64kb.  Compare that to, say, a Cray vector
                              register file which is a 32x32 64bit
                              matrix.  That's 8kb and it takes several
                              clock cycles for a number crunching
                              program to process the data in the 64kb
                              page anyway.  But this guy is talking
                              about a Pentium and a K6 to do number
                              crunching.  I don't think he's going to
                              benefit from that kind of ILP and even if
                              he did he would still benefit from the
                              spatial and temporal locality of the
                              program.
2025/04/03 [General] UID:1000 Activity:popular
4/3     

You may also be interested in these entries...
2012/12/18-2013/1/24 [Computer/SW/Languages/Perl] UID:54561 Activity:nil
12/18   Happy 25th birthday Perl, and FUCK YOU Larry Wall for fucking up
        the computer science formalism that sets back compilers development
        back for at least a decade:
        http://techcrunch.com/2012/12/18/print-happy-25th-birthday-perl
        \_ I tried to learn Perl but was scared away by it.  Maybe scripting
           lanauages have to be like that in order to work well?
	...
2012/2/5-3/26 [Computer/SW/WWW/Browsers] UID:54300 Activity:nil
2/5     How is Firefox on version 10, while I still have 3.6 installed.
        I wait for the X.1 versions and they never come out.
        \_ I'm also on 3.6.26.  It claims that versions 4 - 10 are all faster
           than 3.6.x, but do they use more memory?  Thx.
           \_ Newer Firefox versions use less memory too:
              http://www.maximumpc.com/article/news/mozillas_memshrink_program_brings_big_memory_savings_firefox_7
	...
2011/6/5-8/27 [Computer/HW/Memory] UID:54127 Activity:nil
6/5     In an effort to stabilize our services, we'll be rebuilding parts of
        the CSUA infrastructure over the course of this summer.  To give us
        some wiggle room, I've temporarily decreased soda's allocated RAM from
        8GB to 2GB.  If you need to run something that requires large amounts
        of memory, please send mail to root@csua.org and we'll try to
        accommodate your request.  --jordan
	...
2011/3/26-4/20 [Computer/HW/Memory] UID:54062 Activity:nil
3/19    When you're explaining the stack to people do you draw it with the
        highest addresses at the top or at the bottom?
        \_ When I explain any memory layout including stacks, I draw with the
           highest addresses at the bottom.  But I've seen people doing the
           other way.  -- yuen
           \_ do you by any chance have seen or have a jpg of the full memory
	...
2010/7/25-8/25 [Computer/SW/Graphics] UID:53897 Activity:nil
7/25    What's up with that moving bit pattern that Win7 displays when it
        boots up?  (It's the one that's like in the Apple II days when you
        use the graphics memory for code or data while it's still in HGR
        mode.)  Is there a way to disable that?  It slows things down a lot
        every time I reboot my Win7 VM on my office machine if I RDP to the
        machine via DSL.  Thanks in advance.
	...
2010/5/26-6/30 [Computer/SW/Unix/WindowManager, Computer/SW/OS/OsX] UID:53844 Activity:nil
5/26    anyone use lxde?  supposedly it is less stupid than xfce and
        less bloated than gnome.  thoughts?
        \_ lol, does anyone still use desktop linux?  Get with the times
           buy a mac.  Now.  DO IT.  Go NOW.
           \_ but we prefer herring to Kool-Aid
              \_ "you have to yell, he's hard of herring"
	...
2010/6/4-30 [Computer/SW/Languages/C_Cplusplus] UID:53849 Activity:nil
6/4     Is this valid C++ code?
        std::string getStr(void) {
            std::string str("foo");
            return str;
        }
        void foo(char *s);
	...
2009/12/7-2010/1/3 [Computer/HW/Memory, Computer/HW] UID:53574 Activity:nil
12/7    How many TCP retransmits are too many? Here is what I get:
            3594143433 segments received
            3760174421 segments send out
            3801829561 segments retransmited
        \_ rephrase. you can never have too much money. or too little.
           what is, is.
	...
2009/10/27-11/3 [Computer/HW/Drives] UID:53474 Activity:nil
10/27   I just read an article that Facebook had moved their database
        to all SSD to speed throughput, but now I can't find it. Has
        anyone else seen this? Any experience with doing this? -ausman
        \_ I hope you're not running mission critical data:
           http://ask.slashdot.org/story/09/10/27/1559248/Reliability-of-PC-Flash-SSDs?from=rss
        \_ Do you have any idea how much storage space is used by Facebook,
	...
2009/7/21-24 [Computer/SW/Languages/Java] UID:53168 Activity:moderate
7/20    For those who care btw, it looks like eclipse is now A Standard Tool
        at UCB ugrad cs, probably replaced emacs.  Furthermore, people get
        angry at seeing Makefiles, (since eclispe takes care of that).  I
        guess it's just a sign of the times.
        \_ The more people at my work use eclipse the less the code is
           managable in emacs.  I'm not sure which application's fault
	...