Berkeley CSUA MOTD:Entry 48642
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2025/05/24 [General] UID:1000 Activity:popular
5/24    

2007/11/15-17 [Computer/HW/Drives] UID:48642 Activity:high
11/15   So if I have a process running in Linux and kill -9 isn't
        killing it, and killing its parent process didn't kill it,
        and now it's reading as though its parent process is 1,
        is there any way to kill it short of rebooting the machine?
        \_ Sure, but you dont want to be mucking around in kernel data
           strucures.  Most practical way is to reboot the machine.
        \_ Sure, but you dont want to be mucking around in kernel data
           strucures.  Most practical way is to reboot the machine.
           \_ Ah, good ol' Linux reliability. Just out of curiosity,
              what kind of process is this anyway?
              \_ This has nothing to do with "Linux reliability"; it has to
                 do with trying to kill a process that's blocked.  The most
                 common scenario is a disk wait; NFS server goes away or
                 physical I/O error on hard disk hangs up the process.  -tom
              \_ I believe some low range of numbers are reserved for
                 kernel processes only and will not let you kill it with
                 conventional kill method.
                  \_ no, you can kill any process with kill signals from the
                     root user.   However, kill is just a signal, and there
                     are paths (i.e. device wait) where processes are too
                     wedged to process the signal and die nicely.  -ERic
                     \_ The process in question is referencing a SCSI device.
                        Is there a way to kill the device wait? -op
                        \- the singal handler will not send a signal to a
                        \- the singal dispatcher will not send a signal to a
                           process in a disk wait. you could try while 1
                           kill -9 PID or force an umount of the fs ... but
                           that's all unpredictable. there are some super
                           hairy things you can do but they are beyond the
                           scope of the motd and are os dep.
                        \_ Plug the SCSI device back in.  -tom
                           \_ Never unplugged it. Turned it off and turned it
                              back on. No love.
                              \- it might be interesting to see what happens
                                 if you change run levels or unload the
                                 scsi kernel module [rmmod on AssOS].
                                 But as I said earlier, if you dont know
                                 why you are going in a disk wait ... it
                                 could be something obvious like removable
                                 media, netowkring going out etc ... or it
                                 could be unclear [failing disk?] ... that's
                                 what you should be trying to figure out.--psb
                              \_ Anyway, your fundamental problem is that
                                 your process is waiting on the SCSI device,
                                 and it won't go away untill the SCSI device
                                 unblocks.  Look at rescan-scsi-bus or
                                 something.  -tom
                  \- You should try to undnerstand why "kill -9" isnt
                     killing the process.
        \_ Short answer: No.  You're waiting on the drive.  If your proc is
           in locked in 'D' state, you're hosed.  Figure out what is wrong
           with your drive.
        \_ Followup: Rebooted the machine, and now all is well. Will explore
           other methods of downing this particular process in case this
           arises again. Thanks to all for suggestions and information. -op
           \_ Depending on exactly what is going on you may not be able to
              kill the proc.  At my first job I was 'tape back up guy' among
              other things.  We had first gen crappy tape drives that often
              just stopped responding to commands.  Usually power cycling the
              tape drive would clear the procs, but very rarely that wasn't
              good enough and a full reboot was required.  These were Suns.
              YMMV, but you may find there is no answer beyond 'reboot'.
              \- tape drive device drivers look more like disk device drivers
                 than tapes looked like disks. --psb
                 \_ Uhm, ok.  Yes.  Are you supporting what I said or
                    disagreeing in some way or ...?   --confused
2025/05/24 [General] UID:1000 Activity:popular
5/24    

You may also be interested in these entries...
2012/1/4-2/6 [Computer/HW/Drives] UID:54281 Activity:nil
1/4     I want to test how my servers behave during a disk failure and
        a RAID reconstruction so I want to simulate a hardware failure.
        How can I do this in Linux without having to physically pull
        a drive? These disks are behind a RAID card and run Linux. -ausman
        \_ According to the Linux RAID wiki, you might be able to use mdadm
           to do this with something like the following:
	...
2011/9/14-10/25 [Computer/HW/Drives] UID:54173 Activity:nil
9/13    Thanks to Jordan, our disk server is no longer virtualized. Our long
        nightmare of poor IO performance should hopefully be over. Prepare for
        another long nightmare of poor hardware reliability!
        ...
        Just kidding! (I hope)
        In any case, this means that cooler was taken out back and shot, and
	...
2011/2/14-4/20 [Computer/SW/Unix] UID:54039 Activity:nil
2/14    You sure soda isn't running windows in disguise?  It would explain the
        uptimes.
        \_ hardly, My winbox stays up longer.
        \_ Nobody cares about uptime anymore brother, that's what web2.0 has
           taught us.  Everything is "stateless".
           \_ You;d think gamers would care more about uptime.
	...
2010/7/22-8/9 [Computer/SW/OS/FreeBSD, Computer/HW/Drives] UID:53893 Activity:nil
7/22    Playing with dd if=/dev/random of=/dev/<disk> on linux and bsd:
        2 questions, on linux when <disk>==hda it always gives me this off
        by one report i.e. Records out == records in-1 and says there is an
        error. Has anyone else seen this?  Second, when trying to repeat this
        on bsd, <disk>==rwd0 now, to my surprise, using the install disk and
        selecting (S)hell, when I try to dd a 40 gig disk it says "409 records
	...
2009/10/27-11/3 [Computer/HW/Drives] UID:53474 Activity:nil
10/27   I just read an article that Facebook had moved their database
        to all SSD to speed throughput, but now I can't find it. Has
        anyone else seen this? Any experience with doing this? -ausman
        \_ I hope you're not running mission critical data:
           http://ask.slashdot.org/story/09/10/27/1559248/Reliability-of-PC-Flash-SSDs?from=rss
        \_ Do you have any idea how much storage space is used by Facebook,
	...
2009/8/4-13 [Computer/SW/OS/Windows] UID:53239 Activity:kinda low
8/3     VMWare + Windows XP + Validation question. I need to test stuff with
        Service Pack 3 installed. I have a valid key that I own (yeah yeah I
        actually *bought* a copy, please don't flame me for supporting evil
        M$). Is it possible to register the key once, and then duplicate it
        for testing purposes?  Will Windows or Microsoft detect copies and
        disable the rest the copies?
	...
2009/7/28-8/6 [Computer/HW/Drives] UID:53216 Activity:nil
7/28    Does it make sense to defragment disks on VMWare? My 80GB disk
        on VMWare isn't really using 80GB, it just uses what it needs.
        Will defragment do anything to it?
        \_ If you want to speed up disk operation in your VM, it's best to
           defragment the disks in your VM, then defragment the disk on your
           host machine where the VM files are.
	...
2009/7/24-27 [Computer/SW/WWW/Browsers, Computer/SW/OS/OsX] UID:53191 Activity:kinda low
7/24    Firefox 3.5.1 on MacOS is a piece of crap. It crashes ALL THE TIME.
        It has crashed 3 or 4 times on me in the last hour, and not on
        the same pages either. The new Yahoo! home page also sucks ass.
        \_ os x keeps trashing my raid disk: '11 hours to rebuild. have fun
           with the kernel IO subsystem running like shit until then".
           Worthless piece of shit.
	...
2009/7/17-24 [Computer/SW/OS/OsX] UID:53156 Activity:kinda low
7/17    -rw-r--r--@
        What does the "at sign" mean? This is on Mac OS. VMWare disk file.
        \_ The file has metadata attributes
           \_ How do I add/delete attributes to files? What about
              -rw-r--r--+ <-- what is the "+" sign? Also how do you make
              tar preserve these attributes?
	...