Berkeley CSUA MOTD:Entry 28656
Berkeley CSUA MOTD
 
WIKI | FAQ | Tech FAQ
http://csua.com/feed/
2017/09/21 [General] UID:1000 Activity:popular
9/21    

2003/6/6-7 [Computer/SW/Apps] UID:28656 Activity:nil
6/6     Is there any sort of utility that takes an acrobat file that is
        mostly text and then extracts the text and outputs it as a word
        document?
        \_ Omnipage Pro
           \_ And by that, I mean you don't have to print it and scan it.  It
              can open .pdf files.  Sadly, it doesn't appear to extract the text
              (I've had documents that I could copy from in Acrobat, and while
              the text was there and correct, the formatting was lost.  Omnipage
              got the formatting right, but some of the text was wrong.  Go
              figure.)
        \_ pdf2ps | ps2html does a pretty good job, and then you can open in
           word, save as .doc
2017/09/21 [General] UID:1000 Activity:popular
9/21    

You may also be interested in these entries...
2009/8/27-9/9 [Computer/SW/OS/OsX] UID:53304 Activity:nil
8/26    Any suggestions on a good OCR program for either OS X or Windows that
        will work on scanned documents outputted to pdf?  Preferably free?
        Thanks, scottyg
        \_ Check Abbyy or Scansoft.  Not free.
           \_ Thanks...I think I'd prefer a free or opensource piece of
              software unless there is a huge difference in quality.  I
	...
2009/3/26-4/2 [Computer/SW/Apps, Computer/SW/Languages/Misc] UID:52760 Activity:nil
3/26    Anyone here uses Heritrix?  I'm trying to read the Intro document at
        http://crawler.archive.org/An%20Introduction%20to%20Heritrix.pdf but
        both Adobe Reader 8.1.3 (Win32) and gv 3.6.5 (cygwin) display error
        messages and show me blank pages.  Adobe displays:
        "Cannot extract the embedded font 'FTXWSG+TimesNewRomanMS'.  Some
        characters may not display or print correctly."
	...
2007/3/29-4/2 [Computer/SW/Virus] UID:46142 Activity:moderate
3/28    After almost a decade of not using windows I'm thinking about getting
        a cheap windows computer.  Security wise what are some musts?
        \_ install Linux
           \_ Haha you are still funny.
              \_ http://www.csua.berkeley.edu/~erikred/imlinux.jpg
           \_ I actually agree w/ this. Install Linux and VMWare. Then
	...
2006/10/10-12 [Computer/HW/Printer] UID:44752 Activity:kinda low
10/10   Any reccomendations for a cheap monochrome laser printer?  Network
        not needed.  A decent-size starter toner cartridge would be nice.
        \_ I got a used HP 2200D off craigslist for $250 or so.  It's
           great, it does postscript and prints on both sides of the
           paper, which is nice.  Speed is not too bad.  -phr
        \_ I recently bought a Samsung 3051n from Newegg for $200 ($250
	...
2006/9/26-27 [Computer/SW/Apps] UID:44550 Activity:nil
9/26    Is there a way to strip away parts of a pdf document in order to
        keep just one part of it, without buying Adobe Acrobat? Thanks.
        \_ convert to postscript, start editing with $EDITOR
        \_ unlicensed copying of Acrobat
        \_ Depending on the document and what you're trying to save, you
           may be able to select the text and copy and paste.  -tom
	...
2006/9/18-20 [Computer/SW/Apps, Computer/SW/OS/Windows] UID:44430 Activity:nil
9/18    Are there any free/shareware tools to edit a pdf file?  Any OS is ok.
        Thanks.
        \_ I'm not sure what you mean by edit (ie do you mean move/delete
           pages or do you mean edit text), but the following might be
           helpful: http://www.accesspdf.com/pdftk
           \_ I mean I have a pdf someone else at work generated but it has
	...
2006/6/27-29 [Computer/SW/Apps] UID:43512 Activity:nil 66%like:43501 75%like:43539
6/27    http://www.csua.berkeley.edu/~erikred/wallchart.pdf
        Updated: 1017 PST
        \_ Updated: 1418 PST
           \_ Next match: Germany v. Argentina, Friday, 0800 PST
              Expect Updates after that.
        \_ What are using to do the updates? Acrobat?
	...
2006/6/7-9 [Computer/SW/OS/OsX] UID:43316 Activity:nil
6/7     What, if any, is the best way to "convert" a PDF document to either
        a text file or a Word document? I tried copying/pasting the text
        (after highlighting it), but the document formatting is lost. I can
        manually enter carriage returns, but it's a long document (~50 pages)
        The entire document is just paragraphs of text. This is on a OS X
        machine. Thanks.
	...
2006/1/30-2/1 [Computer/SW/Database, Computer/SW/Apps] UID:41603 Activity:nil
1/30    What is an easy and free way to extract about 40 pages from a 180 page
        pdf document, so that I end up with one 40 page .pdf file and one
        140 page .pdf file?  I only need to do this once, so if there's
        some business that'll do this, I'd pay for it, but I don't want to
        buy software to only do it once.  I have Acrobat Professional, but I
        can't figure out how to use that to do this.
	...
2005/11/21-23 [Computer/SW/OS/OsX] UID:40679 Activity:moderate
11/21   I've noticed that the OS X Terminal.app and xterm etc are all
        slower than xterm / rxvt on Linux/x86. Even a 500 MHz P2 has
        quicker response than a 1.33 Ghz G4 when you are logged
        into a server via SSH. I'm wondering what could be the cause
        of this lag. Is it something that can be adjusted? I find the
        sluggishness the only think holding me back from using a Mac
	...
2005/9/8-10 [Computer/SW/OS/Windows] UID:39575 Activity:nil
9/8     Is there a better Windows ghostscript front-end than GSview? What do
        you use to view PostScript on Windows?
        \_ Not really. The usual method is to convert to PDF using distiller
           and view it using Acrobat Reader.
	...