Converting Word to Excel เธรดต่อผู้เขียนข้อความ: Anu Mukharji-Gorski
|
Hallo,
Sorry if this isn't the correct forum to be posting in ...
For an analysis I'm carrying out on sentence structure, I'm looking for a way to convert a Word document into an Excel one. Every sentence in the Word document should start on a new row in Excel. There are two articles, both about 15 pages long. Is there a way of doing this?
Would be very grateful for any hints.
Thanks,
Anu | | | Marc P (X) Local time: 02:46 ภาษาเยอรมัน เป็น ภาษาอังกฤษ + ... Simple procedure | Dec 16, 2007 |
In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.
Mark all the text and convert it to table.
Open Excel, copy the table in Word and paste it into an empty Excel spreadsheet.
You might want to fine-tune this, e.g. by searching for question marks, exclamation marks, etc. as well.
Marc | | | Samuel Murray เนเธอร์แลนด์ Local time: 02:46 สมาชิก (2006) ภาษาอังกฤษ เป็น ภาษาอาฟริกา + ...
Swaiyam wrote:
Every sentence in the Word document should start on a new row in Excel.
Okay, this is easy to accomplish (in theory). To convert an MS Word file into Excel in the way you describe, you just have to ensure that each sentence is in a separate "paragraph".
You can accomplish that using Marc's approach, or if you have a CAT tool you can do an autotranslate and grab the sentences from the TM. If you have Wordfast (even the free version), you can do an "Extract" which will produce a file much like the one you require.
Another useful tool to have is a text editor that allows you to deactivate word wrap (you can't disable word wrap in MS Word). I suggest Metapad, if you don't use Unicode.
[Edited at 2007-12-16 09:43] | | |
Marc P wrote:
In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.
Thanks, it's worked great. The dates and "z.B." are driving me up the wall (a bit) but I guess that's something that can't be avoided
Anu | |
|
|
If you have Wordfast (even the free version), you can do an "Extract" which will produce a file much like the one you require.
Thanks for the suggestion, Samuel. That's something I'll try next time
Anu | | | Edward LIU แคนาดา Local time: 20:46 ภาษาจีน เป็น ภาษาอังกฤษ + ... How can you enter paragraph mark in the Replace Tab? | Jun 1, 2008 |
Marc P wrote:
In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.
Mark all the text and convert it to table.
Open Excel, copy the table in Word and paste it into an empty Excel spreadsheet.
You might want to fine-tune this, e.g. by searching for question marks, exclamation marks, etc. as well.
Marc
How can you enter paragraph mark in the Replace Tab? Everytime I hit the return key, the cursor simply moves away. | | | Tony M ฝรั่งเศส Local time: 02:46 ภาษาฝรั่งเศส เป็น ภาษาอังกฤษ + ... SITE LOCALIZER Entering paragraph marks and other special characters in 'search-&-replace' | Jun 1, 2008 |
On the dialogue box for 'search and/or replace', click the button near the bottom that is labelled 'special characters', and it gives you the codes for all the things like hard/soft returns etc. Once you have learnt the commonest ones, you can simply type them directly into the search field; for example, the code for a hard return (= paragraph mark) is ^p
I hope that helps!
[Edited at 2008-06-01 17:25] | | | Tony M ฝรั่งเศส Local time: 02:46 ภาษาฝรั่งเศส เป็น ภาษาอังกฤษ + ... SITE LOCALIZER Workaround for things like z.B. | Jun 1, 2008 |
Swaiyam wrote:
The dates and "z.B." are driving me up the wall (a bit) but I guess that's something that can't be avoided
Anu
Actually, in a lot of cases, you can avoid it!
If you have something predictable like z.B., for example, all you need do is first search for this exact expression, and replace the full stops by some other character that never occurs elsewhere in your text — for example, perhaps §, or somesuch. So you get a document littered with frightening-looking things like z§B§!
Then you can do the rest of your manœuvre, and finally, once all the full stops have been replaced with . + [paragraph mark], you can then go back and re-search and replace for z§B§ to replace it with z.B. again.
Of course, you need to do this manœuvre for each problem string you may have, and it may be less easy for dates (but you could try using the 'any digit' wildcard and see if that worked..) | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Converting Word to Excel TM-Town | Manage your TMs and Terms ... and boost your translation business
Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.
More info » |
| Wordfast Pro | Translation Memory Software for Any Platform
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Buy now! » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |