Converting Word to Excel
เธรดต่อผู้เขียนข้อความ: Anu Mukharji-Gorski
Anu Mukharji-Gorski
Anu Mukharji-Gorski  Identity Verified
เยอรมนี
Local time: 02:46
ภาษาเยอรมัน เป็น ภาษาอังกฤษ
+ ...
Dec 16, 2007

Hallo,

Sorry if this isn't the correct forum to be posting in ...

For an analysis I'm carrying out on sentence structure, I'm looking for a way to convert a Word document into an Excel one. Every sentence in the Word document should start on a new row in Excel. There are two articles, both about 15 pages long. Is there a way of doing this?

Would be very grateful for any hints.

Thanks,
Anu


 
Marc P (X)
Marc P (X)  Identity Verified
Local time: 02:46
ภาษาเยอรมัน เป็น ภาษาอังกฤษ
+ ...
Simple procedure Dec 16, 2007

In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.

Mark all the text and convert it to table.

Open Excel, copy the table in Word and paste it into an empty Excel spreadsheet.

You might want to fine-tune this, e.g. by searching for question marks, exclamation marks, etc. as well.

Marc


 
Samuel Murray
Samuel Murray  Identity Verified
เนเธอร์แลนด์
Local time: 02:46
สมาชิก (2006)
ภาษาอังกฤษ เป็น ภาษาอาฟริกา
+ ...
A solution Dec 16, 2007

Swaiyam wrote:
Every sentence in the Word document should start on a new row in Excel.


Okay, this is easy to accomplish (in theory). To convert an MS Word file into Excel in the way you describe, you just have to ensure that each sentence is in a separate "paragraph".

You can accomplish that using Marc's approach, or if you have a CAT tool you can do an autotranslate and grab the sentences from the TM. If you have Wordfast (even the free version), you can do an "Extract" which will produce a file much like the one you require.

Another useful tool to have is a text editor that allows you to deactivate word wrap (you can't disable word wrap in MS Word). I suggest Metapad, if you don't use Unicode.


[Edited at 2007-12-16 09:43]


 
Anu Mukharji-Gorski
Anu Mukharji-Gorski  Identity Verified
เยอรมนี
Local time: 02:46
ภาษาเยอรมัน เป็น ภาษาอังกฤษ
+ ...
TOPIC STARTER
Thank you! Dec 17, 2007

Marc P wrote:

In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.


Thanks, it's worked great. The dates and "z.B." are driving me up the wall (a bit) but I guess that's something that can't be avoided

Anu


 
Anu Mukharji-Gorski
Anu Mukharji-Gorski  Identity Verified
เยอรมนี
Local time: 02:46
ภาษาเยอรมัน เป็น ภาษาอังกฤษ
+ ...
TOPIC STARTER
Extract Dec 17, 2007

If you have Wordfast (even the free version), you can do an "Extract" which will produce a file much like the one you require.


Thanks for the suggestion, Samuel. That's something I'll try next time

Anu


 
Edward LIU
Edward LIU  Identity Verified
แคนาดา
Local time: 20:46
ภาษาจีน เป็น ภาษาอังกฤษ
+ ...
How can you enter paragraph mark in the Replace Tab? Jun 1, 2008

Marc P wrote:

In Word, run a search and replace for full stops (periods) and replace them with paragraph marks.

Mark all the text and convert it to table.

Open Excel, copy the table in Word and paste it into an empty Excel spreadsheet.

You might want to fine-tune this, e.g. by searching for question marks, exclamation marks, etc. as well.

Marc


How can you enter paragraph mark in the Replace Tab? Everytime I hit the return key, the cursor simply moves away.


 
Tony M
Tony M
ฝรั่งเศส
Local time: 02:46
ภาษาฝรั่งเศส เป็น ภาษาอังกฤษ
+ ...
SITE LOCALIZER
Entering paragraph marks and other special characters in 'search-&-replace' Jun 1, 2008

On the dialogue box for 'search and/or replace', click the button near the bottom that is labelled 'special characters', and it gives you the codes for all the things like hard/soft returns etc. Once you have learnt the commonest ones, you can simply type them directly into the search field; for example, the code for a hard return (= paragraph mark) is ^p

I hope that helps!

[Edited at 2008-06-01 17:25]


 
Tony M
Tony M
ฝรั่งเศส
Local time: 02:46
ภาษาฝรั่งเศส เป็น ภาษาอังกฤษ
+ ...
SITE LOCALIZER
Workaround for things like z.B. Jun 1, 2008


Swaiyam wrote:
The dates and "z.B." are driving me up the wall (a bit) but I guess that's something that can't be avoided
Anu


Actually, in a lot of cases, you can avoid it!

If you have something predictable like z.B., for example, all you need do is first search for this exact expression, and replace the full stops by some other character that never occurs elsewhere in your text — for example, perhaps §, or somesuch. So you get a document littered with frightening-looking things like z§B§!

Then you can do the rest of your manœuvre, and finally, once all the full stops have been replaced with . + [paragraph mark], you can then go back and re-search and replace for z§B§ to replace it with z.B. again.

Of course, you need to do this manœuvre for each problem string you may have, and it may be less easy for dates (but you could try using the 'any digit' wildcard and see if that worked..)


 


To report site rules violations or get help, contact a site moderator:

ผู้ไกล่เกลี่ยของฟอรัมนี้
Maya Gorgoshidze[Call to this topic]
Prachya Mruetusatorn[Call to this topic]

You can also contact site staff by submitting a support request »

Converting Word to Excel






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »