<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://okapiframework.org/wiki/index.php?action=history&amp;feed=atom&amp;title=How_to_Extract_Text_for_Translation</id>
	<title>How to Extract Text for Translation - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://okapiframework.org/wiki/index.php?action=history&amp;feed=atom&amp;title=How_to_Extract_Text_for_Translation"/>
	<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=How_to_Extract_Text_for_Translation&amp;action=history"/>
	<updated>2026-05-21T09:31:08Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.38.2</generator>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=How_to_Extract_Text_for_Translation&amp;diff=235&amp;oldid=prev</id>
		<title>Ysavourel: 1 revision imported</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=How_to_Extract_Text_for_Translation&amp;diff=235&amp;oldid=prev"/>
		<updated>2016-06-04T23:19:56Z</updated>

		<summary type="html">&lt;p&gt;1 revision imported&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 19:19, 4 June 2016&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;en&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Ysavourel</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=How_to_Extract_Text_for_Translation&amp;diff=234&amp;oldid=prev</id>
		<title>Ysavourel at 13:28, 3 January 2012</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=How_to_Extract_Text_for_Translation&amp;diff=234&amp;oldid=prev"/>
		<updated>2012-01-03T13:28:23Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;__TOC__&lt;br /&gt;
==Using Rainbow==&lt;br /&gt;
&lt;br /&gt;
This example illustrates how to prepare a simple translation package. It assumes you do not have an existing translation memory available, don't pre-translate or pre-segment. It just creates a basic set of XLIFF files you can translate using one of the several editors that support XLIFF.&lt;br /&gt;
&lt;br /&gt;
For this example execute the following steps:&lt;br /&gt;
&lt;br /&gt;
1. Start Rainbow.&lt;br /&gt;
&lt;br /&gt;
2. Drop the files you want to prepare in the &amp;lt;cite&amp;gt;Input List 1&amp;lt;/cite&amp;gt; tab of the main window.&lt;br /&gt;
&lt;br /&gt;
3. Associate each input document with its appropriate [[Filter Configurations|filter configuration]]. In many case the configuration selected by default is fine. In some cases you may need to defined [[Understanding Filter Configurations|custom configurations]] (For an example, see the article &amp;quot;[[How to Create a Custom Configuration for the XML Filter]]).&lt;br /&gt;
&lt;br /&gt;
4. In the &amp;lt;cite&amp;gt;Languages and Encodings&amp;lt;/cite&amp;gt; tab: Set the proper languages and encodings. You do not need to set the encoding for the file formats where the encoding can be detected automatically (e.g. XML documents).&lt;br /&gt;
&lt;br /&gt;
5. In the &amp;lt;cite&amp;gt;Other Settings&amp;lt;/cite&amp;gt; tab: Specify what file name your translated files should have. By default they will be named the same as the source document, but with an extra &amp;quot;&amp;lt;code&amp;gt;.out&amp;lt;/code&amp;gt;&amp;quot; extension just before the original extension (e.g. &amp;lt;code&amp;gt;myFile.txt&amp;lt;/code&amp;gt; will become &amp;lt;code&amp;gt;myFile.out.txt&amp;lt;/code&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
6. Select &amp;lt;cite&amp;gt;Utilities&amp;lt;/cite&amp;gt; &amp;gt; &amp;lt;cite&amp;gt;Translation Kit Creation&amp;lt;/cite&amp;gt;. This opens the &amp;lt;cite&amp;gt;Translation Kit Creation&amp;lt;/cite&amp;gt; pre-defined pipeline.&lt;br /&gt;
&lt;br /&gt;
7. Go to the &amp;lt;cite&amp;gt;Rainbow Translation Kit Creation&amp;lt;/cite&amp;gt; entry.&lt;br /&gt;
&lt;br /&gt;
8. In the &amp;lt;cite&amp;gt;Package Format&amp;lt;/cite&amp;gt; tab: Select the type of package to create. For this example choose &amp;lt;cite&amp;gt;Generic XLIFF&amp;lt;/cite&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
9. In the &amp;lt;cite&amp;gt;Output Location&amp;lt;/cite&amp;gt; tab: Enter the root part of the output directory, for example in Windows: &amp;lt;code&amp;gt;C:\Tmp\Extraction&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
10. Enter also the name of the package to create. This will be the added to the root directory and make up the full output path. For example, if you enter &amp;lt;code&amp;gt;pack1&amp;lt;/code&amp;gt; the output directory will be &amp;lt;code&amp;gt;C:\Tmp\Extraction\pack1&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
11. Because you are creating a simple package and not using any additional option, you do not need to set other parameters. Simply click &amp;lt;cite&amp;gt;Execute&amp;lt;/cite&amp;gt; to start the process. Any error or warning will be displayed in the &amp;lt;cite&amp;gt;Log&amp;lt;/cite&amp;gt; window.&lt;br /&gt;
&lt;br /&gt;
Assuming you have prepared a single file called &amp;lt;code&amp;gt;myFile.html&amp;lt;/code&amp;gt;, you should have something like this:&lt;br /&gt;
&lt;br /&gt;
  C:\Tmp\Extraction\pack1&lt;br /&gt;
  |&lt;br /&gt;
  +--- original&lt;br /&gt;
  |    |&lt;br /&gt;
  |    +--- myFile.html&lt;br /&gt;
  |&lt;br /&gt;
  +--- work&lt;br /&gt;
  |    |&lt;br /&gt;
  |    +--- myFile.html.xlf&lt;br /&gt;
  |&lt;br /&gt;
  +--- manifest.rkm&lt;br /&gt;
&lt;br /&gt;
* The &amp;lt;code&amp;gt;original&amp;lt;/code&amp;gt; directory contains the data needs to merge back the extracted text into its original format. You should not touch those files.&lt;br /&gt;
* The &amp;lt;code&amp;gt;work&amp;lt;/code&amp;gt; directory contains the XLIFF documents that need to be translated.&lt;br /&gt;
* The file &amp;lt;code&amp;gt;manifest.rkm&amp;lt;/code&amp;gt; is used lated to merge back the extracted data. You should not touch this file.&lt;br /&gt;
&lt;br /&gt;
At this point you are ready to translate your package. See: &amp;quot;[[How to Translate XLIFF Documents]]&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Once the XLIFF documents are translated you need to post-process the package to generate the translated documents in their original format. See: &amp;quot;[[How to Post-Process Extracted Text]]&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Using Tikal==&lt;br /&gt;
&lt;br /&gt;
1. Start a DOS prompt or a shell console from where you can run Tikal.&lt;br /&gt;
&lt;br /&gt;
2. Assuming:&lt;br /&gt;
&lt;br /&gt;
* You have a file &amp;lt;code&amp;gt;myFile.html&amp;lt;/code&amp;gt; in the current directory.&lt;br /&gt;
* The file is in English and you want to translate into Japanese.&lt;br /&gt;
&lt;br /&gt;
Execute the command-line:&lt;br /&gt;
&lt;br /&gt;
 tikal -x myFile.html -sl en -tl ja &lt;br /&gt;
&lt;br /&gt;
A file &amp;lt;code&amp;gt;myFile.html.xlf&amp;lt;/code&amp;gt; is created. It contains the extracted text for the original document.&lt;br /&gt;
&lt;br /&gt;
Here the filter used is automatically detected using the extension of the input file, but if you need to use a specific filter configuration you can use the &amp;lt;code&amp;gt;-fc&amp;lt;/code&amp;gt; parameter. For example if you have a file &amp;lt;code&amp;gt;myFile.txt&amp;lt;/code&amp;gt; that is a tab-delimited document with specific columns to extract and you have created a filter configuration for it, you would run:&lt;br /&gt;
&lt;br /&gt;
 tikal -x myFile.csv -sl en -tl ja -fc okf_table@myConfig&lt;br /&gt;
&lt;br /&gt;
Where &amp;lt;code&amp;gt;okf_table@myConfig&amp;lt;/code&amp;gt; is the filter configuration identifier to use.&lt;br /&gt;
&lt;br /&gt;
Tikal offers other extraction options such as pre-segmentation, leveraging, etc.&lt;br /&gt;
&lt;br /&gt;
At this point you are ready to translate your XLIFF file. See: &amp;quot;[[How to Translate XLIFF Documents]]&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Once the XLIFF documents are translated you need to post-process them to generate the translated documents in their original format. See: &amp;quot;[[How to Post-Process Extracted Text]]&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
[[Category:Rainbow]] [[Category:Tikal]] [[Category:XLIFF]]&lt;/div&gt;</summary>
		<author><name>Ysavourel</name></author>
	</entry>
</feed>