source: general/sysutils/unzip.xml@ 86eaa277

10.0 10.1 11.0 11.1 11.2 11.3 12.0 12.1 6.2 6.2.0 6.2.0-rc1 6.2.0-rc2 6.3 6.3-rc1 6.3-rc2 6.3-rc3 7.10 7.4 7.5 7.6 7.6-blfs 7.6-systemd 7.7 7.8 7.9 8.0 8.1 8.2 8.3 8.4 9.0 9.1 basic bdubbs/svn elogind gnome kde5-13430 kde5-14269 kde5-14686 kea ken/TL2024 ken/inkscape-core-mods ken/tuningfonts krejzi/svn lazarus lxqt nosym perl-modules plabs/newcss plabs/python-mods python3.11 qt5new rahul/power-profiles-daemon renodr/vulkan-addition systemd-11177 systemd-13485 trunk upgradedb xry111/intltool xry111/llvm18 xry111/soup3 xry111/test-20220226 xry111/xf86-video-removal
Last change on this file since 86eaa277 was 86eaa277, checked in by Dan Nichilson <dnicholson@…>, 17 years ago

Implemented Alexander Patrakov's Locale Related Issues changes

git-svn-id: svn://svn.linuxfromscratch.org/BLFS/trunk/BOOK@6364 af4574ff-66df-0310-9fd7-8a98e5e911e0

  • Property mode set to 100644
File size: 10.3 KB
Line 
1<?xml version="1.0" encoding="ISO-8859-1"?>
2<!DOCTYPE sect1 PUBLIC "-//OASIS//DTD DocBook XML V4.4//EN"
3 "http://www.oasis-open.org/docbook/xml/4.4/docbookx.dtd" [
4 <!ENTITY % general-entities SYSTEM "../../general.ent">
5 %general-entities;
6
7 <!ENTITY unzip-download-http "http://www.mirrorservice.org/sites/ftp.info-zip.org/pub/infozip/src/unzip552.tar.gz">
8 <!ENTITY unzip-download-ftp "ftp://ftp.info-zip.org/pub/infozip/src/unzip552.tar.gz">
9 <!ENTITY unzip-md5sum "9d23919999d6eac9217d1f41472034a9">
10 <!ENTITY unzip-size "1.1 MB">
11 <!ENTITY unzip-buildsize "6.7 MB">
12 <!ENTITY unzip-time "0.1 SBU">
13]>
14
15<sect1 id="unzip" xreflabel="UnZip-&unzip-version;">
16 <?dbhtml filename="unzip.html"?>
17
18 <sect1info>
19 <othername>$LastChangedBy$</othername>
20 <date>$Date$</date>
21 </sect1info>
22
23 <title>UnZip-&unzip-version;</title>
24
25 <indexterm zone="unzip">
26 <primary sortas="a-UnZip">UnZip</primary>
27 </indexterm>
28
29 <sect2 role="package">
30 <title>Introduction to UnZip</title>
31
32 <para>The <application>UnZip</application> package contains
33 <filename>ZIP</filename> extraction utilities. These are useful for
34 extracting files from <filename>ZIP</filename> archives.
35 <filename>ZIP</filename> archives are created with
36 <application>PKZIP</application> or <application>Info-ZIP</application>
37 utilities, primarily in a DOS environment.</para>
38
39 <caution>
40 <para>The <application>UnZip</application> package has some locale
41 related issues. See the discussion below in the
42 <xref linkend="unzip-locale-issues"/> section. A more general
43 discussion of these problems can be found on the
44 <xref linkend="locale-issues"/> page.</para>
45 </caution>
46
47 <bridgehead renderas="sect3">Package Information</bridgehead>
48 <itemizedlist spacing="compact">
49 <listitem>
50 <para>Download (HTTP): <ulink url="&unzip-download-http;"/></para>
51 </listitem>
52 <listitem>
53 <para>Download (FTP): <ulink url="&unzip-download-ftp;"/></para>
54 </listitem>
55 <listitem>
56 <para>Download MD5 sum: &unzip-md5sum;</para>
57 </listitem>
58 <listitem>
59 <para>Download size: &unzip-size;</para>
60 </listitem>
61 <listitem>
62 <para>Estimated disk space required: &unzip-buildsize;</para>
63 </listitem>
64 <listitem>
65 <para>Estimated build time: &unzip-time;</para>
66 </listitem>
67 </itemizedlist>
68
69 <para condition="html" role="usernotes">User Notes:
70 <ulink url="&blfs-wiki;/unzip"/></para>
71
72 </sect2>
73
74 <sect2 id="unzip-locale-issues">
75 <title>UnZip Locale Issues</title>
76
77 <note>
78 <para>Use of <application>UnZip</application> in the
79 <application>JDK</application>, <application>Mozilla</application>,
80 <application>DocBook</application> or any other BLFS package
81 installation is not a problem, as BLFS instructions never use
82 <application>UnZip</application> to extract a file with non-ASCII
83 characters in the file's name.</para>
84 </note>
85
86 <para>The <application>UnZip</application> package assumes that filenames
87 stored in the ZIP archives created on non-Unix systems are encoded in
88 CP850, and that they should be converted to ISO-8859-1 when writing files
89 onto the filesystem. Such assumptions are not always valid. In fact,
90 inside the ZIP archive, filenames are encoded in the DOS codepage that is
91 in use in the relevant country, and the filenames on disk should be in
92 the locale encoding. In MS Windows, the OemToChar() C function (from
93 <filename>User32.DLL</filename>) does the correct conversion (which is
94 indeed the conversion from CP850 to a superset of ISO-8859-1 if MS
95 Windows is set up to use the US English language), but there is no
96 equivalent in Linux.</para>
97
98 <para>When using <command>unzip</command> to unpack a ZIP archive
99 containing non-ASCII filenames, the filenames are damaged because
100 <command>unzip</command> uses improper conversion when any of its
101 encoding assumptions are incorrect. For example, in the ru_RU.KOI8-R
102 locale, conversion of filenames from CP866 to KOI8-R is required, but
103 conversion from CP850 to ISO-8859-1 is done, which produces filenames
104 consisting of undecipherable characters instead of words (the closest
105 equivalent understandable example for English-only users is rot13). There
106 are several ways around this limitation:</para>
107
108 <para>1) For unpacking ZIP archives with filenames containing non-ASCII
109 characters, use <ulink url="http://www.winzip.com/">WinZip</ulink> while- running the <ulink url="http://www.winehq.com/">Wine</ulink> Windows
110 emulator.</para>
111
112 <para>2) After running <command>unzip</command>, fix the damage made to
113 the filenames using the <command>convmv</command> tool
114 (<ulink url="http://j3e.de/linux/convmv/"/>). The following is an example
115 for the ru_RU.KOI8-R locale:</para>
116
117 <blockquote>
118 <para>Step 1. Undo the conversion done by
119 <command>unzip</command>:</para>
120
121<screen><userinput>convmv -f iso-8859-1 -t cp850 -r --nosmart --notest \
122 <replaceable>&lt;/path/to/unzipped/files&gt;</replaceable></userinput></screen>
123
124 <para>Step 2. Do the correct conversion instead:</para>
125
126<screen><userinput>convmv -f cp866 -t koi8-r -r --nosmart --notest \
127 <replaceable>&lt;/path/to/unzipped/files&gt;</replaceable></userinput></screen>
128 </blockquote>
129
130 <para>3) Apply this patch to unzip:
131 <ulink url="https://bugzilla.altlinux.ru/attachment.cgi?id=532"/></para>
132
133 <para>It allows to specify the assumed filename encoding in the ZIP
134 archive using the <option>-O charset_name</option> option and the
135 on-disk filename encoding using the <option>-I charset_name</option>
136 option. Defaults: the on-disk filename encoding is the locale encoding,
137 the encoding inside the ZIP archive is guessed according to the builtin
138 table based on the locale encoding. For US English users, this still
139 means that unzip converts from CP850 to ISO-8859-1 by default.</para>
140
141 <para>Caveat: this method works only with 8-bit locale encodings, not
142 with UTF-8. Attempting to use a patched <command>unzip</command> in UTF-8
143 locales may result in a segmentation fault and is probably a security
144 risk.</para>
145
146 </sect2>
147
148 <sect2 role="installation">
149 <title>Installation of UnZip</title>
150
151 <para>Install <application>UnZip</application> by running the following
152 commands:</para>
153
154<screen><userinput>make -f unix/Makefile LOCAL_UNZIP=-D_FILE_OFFSET_BITS=64 linux</userinput></screen>
155
156 <para>To test the results, issue: <command>make check</command>.</para>
157
158 <para>Now, as the <systemitem class="username">root</systemitem> user:</para>
159
160<screen role="root"><userinput>make prefix=/usr install</userinput></screen>
161
162 </sect2>
163
164 <sect2 role="commands">
165 <title>Command Explanations</title>
166
167 <para><parameter>linux</parameter>:
168 This target in the <filename>Makefile</filename> makes assumptions
169 that are useful for a Linux system when compiling the executables.
170 To obtain alternatives to this target, use <command>make list</command></para>
171
172 <para><parameter>LOCAL_UNZIP=...</parameter>:
173 This sets the compilation flags to allow <application>UnZip</application>
174 to handle files up to 4 GB.</para>
175
176 </sect2>
177
178 <sect2 role="content">
179 <title>Contents</title>
180
181 <segmentedlist>
182 <segtitle>Installed Programs</segtitle>
183 <segtitle>Installed Libraries</segtitle>
184 <segtitle>Installed Directories</segtitle>
185
186 <seglistitem>
187 <seg>funzip, unzip, unzipfsx, zipgrep, and zipinfo</seg>
188 <seg>None</seg>
189 <seg>None</seg>
190 </seglistitem>
191 </segmentedlist>
192
193 <variablelist>
194 <bridgehead renderas="sect3">Short Descriptions</bridgehead>
195 <?dbfo list-presentation="list"?>
196 <?dbhtml list-presentation="table"?>
197
198 <varlistentry id="funzip">
199 <term><command>funzip</command></term>
200 <listitem>
201 <para>allows the output of <command>unzip</command> commands to be
202 redirected.</para>
203 <indexterm zone="unzip funzip">
204 <primary sortas="b-funzip">funzip</primary>
205 </indexterm>
206 </listitem>
207 </varlistentry>
208
209 <varlistentry id="unzip-prog">
210 <term><command>unzip</command></term>
211 <listitem>
212 <para>lists, tests or extracts files from a <filename>ZIP</filename>
213 archive.</para>
214 <indexterm zone="unzip unzip-prog">
215 <primary sortas="b-unzip">unzip</primary>
216 </indexterm>
217 </listitem>
218 </varlistentry>
219
220 <varlistentry id="unzipfsx">
221 <term><command>unzipfsx</command></term>
222 <listitem>
223 <para>is a self-extracting stub that can be prepended to a
224 <filename>ZIP</filename> archive. Files in this format allow the
225 recipient to decompress the archive without installing
226 <application>UnZip</application>.</para>
227 <indexterm zone="unzip unzipfsx">
228 <primary sortas="b-unzipfsx">unzipfsx</primary>
229 </indexterm>
230 </listitem>
231 </varlistentry>
232
233 <varlistentry id="zipgrep">
234 <term><command>zipgrep</command></term>
235 <listitem>
236 <para>searches files in a <filename>ZIP</filename> archive for
237 lines matching a pattern.</para>
238 <indexterm zone="unzip zipgrep">
239 <primary sortas="b-zipgrep">zipgrep</primary>
240 </indexterm>
241 </listitem>
242 </varlistentry>
243
244 <varlistentry id="zipinfo">
245 <term><command>zipinfo</command></term>
246 <listitem>
247 <para>produces technical information about the files in a
248 <filename>ZIP</filename> archive, including file access permissions,
249 encryption status, type of compression, etc.</para>
250 <indexterm zone="unzip zipinfo">
251 <primary sortas="b-zipinfo">zipinfo</primary>
252 </indexterm>
253 </listitem>
254 </varlistentry>
255
256 <varlistentry id="libunzip">
257 <term><filename class='libraryfile'>libunzip.so</filename></term>
258 <listitem>
259 <para>contains the API functions required by the
260 <application>UnZip</application> programs.</para>
261 <indexterm zone="unzip libunzip">
262 <primary sortas="c-libunzip">libunzip.so</primary>
263 </indexterm>
264 </listitem>
265 </varlistentry>
266
267 </variablelist>
268
269 </sect2>
270
271</sect1>
Note: See TracBrowser for help on using the repository browser.