source: introduction/important/building-notes.xml@ b6d54494

12.1 ken/TL2024 ken/tuningfonts lazarus plabs/newcss python3.11 rahul/power-profiles-daemon renodr/vulkan-addition trunk xry111/llvm18
Last change on this file since b6d54494 was b6d54494, checked in by Xi Ruoyao <xry111@…>, 9 months ago

building-notes: Note how to use cgroup for limiting resource usage

We were saying "-jN means using N cores (or N threads)". This is
completely wrong. "-jN" only tells the building system to run N jobs
simultaneously, but each job can start their own subprocesses or threads
and there is no way for the building system to know how many
subprocesses or threads a job will start.

This caused a lot of misunderstandings and encouraged users to wrongly
blame building systems.

Fix the description of -jN, and add how to use cgroup to control the
usage of CPU cores and system RAM.

On a systemd-based system, systemd is the cgroup manager and manually
operating on cgroups may puzzle systemd. So use systemd-run for
creating and setting up cgroup. On a sysv-based system create and set
up the cgroup manually.

  • Property mode set to 100644
File size: 60.5 KB
Line 
1<?xml version="1.0" encoding="ISO-8859-1"?>
2<!DOCTYPE sect1 PUBLIC "-//OASIS//DTD DocBook XML V4.5//EN"
3 "http://www.oasis-open.org/docbook/xml/4.5/docbookx.dtd" [
4 <!ENTITY % general-entities SYSTEM "../../general.ent">
5 %general-entities;
6]>
7
8<sect1 id="unpacking">
9 <?dbhtml filename="notes-on-building.html"?>
10
11
12 <title>Notes on Building Software</title>
13
14 <para>Those people who have built an LFS system may be aware
15 of the general principles of downloading and unpacking software. Some
16 of that information is repeated here for those new to building
17 their own software.</para>
18
19 <para>Each set of installation instructions contains a URL from which you
20 can download the package. The patches; however, are stored on the LFS
21 servers and are available via HTTP. These are referenced as needed in the
22 installation instructions.</para>
23
24 <para>While you can keep the source files anywhere you like, we assume that
25 you have unpacked the package and changed into the directory created by the
26 unpacking process (the source directory). We also assume you have
27 uncompressed any required patches and they are in the directory
28 immediately above the source directory.</para>
29
30 <para>We can not emphasize strongly enough that you should start from a
31 <emphasis>clean source tree</emphasis> each time. This means that if
32 you have had an error during configuration or compilation, it's usually
33 best to delete the source tree and
34 re-unpack it <emphasis>before</emphasis> trying again. This obviously
35 doesn't apply if you're an advanced user used to hacking
36 <filename>Makefile</filename>s and C code, but if in doubt, start from a
37 clean tree.</para>
38
39 <sect2>
40 <title>Building Software as an Unprivileged (non-root) User</title>
41
42 <para>The golden rule of Unix System Administration is to use your
43 superpowers only when necessary. Hence, BLFS recommends that you
44 build software as an unprivileged user and only become the
45 <systemitem class='username'>root</systemitem> user when installing the
46 software. This philosophy is followed in all the packages in this book.
47 Unless otherwise specified, all instructions should be executed as an
48 unprivileged user. The book will advise you on instructions that need
49 <systemitem class='username'>root</systemitem> privileges.</para>
50
51 </sect2>
52
53 <sect2>
54 <title>Unpacking the Software</title>
55
56 <para>If a file is in <filename class='extension'>.tar</filename> format
57 and compressed, it is unpacked by running one of the following
58 commands:</para>
59
60<screen><userinput>tar -xvf filename.tar.gz
61tar -xvf filename.tgz
62tar -xvf filename.tar.Z
63tar -xvf filename.tar.bz2</userinput></screen>
64
65 <note>
66 <para>You may omit using the <option>v</option> parameter in the commands
67 shown above and below if you wish to suppress the verbose listing of all
68 the files in the archive as they are extracted. This can help speed up the
69 extraction as well as make any errors produced during the extraction
70 more obvious to you.</para>
71 </note>
72
73 <para>You can also use a slightly different method:</para>
74
75<screen><userinput>bzcat filename.tar.bz2 | tar -xv</userinput></screen>
76
77 <para>
78 Finally, sometimes we have a compressed patch file in
79 <filename class='extension'>.patch.gz</filename> or
80 <filename class='extension'>.patch.bz2</filename> format.
81 The best way to apply the patch is piping the output of the
82 decompressor to the <command>patch</command> utility. For example:
83 </para>
84
85 <screen><userinput>gzip -cd ../patchname.patch.gz | patch -p1</userinput></screen>
86
87 <para>
88 Or for a patch compressed with <command>bzip2</command>:
89 </para>
90
91 <screen><userinput>bzcat ../patchname.patch.bz2 | patch -p1</userinput></screen>
92
93 </sect2>
94
95 <sect2>
96 <title>Verifying File Integrity</title>
97
98 <para>Generally, to verify that the downloaded file is complete,
99 many package maintainers also distribute md5sums of the files. To verify the
100 md5sum of the downloaded files, download both the file and the
101 corresponding md5sum file to the same directory (preferably from different
102 on-line locations), and (assuming <filename>file.md5sum</filename> is the
103 md5sum file downloaded) run the following command:</para>
104
105<screen><userinput>md5sum -c file.md5sum</userinput></screen>
106
107 <para>If there are any errors, they will be reported. Note that the BLFS
108 book includes md5sums for all the source files also. To use the BLFS
109 supplied md5sums, you can create a <filename>file.md5sum</filename> (place
110 the md5sum data and the exact name of the downloaded file on the same
111 line of a file, separated by white space) and run the command shown above.
112 Alternately, simply run the command shown below and compare the output
113 to the md5sum data shown in the BLFS book.</para>
114
115<screen><userinput>md5sum <replaceable>&lt;name_of_downloaded_file&gt;</replaceable></userinput></screen>
116
117 <para>MD5 is not cryptographically secure, so the md5sums are only
118 provided for detecting unmalicious changes to the file content. For
119 example, an error or truncation introduced during network transfer, or
120 a <quote>stealth</quote> update to the package from the upstream
121 (updating the content of a released tarball instead of making a new
122 release properly).</para>
123
124 <para>There is no <quote>100%</quote> secure way to make
125 sure the genuity of the source files. Assuming the upstream is managing
126 their website correctly (the private key is not leaked and the domain is
127 not hijacked), and the trust anchors have been set up correctly using
128 <xref linkend="make-ca"/> on the BLFS system, we can reasonably trust
129 download URLs to the upstream official website
130 <emphasis role="bold">with https protocol</emphasis>. Note that
131 BLFS book itself is published on a website with https, so you should
132 already have some confidence in https protocol or you wouldn't trust the
133 book content.</para>
134
135 <para>If the package is downloaded from an unofficial location (for
136 example a local mirror), checksums generated by cryptographically secure
137 digest algorithms (for example SHA256) can be used to verify the
138 genuity of the package. Download the checksum file from the upstream
139 <emphasis role="bold">official</emphasis> website (or somewhere
140 <emphasis role="bold">you can trust</emphasis>) and compare the
141 checksum of the package from unofficial location with it. For example,
142 SHA256 checksum can be checked with the command:</para>
143
144 <note>
145 <para>If the checksum and the package are downloaded from the same
146 untrusted location, you won't gain security enhancement by verifying
147 the package with the checksum. The attacker can fake the checksum as
148 well as compromising the package itself.</para>
149 </note>
150
151<screen><userinput>sha256sum -c <replaceable>file</replaceable>.sha256sum</userinput></screen>
152
153 <para>If <xref linkend="gnupg2"/> is installed, you can also verify the
154 genuity of the package with a GPG signature. Import the upstream GPG
155 public key with:</para>
156
157<screen><userinput>gpg --recv-key <replaceable>keyID</replaceable></userinput></screen>
158
159 <para><replaceable>keyID</replaceable> should be replaced with the key ID
160 from somewhere <emphasis role="bold">you can trust</emphasis> (for
161 example, copy it from the upstream official website using https). Now
162 you can verify the signature with:</para>
163
164<screen><userinput>gpg --recv-key <replaceable>file</replaceable>.sig <replaceable>file</replaceable></userinput></screen>
165
166 <para>The advantage of <application>GnuPG</application> signature is,
167 once you imported a public key which can be trusted, you can download
168 both the package and its signature from the same unofficial location and
169 verify them with the public key. So you won't need to connect to the
170 official upstream website to retrieve a checksum for each new release.
171 You only need to update the public key if it's expired or revoked.
172 </para>
173
174 </sect2>
175
176 <sect2>
177 <title>Creating Log Files During Installation</title>
178
179 <para>For larger packages, it is convenient to create log files instead of
180 staring at the screen hoping to catch a particular error or warning. Log
181 files are also useful for debugging and keeping records. The following
182 command allows you to create an installation log. Replace
183 <replaceable>&lt;command&gt;</replaceable> with the command you intend to execute.</para>
184
185<screen><userinput>( <replaceable>&lt;command&gt;</replaceable> 2&gt;&amp;1 | tee compile.log &amp;&amp; exit $PIPESTATUS )</userinput></screen>
186
187 <para><option>2&gt;&amp;1</option> redirects error messages to the same
188 location as standard output. The <command>tee</command> command allows
189 viewing of the output while logging the results to a file. The parentheses
190 around the command run the entire command in a subshell and finally the
191 <command>exit $PIPESTATUS</command> command ensures the result of the
192 <replaceable>&lt;command&gt;</replaceable> is returned as the result and not the
193 result of the <command>tee</command> command.</para>
194
195 </sect2>
196
197 <sect2 id="parallel-builds" xreflabel="Using Multiple Processors">
198 <title>Using Multiple Processors</title>
199
200 <para>For many modern systems with multiple processors (or cores) the
201 compilation time for a package can be reduced by performing a "parallel
202 make" by either setting an environment variable or telling the make program
203 to simultaneously execute multiple jobs.</para>
204
205 <para>For instance, an Intel Core i9-13900K CPU contains 8 performance
206 (P) cores and 16 efficiency (E) cores, and the P cores support SMT
207 (Simultaneous MultiThreading, also known as
208 <quote>Hyper-Threading</quote>) so each P core can run two threads
209 simultaneously and the Linux kernel will treat each P core as two
210 logical cores. As the result, there are 32 logical cores in total.
211 To utilize all these logical cores running <command>make</command>, we
212 can set an environment variable to tell <command>make</command> to
213 run 32 jobs simultaneously:</para>
214
215 <screen><userinput>export MAKEFLAGS='-j32'</userinput></screen>
216
217 <para>or just building with:</para>
218
219 <screen><userinput>make -j32</userinput></screen>
220
221 <para>
222 If you have applied the optional <command>sed</command> when building
223 <application>ninja</application> in LFS, you can use:
224 </para>
225
226 <screen><userinput>export NINJAJOBS=32</userinput></screen>
227
228 <para>
229 when a package uses <command>ninja</command>, or just:
230 </para>
231
232 <screen><userinput>ninja -j32</userinput></screen>
233
234 <para>
235 If you are not sure about the number of logical cores, run the
236 <command>nproc</command> command.
237 </para>
238
239 <para>
240 For <command>make</command>, the default number of jobs is 1. But
241 for <command>ninja</command>, the default number of jobs is N + 2 if
242 the number of logical cores N is greater than 2; or N + 1 if
243 N is 1 or 2. The reason to use a number of jobs slightly greater
244 than the number of logical cores is keeping all logical
245 processors busy even if some jobs are performing I/O operations.
246 </para>
247
248 <para>
249 Note that the <option>-j</option> switches only limits the parallel
250 jobs started by <command>make</command> or <command>ninja</command>,
251 but each job may still spawn its own processes or threads. For
252 example, <command>ld.gold</command> will use multiple threads for
253 linking, and some tests of packages can spawn multiple threads for
254 testing thread safety properties. There is no generic way for the
255 building system to know the number of processes or threads spawned by
256 a job. So generally we should not consider the value passed with
257 <option>-j</option> a hard limit of the number of logical cores to
258 use. Read <xref linkend='build-in-cgroup'/> if you want to set such
259 a hard limit.
260 </para>
261
262 <para>Generally the number of processes should not exceed the number of
263 cores supported by the CPU too much. To list the processors on your
264 system, issue: <userinput>grep processor /proc/cpuinfo</userinput>.
265 </para>
266
267 <para>In some cases, using multiple processes may result in a race
268 condition where the success of the build depends on the order of the
269 commands run by the <command>make</command> program. For instance, if an
270 executable needs File A and File B, attempting to link the program before
271 one of the dependent components is available will result in a failure.
272 This condition usually arises because the upstream developer has not
273 properly designated all the prerequisites needed to accomplish a step in the
274 Makefile.</para>
275
276 <para>If this occurs, the best way to proceed is to drop back to a
277 single processor build. Adding <option>-j1</option> to a make command
278 will override the similar setting in the <envar>MAKEFLAGS</envar>
279 environment variable.</para>
280
281 <important>
282 <para>
283 Another problem may occur with modern CPU's, which have a lot of cores.
284 Each job started consumes memory, and if the sum of the needed
285 memory for each job exceeds the available memory, you may encounter
286 either an OOM (Out of Memory) kernel interrupt or intense swapping
287 that will slow the build beyond reasonable limits.
288 </para>
289
290 <para>
291 Some compilations with <command>g++</command> may consume up to 2.5 GB
292 of memory, so to be safe, you should restrict the number of jobs
293 to (Total Memory in GB)/2.5, at least for big packages such as LLVM,
294 WebKitGtk, QtWebEngine, or libreoffice.
295 </para>
296 </important>
297 </sect2>
298
299 <sect2 id="build-in-cgroup">
300 <title>Use Linux Control Group to Limit the Resource Usage</title>
301
302 <para>
303 Sometimes we want to limit the resource usage when we build a
304 package. For example, when we have 8 logical cores, we may want
305 to use only 6 cores for building the package and reserve another
306 2 cores for playing a movie. The Linux kernel provides a feature
307 called control groups (cgroup) for such a need.
308 </para>
309
310 <para>
311 Enable control group in the kernel configuration, then rebuild the
312 kernel and reboot if necessary:
313 </para>
314
315 <xi:include xmlns:xi="http://www.w3.org/2001/XInclude"
316 href="cgroup-kernel.xml"/>
317
318 <!-- We need cgroup2 mounted at /sys/fs/cgroup. It's done by
319 systemd itself in LFS systemd, mountvirtfs script in LFS sysv. -->
320
321 <para revision='systemd'>
322 Ensure <xref linkend='systemd'/> and <xref linkend='shadow'/> have
323 been rebuilt with <xref linkend='linux-pam'/> support (if you are
324 interacting via a SSH or graphical session, also ensure the
325 <xref linkend='openssh'/> server or the desktop manager has been
326 built with <xref linkend='linux-pam'/>). As the &root; user, create
327 a configuration file to allow resource control without &root;
328 privilege, and instruct <command>systemd</command> to reload the
329 configuration:
330 </para>
331
332 <screen revision="systemd" role="nodump"><userinput>mkdir -pv /etc/systemd/system/user@.service.d &amp;&amp;
333cat &gt; /etc/systemd/system/user@.service.d/delegate.conf &lt;&lt; EOF &amp;&amp;
334<literal>[Service]
335Delegate=memory cpuset</literal>
336systemctl daemon-reload</userinput></screen>
337
338 <para revision='systemd'>
339 Then logout and login again. Now to run <command>make -j5</command>
340 with the first 4 logical cores and 8 GB of system memory, issue:
341 </para>
342
343 <screen revision="systemd" role="nodump"><userinput>systemctl --user start dbus &amp;&amp;
344systemd-run --user --pty --pipe --wait -G -d \
345 -p MemoryHigh=8G \
346 -p AllowedCPUs=0-3 \
347 make -j5</userinput></screen>
348
349 <para revision='sysv'>
350 Ensure <xref linkend='sudo'/> is installed. To run
351 <command>make -j5</command> with the first 4 logical cores and 8 GB
352 of system memory, issue:
353 </para>
354
355 <!-- "\EOF" because we expect $$ to be expanded by the "bash -e"
356 shell, not the current shell.
357
358 TODO: can we use elogind to delegate the controllers (like
359 systemd) to avoid relying on sudo? -->
360 <screen revision="sysv" role="nodump"><userinput>bash -e &lt;&lt; \EOF
361 sudo mkdir /sys/fs/cgroup/$$
362 sudo sh -c \
363 "echo +memory +cpuset > /sys/fs/cgroup/cgroup.subtree_control"
364 sudo sh -c \
365 "echo 0-3 > /sys/fs/cgroup/$$/cpuset.cpus"
366 sudo sh -c \
367 "echo $((8 &lt;&lt; 30)) > /sys/fs/cgroup/$$/memory.high"
368 (
369 sudo sh -c "echo $BASHPID > /sys/fs/cgroup/$$/cgroup.procs"
370 exec make -j5
371 )
372 sudo rmdir /sys/fs/cgroup/$$
373EOF</userinput></screen>
374
375 <para>
376 With
377 <phrase revision='systemd'>
378 <parameter>MemoryHigh=8G</parameter>
379 </phrase>
380 <phrase revision='sysv'>
381 <literal>8589934592</literal> (expanded from
382 <userinput>$((8 &lt;&lt; 30))</userinput>) in the
383 <filename>memory.high</filename> entry
384 </phrase>, a soft limit of memory usage is set.
385 If the processes in the cgroup (<command>make</command> and all the
386 descendants of it) uses more than 8 GB of system memory in total,
387 the kernel will throttle down the processes and try to reclaim the
388 system memory from them. But they can still use more than 8 GB of
389 system memory. If you want to make a hard limit instead, replace
390 <phrase revision='systemd'>
391 <parameter>MemoryHigh</parameter> with
392 <parameter>MemoryMax</parameter>.
393 </phrase>
394 <phrase revision='sysv'>
395 <filename>memory.high</filename> with
396 <filename>memory.max</filename>.
397 </phrase>
398 But doing so will cause the processes killed if 8 GB is not enough
399 for them.
400 </para>
401
402 <para>
403 <phrase revision='systemd'>
404 <parameter>AllowedCPUs=0-3</parameter>
405 </phrase>
406 <phrase revision='sysv'>
407 <literal>0-3</literal> in the <filename>cpuset.cpus</filename>
408 entry
409 </phrase> makes the kernel only run the processes in the cgroup on
410 the logical cores with numbers 0, 1, 2, or 3. You may need to
411 adjust this setting based the mapping between the logical cores and the
412 physical cores. For example, with an Intel Core i9-13900K CPU,
413 the logical cores 0, 2, 4, ..., 14 are mapped to the first threads of
414 the eight physical P cores, the logical cores 1, 3, 5, ..., 15 are
415 mapped to the second threads of the physical P cores, and the logical
416 cores 16, 17, ..., 31 are mapped to the 16 physical E cores. So if
417 we want to use four threads from four different P cores, we need to
418 specify <literal>0,2,4,6</literal> instead of <literal>0-3</literal>.
419 Note that the other CPU models may use a different mapping scheme.
420 If you are not sure about the mapping between the logical cores
421 and the physical cores, run <command>grep -E '^processor|^core'
422 /proc/cpuinfo</command> which will output logical core IDs in the
423 <computeroutput>processor</computeroutput> lines, and physical core
424 IDs in the <computeroutput>core id</computeroutput> lines.
425 </para>
426
427 <para>
428 When the <command>nproc</command> or <command>ninja</command> command
429 runs in a cgroup, it will use the number of logical cores assigned to
430 the cgroup as the <quote>system logical core count</quote>. For
431 example, in a cgroup with logical cores 0-3 assigned,
432 <command>nproc</command> will print
433 <computeroutput>4</computeroutput>, and <command>ninja</command>
434 will run 6 (4 + 2) jobs simultaneously if no <option>-j</option>
435 setting is explicitly given.
436 </para>
437
438 <para revision="systemd">
439 Read the man pages <filename>systemd-run(1)</filename> and
440 <filename>systemd.resource-control(5)</filename> for the detailed
441 explanation of parameters in the command.
442 </para>
443
444 <para revision="sysv">
445 Read the <filename>Documentation/admin-guide/cgroup-v2.rst</filename>
446 file in the Linux kernel source tree for the detailed explanation of
447 <systemitem class="filesystem">cgroup2</systemitem> pseudo file
448 system entries referred in the command.
449 </para>
450
451 </sect2>
452
453 <sect2 id="automating-builds" xreflabel="Automated Building Procedures">
454 <title>Automated Building Procedures</title>
455
456 <para>There are times when automating the building of a package can come in
457 handy. Everyone has their own reasons for wanting to automate building,
458 and everyone goes about it in their own way. Creating
459 <filename>Makefile</filename>s, <application>Bash</application> scripts,
460 <application>Perl</application> scripts or simply a list of commands used
461 to cut and paste are just some of the methods you can use to automate
462 building BLFS packages. Detailing how and providing examples of the many
463 ways you can automate the building of packages is beyond the scope of this
464 section. This section will expose you to using file redirection and the
465 <command>yes</command> command to help provide ideas on how to automate
466 your builds.</para>
467
468 <bridgehead renderas="sect3">File Redirection to Automate Input</bridgehead>
469
470 <para>You will find times throughout your BLFS journey when you will come
471 across a package that has a command prompting you for information. This
472 information might be configuration details, a directory path, or a response
473 to a license agreement. This can present a challenge to automate the
474 building of that package. Occasionally, you will be prompted for different
475 information in a series of questions. One method to automate this type of
476 scenario requires putting the desired responses in a file and using
477 redirection so that the program uses the data in the file as the answers to
478 the questions.</para>
479<!-- outdated
480 <para>Building the <application>CUPS</application> package is a good
481 example of how redirecting a file as input to prompts can help you automate
482 the build. If you run the test suite, you are asked to respond to a series
483 of questions regarding the type of test to run and if you have any
484 auxiliary programs the test can use. You can create a file with your
485 responses, one response per line, and use a command similar to the
486 one shown below to automate running the test suite:</para>
487
488<screen><userinput>make check &lt; ../cups-1.1.23-testsuite_parms</userinput></screen>
489-->
490 <para>This effectively makes the test suite use the responses in the file
491 as the input to the questions. Occasionally you may end up doing a bit of
492 trial and error determining the exact format of your input file for some
493 things, but once figured out and documented you can use this to automate
494 building the package.</para>
495
496 <bridgehead renderas="sect3">Using <command>yes</command> to Automate
497 Input</bridgehead>
498
499 <para>Sometimes you will only need to provide one response, or provide the
500 same response to many prompts. For these instances, the
501 <command>yes</command> command works really well. The
502 <command>yes</command> command can be used to provide a response (the same
503 one) to one or more instances of questions. It can be used to simulate
504 pressing just the <keycap>Enter</keycap> key, entering the
505 <keycap>Y</keycap> key or entering a string of text. Perhaps the easiest
506 way to show its use is in an example.</para>
507
508 <para>First, create a short <application>Bash</application> script by
509 entering the following commands:</para>
510
511<screen><userinput>cat &gt; blfs-yes-test1 &lt;&lt; "EOF"
512<literal>#!/bin/bash
513
514echo -n -e "\n\nPlease type something (or nothing) and press Enter ---> "
515
516read A_STRING
517
518if test "$A_STRING" = ""; then A_STRING="Just the Enter key was pressed"
519else A_STRING="You entered '$A_STRING'"
520fi
521
522echo -e "\n\n$A_STRING\n\n"</literal>
523EOF
524chmod 755 blfs-yes-test1</userinput></screen>
525
526 <para>Now run the script by issuing <command>./blfs-yes-test1</command> from
527 the command line. It will wait for a response, which can be anything (or
528 nothing) followed by the <keycap>Enter</keycap> key. After entering
529 something, the result will be echoed to the screen. Now use the
530 <command>yes</command> command to automate the entering of a
531 response:</para>
532
533<screen><userinput>yes | ./blfs-yes-test1</userinput></screen>
534
535 <para>Notice that piping <command>yes</command> by itself to the script
536 results in <keycap>y</keycap> being passed to the script. Now try it with a
537 string of text:</para>
538
539<screen><userinput>yes 'This is some text' | ./blfs-yes-test1</userinput></screen>
540
541 <para>The exact string was used as the response to the script. Finally,
542 try it using an empty (null) string:</para>
543
544<screen><userinput>yes '' | ./blfs-yes-test1</userinput></screen>
545
546 <para>Notice this results in passing just the press of the
547 <keycap>Enter</keycap> key to the script. This is useful for times when the
548 default answer to the prompt is sufficient. This syntax is used in the
549 <xref linkend="net-tools-automate-example"/> instructions to accept all the
550 defaults to the many prompts during the configuration step. You may now
551 remove the test script, if desired.</para>
552
553 <bridgehead renderas="sect3">File Redirection to Automate Output</bridgehead>
554
555 <para>In order to automate the building of some packages, especially those
556 that require you to read a license agreement one page at a time, requires
557 using a method that avoids having to press a key to display each page.
558 Redirecting the output to a file can be used in these instances to assist
559 with the automation. The previous section on this page touched on creating
560 log files of the build output. The redirection method shown there used the
561 <command>tee</command> command to redirect output to a file while also
562 displaying the output to the screen. Here, the output will only be sent to
563 a file.</para>
564
565 <para>Again, the easiest way to demonstrate the technique is to show an
566 example. First, issue the command:</para>
567
568<screen><userinput>ls -l /usr/bin | less</userinput></screen>
569
570 <para>Of course, you'll be required to view the output one page at a time
571 because the <command>less</command> filter was used. Now try the same
572 command, but this time redirect the output to a file. The special file
573 <filename>/dev/null</filename> can be used instead of the filename shown,
574 but you will have no log file to examine:</para>
575
576<screen><userinput>ls -l /usr/bin | less &gt; redirect_test.log 2&gt;&amp;1</userinput></screen>
577
578 <para>Notice that this time the command immediately returned to the shell
579 prompt without having to page through the output. You may now remove the
580 log file.</para>
581
582 <para>The last example will use the <command>yes</command> command in
583 combination with output redirection to bypass having to page through the
584 output and then provide a <keycap>y</keycap> to a prompt. This technique
585 could be used in instances when otherwise you would have to page through
586 the output of a file (such as a license agreement) and then answer the
587 question of <quote>do you accept the above?</quote>. For this example,
588 another short <application>Bash</application> script is required:</para>
589
590<screen><userinput>cat &gt; blfs-yes-test2 &lt;&lt; "EOF"
591<literal>#!/bin/bash
592
593ls -l /usr/bin | less
594
595echo -n -e "\n\nDid you enjoy reading this? (y,n) "
596
597read A_STRING
598
599if test "$A_STRING" = "y"; then A_STRING="You entered the 'y' key"
600else A_STRING="You did NOT enter the 'y' key"
601fi
602
603echo -e "\n\n$A_STRING\n\n"</literal>
604EOF
605chmod 755 blfs-yes-test2</userinput></screen>
606
607 <para>This script can be used to simulate a program that requires you to
608 read a license agreement, then respond appropriately to accept the
609 agreement before the program will install anything. First, run the script
610 without any automation techniques by issuing
611 <command>./blfs-yes-test2</command>.</para>
612
613 <para>Now issue the following command which uses two automation techniques,
614 making it suitable for use in an automated build script:</para>
615
616<screen><userinput>yes | ./blfs-yes-test2 &gt; blfs-yes-test2.log 2&gt;&amp;1</userinput></screen>
617
618 <para>If desired, issue <command>tail blfs-yes-test2.log</command> to see
619 the end of the paged output, and confirmation that <keycap>y</keycap> was
620 passed through to the script. Once satisfied that it works as it should,
621 you may remove the script and log file.</para>
622
623 <para>Finally, keep in mind that there are many ways to automate and/or
624 script the build commands. There is not a single <quote>correct</quote> way
625 to do it. Your imagination is the only limit.</para>
626
627 </sect2>
628
629 <sect2>
630 <title>Dependencies</title>
631
632 <para>For each package described, BLFS lists the known dependencies.
633 These are listed under several headings, whose meaning is as follows:</para>
634
635 <itemizedlist>
636 <listitem>
637 <para><emphasis>Required</emphasis> means that the target package
638 cannot be correctly built without the dependency having first been
639 installed, except if the dependency is said to be
640 <quote>runtime</quote>, which means the target package can be built but
641 cannot function without it.</para>
642 <para>
643 Note that a target package can start to <quote>function</quote>
644 in many subtle ways: an installed configuration file can make the
645 init system, cron daemon, or bus daemon to run a program
646 automatically; another package using the target package as an
647 dependency can run a program from the target package in the
648 building system; and the configuration sections in the BLFS book
649 may also run a program from a just installed package. So if
650 you are installing the target package without a
651 <emphasis>Required (runtime)</emphasis> dependency installed,
652 You should install the dependency as soon as possible after the
653 installation of the target package.
654 </para>
655 </listitem>
656 <listitem>
657 <para><emphasis>Recommended</emphasis> means that BLFS strongly
658 suggests this package is installed first (except if said to be
659 <quote>runtime</quote>, see below) for a clean and trouble-free
660 build, that won't have issues either during the build process, or at
661 run-time. The instructions in the book assume these packages are
662 installed. Some changes or workarounds may be required if these
663 packages are not installed. If a recommended dependency is said
664 to be <quote>runtime</quote>, it means that BLFS strongly suggests
665 that this dependency is installed before using the package, for
666 getting full functionality.</para>
667 </listitem>
668 <listitem>
669 <para><emphasis>Optional</emphasis> means that this package might be
670 installed for added functionality. Often BLFS will describe the
671 dependency to explain the added functionality that will result.
672 An optional dependency may be automatically pick up by the target
673 package if the dependency is installed, but another some optional
674 dependency may also need additional configuration options to enable
675 them when the target package is built. Such additional options are
676 often documented in the BLFS book. If an optional dependency is
677 said to be <quote>runtime</quote>, it means you may install
678 the dependency after installing the target package to support some
679 optional features of the target package if you need these
680 features.</para>
681 <para>An optional dependency may be out of BLFS. If you need such
682 an <emphasis>external</emphasis> optional dependency for some
683 features you need, read <xref linkend='beyond'/> for the general
684 hint about installing an out-of-BLFS package.</para>
685 </listitem>
686 </itemizedlist>
687
688 </sect2>
689
690 <sect2 id="package_updates">
691 <title>Using the Most Current Package Sources</title>
692
693 <para>On occasion you may run into a situation in the book when a package
694 will not build or work properly. Though the Editors attempt to ensure
695 that every package in the book builds and works properly, sometimes a
696 package has been overlooked or was not tested with this particular version
697 of BLFS.</para>
698
699 <para>If you discover that a package will not build or work properly, you
700 should see if there is a more current version of the package. Typically
701 this means you go to the maintainer's web site and download the most current
702 tarball and attempt to build the package. If you cannot determine the
703 maintainer's web site by looking at the download URLs, use Google and query
704 the package's name. For example, in the Google search bar type:
705 'package_name download' (omit the quotes) or something similar. Sometimes
706 typing: 'package_name home page' will result in you finding the
707 maintainer's web site.</para>
708
709 </sect2>
710
711 <sect2 id="stripping">
712 <title>Stripping One More Time</title>
713
714 <para>
715 In LFS, stripping of debugging symbols and unneeded symbol table
716 entries was discussed a couple of times. When building BLFS packages,
717 there are generally no special instructions that discuss stripping
718 again. Stripping can be done while installing a package, or
719 afterwards.
720 </para>
721
722 <bridgehead renderas="sect3" id="stripping-install">Stripping while Installing a Package</bridgehead>
723
724 <para>
725 There are several ways to strip executables installed by a
726 package. They depend on the build system used (see below <link
727 linkend="buildsystems">the section about build systems</link>),
728 so only some
729 generalities can be listed here:
730 </para>
731
732 <note>
733 <para>
734 The following methods using the feature of a building system
735 (autotools, meson, or cmake) will not strip static libraries if any
736 is installed. Fortunately there are not too many static libraries
737 in BLFS, and a static library can always be stripped safely by
738 running <command>strip --strip-unneeded</command> on it manually.
739 </para>
740 </note>
741
742 <itemizedlist>
743 <listitem>
744 <para>
745 The packages using autotools usually have an
746 <parameter>install-strip</parameter> target in their generated
747 <filename>Makefile</filename> files. So installing stripped
748 executables is just a matter of using
749 <command>make install-strip</command> instead of
750 <command>make install</command>.
751 </para>
752 </listitem>
753 <listitem>
754 <para>
755 The packages using the meson build system can accept
756 <parameter>-Dstrip=true</parameter> when running
757 <command>meson</command>. If you've forgot to add this option
758 running the <command>meson</command>, you can also run
759 <command>meson install --strip</command> instead of
760 <command>ninja install</command>.
761 </para>
762 </listitem>
763 <listitem>
764 <para>
765 <command>cmake</command> generates
766 <parameter>install/strip</parameter> targets for both the
767 <parameter>Unix Makefiles</parameter> and
768 <parameter>Ninja</parameter> generators (the default is
769 <parameter>Unix Makefiles</parameter> on linux). So just run
770 <command>make install/strip</command> or
771 <command>ninja install/strip</command> instead of the
772 <command>install</command> counterparts.
773 </para>
774 </listitem>
775 <listitem>
776 <para>
777 Removing (or not generating) debug symbols can also be
778 achieved by removing the
779 <parameter>-g&lt;something&gt;</parameter> options
780 in C/C++ calls. How to do that is very specific for each
781 package. And, it does not remove unneeded symbol table entries.
782 So it will not be explained in detail here. See also below
783 the paragraphs about optimization.
784 </para>
785 </listitem>
786 </itemizedlist>
787
788 <bridgehead renderas="sect3" id="stripping-installed">Stripping Installed Executables</bridgehead>
789
790 <para>
791 The <command>strip</command> utility changes files in place, which may
792 break anything using it if it is loaded in memory. Note that if a file is
793 in use but just removed from the disk (i.e. not overwritten nor
794 modified), this is not a problem since the kernel can use
795 <quote>deleted</quote> files. Look at <filename>/proc/*/maps</filename>
796 and it is likely that you'll see some <emphasis>(deleted)</emphasis>
797 entries. The <command>mv</command> just removes the destination file from
798 the directory but does not touch its content, so that it satisfies the
799 condition for the kernel to use the old (deleted) file.
800 But this approach can detach hard links into duplicated copies,
801 causing a bloat which is obviously unwanted as we are stripping to
802 reduce system size. If two files in a same file system share the
803 same inode number, they are hard links to each other and we should
804 reconstruct the link. The script below is just an example.
805 It should be run as the &root; user:
806 </para>
807
808<screen><userinput>cat &gt; /usr/sbin/strip-all.sh &lt;&lt; "EOF"
809<literal>#!/usr/bin/bash
810
811if [ $EUID -ne 0 ]; then
812 echo "Need to be root"
813 exit 1
814fi
815
816last_fs_inode=
817last_file=
818
819{ find /usr/lib -type f -name '*.so*' ! -name '*dbg'
820 find /usr/lib -type f -name '*.a'
821 find /usr/{bin,sbin,libexec} -type f
822} | xargs stat -c '%m %i %n' | sort | while read fs inode file; do
823 if ! readelf -h $file >/dev/null 2>&amp;1; then continue; fi
824 if file $file | grep --quiet --invert-match 'not stripped'; then continue; fi
825
826 if [ "$fs $inode" = "$last_fs_inode" ]; then
827 ln -f $last_file $file;
828 continue;
829 fi
830
831 cp --preserve $file ${file}.tmp
832 strip --strip-unneeded ${file}.tmp
833 mv ${file}.tmp $file
834
835 last_fs_inode="$fs $inode"
836 last_file=$file
837done</literal>
838EOF
839chmod 744 /usr/sbin/strip-all.sh</userinput></screen>
840
841 <para>
842 If you install programs in other directories such as <filename
843 class="directory">/opt</filename> or <filename
844 class="directory">/usr/local</filename>, you may want to strip the files
845 there too. Just add other directories to scan in the compound list of
846 <command>find</command> commands between the braces.
847 </para>
848
849 <para>
850 For more information on stripping, see <ulink
851 url="https://www.technovelty.org/linux/stripping-shared-libraries.html"/>.
852 </para>
853
854 </sect2>
855
856<!--
857 <sect2 id="libtool">
858 <title>Libtool files</title>
859
860 <para>
861 One of the side effects of packages that use Autotools, including
862 libtool, is that they create many files with an .la extension. These
863 files are not needed in an LFS environment. If there are conflicts with
864 pkgconfig entries, they can actually prevent successful builds. You
865 may want to consider removing these files periodically:
866 </para>
867
868<screen><userinput>find /lib /usr/lib -not -path "*Image*" -a -name \*.la -delete</userinput></screen>
869
870 <para>
871 The above command removes all .la files with the exception of those that
872 have <quote>Image</quote> or <quote>openldap</quote> as a part of the
873 path. These .la files are used by the ImageMagick and openldap programs,
874 respectively. There may be other exceptions by packages not in BLFS.
875 </para>
876
877 </sect2>
878-->
879 <sect2 id="buildsystems">
880 <title>Working with different build systems</title>
881
882 <para>
883 There are now three different build systems in common use for
884 converting C or C++ source code into compiled programs or
885 libraries and their details (particularly, finding out about available
886 options and their default values) differ. It may be easiest to understand
887 the issues caused by some choices (typically slow execution or
888 unexpected use of, or omission of, optimizations) by starting with
889 the <envar>CFLAGS</envar>, <envar>CXXFLAGS</envar>, and
890 <envar>LDFLAGS</envar> environment variables. There are also some
891 programs which use Rust.
892 </para>
893
894 <para>
895 Most LFS and BLFS builders are probably aware of the basics of
896 <envar>CFLAGS</envar> and <envar>CXXFLAGS</envar> for altering how a
897 program is compiled. Typically, some form of optimization is used by
898 upstream developers (<option>-O2</option> or <option>-O3</option>),
899 sometimes with the creation of debug symbols (<option>-g</option>),
900 as defaults.
901 </para>
902
903 <para>
904 If there are contradictory flags (e.g. multiple different
905 <option>-O</option> values),
906 the <emphasis>last</emphasis> value will be used. Sometimes this means
907 that flags specified in environment variables will be picked up before
908 values hardcoded in the Makefile, and therefore ignored. For example,
909 where a user specifies <option>-O2</option> and that is followed by
910 <option>-O3</option> the build will use <option>-O3</option>.
911 </para>
912
913 <para>
914 There are various other things which can be passed in CFLAGS or
915 CXXFLAGS, such as allowing using the instruction set extensions
916 available with a specific microarchitecture (e.g.
917 <option>-march=amdfam10</option> or <option>-march=native</option>),
918 tune the generated code for a specific microarchitecture (e. g.
919 <option>-mtune=tigerlake</option> or <option>-mtune=native</option>,
920 if <option>-mtune=</option> is not used, the microarchitecture from
921 <option>-march=</option> setting will be used), or specifying a
922 specific standard for C or C++ (<option>-std=c++17</option> for
923 example). But one thing which has now come to light is that
924 programmers might include debug assertions in their code, expecting
925 them to be disabled in releases by using <option>-DNDEBUG</option>.
926 Specifically, if <xref linkend="mesa"/> is built with these
927 assertions enabled, some activities such as loading levels of games
928 can take extremely long times, even on high-class video cards.
929 </para>
930
931 <bridgehead renderas="sect3" id="autotools-info">Autotools with Make</bridgehead>
932
933 <para>
934 This combination is often described as <quote>CMMI</quote>
935 (configure, make, make install) and is used here to also cover
936 the few packages which have a configure script that is not
937 generated by autotools.
938 </para>
939
940 <para>
941 Sometimes running <command>./configure --help</command> will produce
942 useful options about switches which might be used. At other times,
943 after looking at the output from configure you may need to look
944 at the details of the script to find out what it was actually searching
945 for.
946 </para>
947
948 <para>
949 Many configure scripts will pick up any CFLAGS or CXXFLAGS from the
950 environment, but CMMI packages vary about how these will be mixed with
951 any flags which would otherwise be used (<emphasis>variously</emphasis>:
952 ignored, used to replace the programmer's suggestion, used before the
953 programmer's suggestion, or used after the programmer's suggestion).
954 </para>
955
956 <para>
957 In most CMMI packages, running <command>make</command> will list
958 each command and run it, interspersed with any warnings. But some
959 packages try to be <quote>silent</quote> and only show which file
960 they are compiling or linking instead of showing the command line.
961 If you need to inspect the command, either because of an error, or
962 just to see what options and flags are being used, adding
963 <option>V=1</option> to the make invocation may help.
964 </para>
965
966 <bridgehead renderas="sect3" id="cmake-info">CMake</bridgehead>
967
968 <para>
969 CMake works in a very different way, and it has two backends which
970 can be used on BLFS: <command>make</command> and
971 <command>ninja</command>. The default backend is make, but
972 ninja can be faster on large packages with multiple processors. To
973 use ninja, specify <option>-G Ninja</option> in the cmake command.
974 However, there are some packages which create fatal errors in their
975 ninja files but build successfully using the default of Unix
976 Makefiles.
977 </para>
978
979 <para>
980 The hardest part of using CMake is knowing what options you might wish
981 to specify. The only way to get a list of what the package knows about
982 is to run <command>cmake -LAH</command> and look at the output for that
983 default configuration.
984 </para>
985
986 <para>
987 Perhaps the most-important thing about CMake is that it has a variety
988 of CMAKE_BUILD_TYPE values, and these affect the flags. The default
989 is that this is not set and no flags are generated. Any
990 <envar>CFLAGS</envar> or <envar>CXXFLAGS</envar> in the environment
991 will be used. If the programmer has coded any debug assertions,
992 those will be enabled unless -DNDEBUG is used. The following
993 CMAKE_BUILD_TYPE values will generate the flags shown, and these
994 will come <emphasis>after</emphasis> any flags in the environment
995 and therefore take precedence.
996 </para>
997
998 <informaltable align="center">
999 <tgroup cols="2">
1000 <colspec colnum="1" align="center"/>
1001 <colspec colnum="2" align="center"/>
1002 <thead>
1003 <row><entry>Value</entry><entry>Flags</entry></row>
1004 </thead>
1005 <tbody>
1006 <row>
1007 <entry>Debug</entry><entry><option>-g</option></entry>
1008 </row>
1009 <row>
1010 <entry>Release</entry><entry><option>-O3 -DNDEBUG</option></entry>
1011 </row>
1012 <row>
1013 <entry>RelWithDebInfo</entry><entry><option>-O2 -g -DNDEBUG</option></entry>
1014 </row>
1015 <row>
1016 <entry>MinSizeRel</entry><entry><option>-Os -DNDEBUG</option></entry>
1017 </row>
1018 </tbody>
1019 </tgroup>
1020 </informaltable>
1021
1022 <para>
1023 CMake tries to produce quiet builds. To see the details of the commands
1024 which are being run, use <command>make VERBOSE=1</command> or
1025 <command>ninja -v</command>.
1026 </para>
1027
1028 <para>
1029 By default, CMake treats file installation differently from the other
1030 build systems: if a file already exists and is not newer than a file
1031 that would overwrite it, then the file is not installed. This may be
1032 a problem if a user wants to record which file belongs to a package,
1033 either using <envar>LD_PRELOAD</envar>, or by listing files newer
1034 than a timestamp. The default can be changed by setting the variable
1035 <envar>CMAKE_INSTALL_ALWAYS</envar> to 1 in the
1036 <emphasis>environment</emphasis>, for example by
1037 <command>export</command>'ing it.
1038 </para>
1039
1040 <bridgehead renderas="sect3" id="meson-info">Meson</bridgehead>
1041
1042 <para>
1043 Meson has some similarities to CMake, but many differences. To get
1044 details of the defines that you may wish to change you can look at
1045 <filename>meson_options.txt</filename> which is usually in the
1046 top-level directory.
1047 </para>
1048
1049 <para>
1050 If you have already configured the package by running
1051 <command>meson</command> and now wish to change one or more settings,
1052 you can either remove the build directory, recreate it, and use the
1053 altered options, or within the build directory run <command>meson
1054 configure</command>, e.g. to set an option:
1055 </para>
1056
1057<screen><userinput>meson configure -D&lt;some_option&gt;=true</userinput></screen>
1058
1059 <para>
1060 If you do that, the file <filename>meson-private/cmd_line.txt</filename>
1061 will show the <emphasis>last</emphasis> commands which were used.
1062 </para>
1063
1064 <para>
1065 Meson provides the following buildtype values, and the flags they enable
1066 come <emphasis>after</emphasis> any flags supplied in the environment and
1067 therefore take precedence.
1068 </para>
1069
1070 <itemizedlist>
1071 <listitem>
1072 <para>plain : no added flags. This is for distributors to supply their
1073 own <envar>CFLAGS</envar>, <envar>CXXFLAGS</envar> and
1074 <envar>LDFLAGS</envar>. There is no obvious reason to use
1075 this in BLFS.</para>
1076 </listitem>
1077 <listitem>
1078 <para>debug : <option>-g</option> - this is the default if
1079 nothing is specified in either <filename>meson.build</filename>
1080 or the command line. However it results large and slow binaries,
1081 so we should override it in BLFS.</para>
1082 </listitem>
1083 <listitem>
1084 <para>debugoptimized : <option>-O2 -g</option> - this is the
1085 default specified in <filename>meson.build</filename> of some
1086 packages.</para>
1087 </listitem>
1088 <listitem>
1089 <para>release : <option>-O3</option> (occasionally a package will
1090 force <option>-O2</option> here) - this is the buildtype we use
1091 for most packages with Meson build system in BLFS.</para>
1092 </listitem>
1093 </itemizedlist>
1094
1095 <!-- From https://mesonbuild.com/Builtin-options.html#core-options:
1096 b_ndebug: Default value = false, Possible values are
1097 true, false, if-release. Some packages sets it to if-release
1098 so we mistakenly believed if-release had been the default. -->
1099 <para>
1100 The <option>-DNDEBUG</option> flag is implied by the release
1101 buildtype for some packages (for example <xref linkend='mesa'/>).
1102 It can also be provided explicitly by passing
1103 <option>-Db_ndebug=true</option>.
1104 </para>
1105
1106 <para>
1107 To see the details of the commands which are being run in a package using
1108 meson, use <command>ninja -v</command>.
1109 </para>
1110
1111 <bridgehead renderas="sect3" id="rust-info">Rustc and Cargo</bridgehead>
1112
1113 <para>
1114 Most released rustc programs are provided as crates (source tarballs)
1115 which will query a server to check current versions of dependencies
1116 and then download them as necessary. These packages are built using
1117 <command>cargo --release</command>. In theory, you can manipulate the
1118 RUSTFLAGS to change the optimize-level (default for
1119 <option>--release</option> is 3, i. e.
1120 <option>-Copt-level=3</option>, like <option>-O3</option>) or to
1121 force it to build for the machine it is being compiled on, using
1122 <option>-Ctarget-cpu=native</option> but in practice this seems to
1123 make no significant difference.
1124 </para>
1125
1126 <para>
1127 If you are compiling a standalone Rust program (as an unpackaged
1128 <filename class='extension'>.rs</filename> file) by running
1129 <command>rustc</command> directly, you should specify
1130 <option>-O</option> (the abbreviation of
1131 <option>-Copt-level=2</option>) or <option>-Copt-level=3</option>
1132 otherwise it will do an unoptimized compile and run
1133 <emphasis>much</emphasis> slower. If are compiling the program
1134 for debugging it, replace the <option>-O</option> or
1135 <option>-Copt-level=</option> options with <option>-g</option> to
1136 produce an unoptimized program with debug info.
1137 </para>
1138
1139 <para>
1140 Like <command>ninja</command>, by default <command>cargo</command>
1141 uses all logical cores. This can often be worked around,
1142 either by exporting
1143 <envar>CARGO_BUILD_JOBS=<replaceable>&lt;N&gt;</replaceable></envar>
1144 or passing
1145 <option>--jobs <replaceable>&lt;N&gt;</replaceable></option> to
1146 <command>cargo</command>.
1147 For compiling rustc itself, specifying
1148 <option>--jobs <replaceable>&lt;N&gt;</replaceable></option> for
1149 invocations of <command>x.py</command>
1150 (together with the <envar>CARGO_BUILD_JOBS</envar> environment
1151 variable, which looks like a <quote>belt and braces</quote>
1152 approach but seems to be necessary) mostly works. The exception is
1153 running the tests when building rustc, some of them will
1154 nevertheless use all online CPUs, at least as of rustc-1.42.0.
1155 </para>
1156
1157 </sect2>
1158
1159 <sect2 id="optimizations">
1160 <title>Optimizing the build</title>
1161
1162 <para>
1163 Many people will prefer to optimize compiles as they see fit, by providing
1164 <envar>CFLAGS</envar> or <envar>CXXFLAGS</envar>. For an
1165 introduction to the options available with gcc and g++ see <ulink
1166 url="https://gcc.gnu.org/onlinedocs/gcc-&gcc-version;/gcc/Optimize-Options.html"/>.
1167 The same content can be also found in <command>info gcc</command>.
1168 </para>
1169
1170 <para>
1171 Some packages default to <option>-O2 -g</option>, others to
1172 <option>-O3 -g</option>, and if <envar>CFLAGS</envar> or
1173 <envar>CXXFLAGS</envar> are supplied they might be added to the
1174 package's defaults, replace the package's defaults, or even be
1175 ignored. There are details on some desktop packages which were
1176 mostly current in April 2019 at
1177 <ulink url="https://www.linuxfromscratch.org/~ken/tuning/"/> - in
1178 particular, <filename>README.txt</filename>,
1179 <filename>tuning-1-packages-and-notes.txt</filename>, and
1180 <filename>tuning-notes-2B.txt</filename>. The particular thing to
1181 remember is that if you want to try some of the more interesting
1182 flags you may need to force verbose builds to confirm what is being
1183 used.
1184 </para>
1185
1186 <para>
1187 Clearly, if you are optimizing your own program you can spend time to
1188 profile it and perhaps recode some of it if it is too slow. But for
1189 building a whole system that approach is impractical. In general,
1190 <option>-O3</option> usually produces faster programs than
1191 <option>-O2</option>. Specifying
1192 <option>-march=native</option> is also beneficial, but means that
1193 you cannot move the binaries to an incompatible machine - this can
1194 also apply to newer machines, not just to older machines. For
1195 example programs compiled for <literal>amdfam10</literal> run on
1196 old Phenoms, Kaveris, and Ryzens : but programs compiled for a
1197 Kaveri will not run on a Ryzen because certain op-codes are not
1198 present. Similarly, if you build for a Haswell not everything will
1199 run on a SandyBridge.
1200 </para>
1201
1202 <note>
1203 <para>
1204 Be careful that the name of a <option>-march</option> setting
1205 does not always match the baseline of the microarchitecture
1206 with the same name. For example, the Skylake-based Intel Celeron
1207 processors do not support AVX at all, but
1208 <option>-march=skylake</option> assumes AVX and even AVX2.
1209 </para>
1210 </note>
1211
1212 <para>
1213 When a shared library is built by GCC, a feature named
1214 <quote>semantic interposition</quote> is enabled by default. When
1215 the shared library refers to a symbol name with external linkage
1216 and default visibility, if the symbol exists in both the shared
1217 library and the main executable, semantic interposition guarantees
1218 the symbol in the main executable is always used. This feature
1219 was invented in an attempt to make the behavior of linking a shared
1220 library and linking a static library as similar as possible. Today
1221 only a small number of packages still depend on semantic
1222 interposition, but the feature is still on by the default of GCC,
1223 causing many optimizations disabled for shared libraries because
1224 they conflict with semantic interposition. The
1225 <option>-fno-semantic-interposition</option> option can be passed
1226 to <command>gcc</command> or <command>g++</command> to disable
1227 semantic interposition and enable more optimizations for shared
1228 libraries. This option is used as the default of some packages
1229 (for example <xref linkend='python3'/>), and it's also the default
1230 of Clang.
1231 </para>
1232
1233 <para>
1234 There are also various other options which some people claim are
1235 beneficial. At worst, you get to recompile and test, and then
1236 discover that in your usage the options do not provide a benefit.
1237 </para>
1238
1239 <para>
1240 If building Perl or Python modules,
1241 in general the <envar>CFLAGS</envar> and <envar>CXXFLAGS</envar>
1242 used are those which were used by those <quote>parent</quote>
1243 packages.
1244 </para>
1245
1246 <para>
1247 For <envar>LDFLAGS</envar>, there are three options can be used
1248 for optimization. They are quite safe to use and the building
1249 system of some packages use some of these options as the default.
1250 </para>
1251
1252 <para>
1253 With <option>-Wl,-O1</option>, the linker will
1254 optimize the hash table to speed up the dynamic linking.
1255 Note that <option>-Wl,-O1</option> is completely unrelated to the
1256 compiler optimization flag <option>-O1</option>.
1257 </para>
1258
1259 <para>
1260 With <option>-Wl,--as-needed</option>, the linker will disregard
1261 unnecessary <option>-l<replaceable>foo</replaceable></option> options
1262 from the command line, i. e. the shared library <systemitem
1263 class='library'>lib<replaceable>foo</replaceable></systemitem>
1264 will only be linked if a symbol in <systemitem
1265 class='library'>lib<replaceable>foo</replaceable></systemitem> is
1266 really referred from the executable or shared library being linked.
1267 This can sometimes mitigate the <quote>excessive dependencies to
1268 shared libraries</quote> issues caused by
1269 <application>libtool</application>.
1270 </para>
1271
1272 <para>
1273 With <option>-Wl,-z,pack-relative-relocs</option>, the linker
1274 generates a more compacted form of the relative relocation entries
1275 for PIEs and shared libraries. It reduces the size of the linked
1276 PIE or shared library, and speeds up the loading of the PIE or
1277 shared library.
1278 </para>
1279
1280 <para>
1281 The <option>-Wl,</option> prefix is necessary because despite the
1282 variable is named <envar>LDFLAGS</envar>, its content is actually
1283 passed to <command>gcc</command> (or <command>g++</command>,
1284 <command>clang</command>, etc.) during the link stage, not directly
1285 passed to <command>ld</command>.
1286 </para>
1287
1288 </sect2>
1289
1290 <sect2 id="hardening">
1291 <title>Options for hardening the build</title>
1292
1293 <para>
1294 Even on desktop systems, there are still a lot of exploitable
1295 vulnerabilities. For many of these, the attack comes via javascript
1296 in a browser. Often, a series of vulnerabilities are used to gain
1297 access to data (or sometimes to pwn, i.e. own, the machine and
1298 install rootkits). Most commercial distros will apply various
1299 hardening measures.
1300 </para>
1301
1302 <para>
1303 In the past, there was Hardened LFS where gcc (a much older version)
1304 was forced to use hardening (with options to turn some of it off on a
1305 per-package basis). The current LFS and BLFS books are carrying
1306 forward a part of its spirit by enabling PIE
1307 (<option>-fPIE -pie</option>) and SSP
1308 (<option>-fstack-protector-strong</option>) as the defaults
1309 for GCC and clang. What is being covered here is different - first
1310 you have to make sure that the package is indeed using your added
1311 flags and not over-riding them.
1312 </para>
1313
1314 <para>
1315 For hardening options which are reasonably cheap, there is some
1316 discussion in the 'tuning' link above (occasionally, one or more
1317 of these options might be inappropriate for a package). These
1318 options are <option>-D_FORTIFY_SOURCE=2</option> and
1319 (for C++) <option>-D_GLIBCXX_ASSERTIONS</option>. On modern
1320 machines these should only have a little impact on how fast things
1321 run, and often they will not be noticeable.
1322 </para>
1323
1324 <para>
1325 The main distros use much more, such as RELRO (Relocation Read Only)
1326 and perhaps <option>-fstack-clash-protection</option>. You may also
1327 encounter the so-called <quote>userspace retpoline</quote>
1328 (<option>-mindirect-branch=thunk</option> etc.) which
1329 is the equivalent of the spectre mitigations applied to the linux
1330 kernel in late 2018. The kernel mitigations caused a lot of complaints
1331 about lost performance, if you have a production server you might wish
1332 to consider testing that, along with the other available options, to
1333 see if performance is still sufficient.
1334 </para>
1335
1336 <para>
1337 Whilst gcc has many hardening options, clang/LLVM's strengths lie
1338 elsewhere. Some options which gcc provides are said to be less effective
1339 in clang/LLVM.
1340 </para>
1341
1342 </sect2>
1343
1344</sect1>
Note: See TracBrowser for help on using the repository browser.