blob: 59b76f622a5a633dab60359a3c36686a152665b3 [file] [log] [blame]
David Chen8fe82a32016-08-24 10:55:41 +00001---
2layout: documentation
3title: Test Encyclopedia
4---
5<h1>Writing tests</h1>
6
7<p class="lead">An Exhaustive Specification of the Test Execution Environment</p>
8
9<h2>Background</h2>
10
11<p>The Bazel BUILD language includes rules which can be used to define
12automated test programs in many languages.</p>
13
14<p>Tests are run using <code><a href="bazel-user-manual.html#test">bazel test</a></code>.
15
16Users may also execute test binaries directly. This is allowed but not endorsed, as such
17an invocation will not adhere to the mandates described below.</p>
18
19<p>Tests should be <i>hermetic</i>: that is, they ought to access only those
20resources on which they have a declared dependency. If tests are not properly
21hermetic then they do not give historically reproducible results. This could be a
22significant problem for culprit finding (determining which change broke a test),
23release engineering auditability, and resource isolation of tests (automated
24testing frameworks ought not DDOS a server because some tests happen to
25talk to it).<p>
26
27<h2>Objective</h2>
28
29<p>The goal of this document is to formally establish the runtime environment
30for and expected behavior of Bazel tests. It will also impose requirements on
31the test runner and the build system.
32
33Our intent is to help test authors avoid relying on unspecified
34behavior, and thus give the testing infrastructure more freedom to make
35implementation changes. We will also take the opportunity to tighten up some
36holes which currently allow many tests to pass despite not being
37properly hermetic, deterministic, and reentrant.</p>
38
39<p>This document is intended to be both normative and authoritative. If
40this specification and the implemented behavior of test runner disagree, the
41specification takes precedence.</p>
42
43
44<h3>Purpose of Tests</h3>
45
46<p>The purpose of Bazel tests is to confirm some property of the source files
47checked into the repository. (In this document, "source files" includes test data,
48golden outputs, and anything else kept under version control.) One
49user writes a test to assert an invariant which they expect to be maintained.
50Other users execute the test later to check whether the invariant has been
51broken. If the test depends on any variables other than source files
52(non-hermetic), its value is diminished, because the later users cannot be sure
53their changes are at fault when the test stops passing.</p>
54
55<p>Therefore the outcome of a test must depend only on:</p>
56<ul>
57 <li>source files on which the test has a declared dependency</li>
58 <li>products of the build system on which the test has a declared dependency</li>
59 <li>resources whose behavior is guaranteed by the test runner to remain constant</li>
60</ul>
61
62<p>Currently, such behavior is not enforced. However, test runners reserve the
63right to add such enforcement in the future.</p>
64
65<h3>Role of the Build System</h3>
66
67<p>Test rules are analogous to binary rules in that each must yield an
68executable program. For some languages, this is a stub program which combines
69a language-specific harness with the test code. Test rules must produce other
70outputs as well. In addition to the primary test executable, the test runner
71will need a manifest of <b>runfiles</b>, input files which should be made
72available to the test at runtime, and it may need information about the type,
73size, and tags of a test.</p>
74
75<p>The build system may use the runfiles to deliver code as well as data. (This
76might be used as an optimization to make each test binary smaller by sharing
77files across tests, e.g. through the use of dynamic linking.) The build system
78should ensure that the generated executable loads these files via the runfiles
79image provided by the test runner, rather than hardcoded references to absolute
80locations in the source or output tree.</p>
81
82<h3>Role of the Test Runner</h3>
83
84<p>From the point of view of the test runner, each test is a program which can
85be invoked with <code>execve()</code>. There may be other ways to execute
86tests; for example, an IDE might allow the execution of Java tests in-process.
87However, the result of running the test as a standalone process must be
88considered authoritative. If a test process runs to completion and terminates
89normally with an exit code of zero, the test has passed. Any other result is
90considered a test failure. In particular, writing any of the strings
91<code>PASS</code> or <code>FAIL</code> to stdout has no significance to the test
92runner.</p>
93
94<p>If a test takes too long to execute, exceeds some resource limit, or the test
95runner otherwise detects prohibited behavior, it may choose to kill the test
96and treat the run as a failure. The runner must not report the test as passing
97after sending a signal to the test process or any children thereof.</p>
98
99<p id="timeout">The whole test target (not individual methods or tests) is given a
100limited amount of time to run to completion. The time limit for a test is based
101on its timeout attribute according to the following table:</p>
102
103<table class="table table-bordered table-striped table-condensed">
104 <thead>
105 <tr><th>timeout</th><th>Time Limit (sec.)</th></tr>
106 </thead>
107 <tbody>
108 <tr><td><code>short</code></td><td>60</td></tr>
109 <tr><td><code>moderate</code></td><td>300</td></tr>
110 <tr><td><code>long</code></td><td>900</td></tr>
111 <tr><td><code>eternal</code></td><td>3600</td></tr>
112 </tbody>
113</table>
114
115<p id="size">Tests which do not explicitly specify a timeout have one implied based on the
116test's <code>size</code> as follows:</p>
117
118<table class="table table-bordered table-striped table-condensed">
119 <thead>
120 <tr><th>size</th><th>Implied timeout label</th></tr>
121 </thead>
122 <tbody>
123 <tr><td><code>small</code></td><td>short</td></tr>
124 <tr><td><code>medium</code></td><td>moderate</td></tr>
125 <tr><td><code>large</code></td><td>long</td></tr>
126 <tr><td><code>enormous</code></td><td>eternal</td></tr>
127 </tbody>
128</table>
129<p>For example a "large" test with no explicit timeout setting will be allotted
130900 seconds to run. A "medium" test with a timeout of "short" will be allotted
13160 seconds.</p>
132
133<p>All combinations of <code>size</code> and <code>timeout</code> labels are
134legal, so an "enormous" test may be declared to have a timeout of "short".
135Presumably it would do some really horrible things very quickly.</p>
136<p>Tests may return arbitrarily fast regardless of timeout. A test is not
137penalized for an overgenerous timeout, although a warning may be issued: you
138should generally set your timeout as tight as you can without incurring any
139flakiness.</p>
140
141<p>There is also a recommended lower bound for test timeouts as follows: </p>
142
143<table class="table table-bordered table-striped table-condensed">
144 <thead>
145 <tr><th>size</th><th>Time minimum (sec.)</th></tr>
146 </thead>
147 <tbody>
148 <tr><td><code>short</code></td><td>0</td></tr>
149 <tr><td><code>moderate</code></td><td>30</td></tr>
150 <tr><td><code>long</code></td><td>300</td></tr>
151 <tr><td><code>eternal</code></td><td>900</td></tr>
152 </tbody>
153</table>
154
155<p>For example, if a "moderate" test completes in 5.5s, consider setting
156<code>timeout</code>="short" or <code>size</code>="small". Using the bazel
157<code>--test_verbose_timeout_warnings</code> command line option will show the
158tests whose specified size is too big.</p>
159
160<p>Test sizes and timeouts are specified in the BUILD file according to the specification
161<a href="be/common-definitions.html#common-attributes-tests">here</a>.
162Any test that does not specify a recognized size will default to being a medium
163test.</p>
164
165<p>If the main process of a test exits, but some of its children are still
166running, the test runner should consider the run complete and count it as a
167success or failure based on the exit code observed from the main process. The
168test runner may kill any stray processes. Tests should not leak processes in
169this fashion.</p>
170
171<p>When executing a test, the test runner must establish certain initial
172conditions.</p>
173
174<h3>Initial Conditions</h3>
175
176<p>The test runner must invoke each test with the path to the test
177executable in <code>argv[0]</code>. This path must be relative and
178beneath the test's current directory (which is in the runfiles tree,
179see below).
180The test runner should not pass any other arguments to a
181test unless the user explicitly requests it.</p>
182
183<p>The initial environment block shall be composed as follows:</p>
184
185<table class="table table-bordered table-striped table-condensed">
186 <thead>
187 <tr><th>Variable</th><th>Value</th><th>Status</th></tr>
188 </thead>
189 <tbody>
190
191 <tr><td><code>HOME</code></td><td>value of <code>$TEST_TMPDIR</code></td><td>recommended</td></tr>
192 <tr><td><code>LANG</code></td><td><i>unset</i></td><td>required</td></tr>
193 <tr><td><code>LANGUAGE</code></td><td><i>unset</i></td><td>required</td></tr>
194 <tr><td><code>LC_ALL</code></td><td><i>unset</i></td><td>required</td></tr>
195 <tr><td><code>LC_COLLATE</code></td><td><i>unset</i></td><td>required</td></tr>
196 <tr><td><code>LC_CTYPE</code></td><td><i>unset</i></td><td>required</td></tr>
197 <tr><td><code>LC_MESSAGES</code></td><td><i>unset</i></td><td>required</td></tr>
198 <tr><td><code>LC_MONETARY</code></td><td><i>unset</i></td><td>required</td></tr>
199 <tr><td><code>LC_NUMERIC</code></td><td><i>unset</i></td><td>required</td></tr>
200 <tr><td><code>LC_TIME</code></td><td><i>unset</i></td><td>required</td></tr>
201 <tr><td><code>LD_LIBRARY_PATH</code></td><td>colon-separated list of directories containing shared libraries</td><td>optional</td></tr>
202 <tr><td><code>JAVA_RUNFILES</code></td><td>value of <code>$TEST_SRCDIR</code></td><td>deprecated</td></tr>
203
204 <tr><td><code>LOGNAME</code></td><td>value of <code>$USER</code></td><td>required</td></tr>
205
206 <tr><td><code>PATH</code></td><td><code>/usr/local/bin:/usr/local/sbin:/usr/bin:/usr/sbin:/bin:/sbin:.</code></td><td>recommended</td></tr>
207 <tr><td><code>PWD</code></td><td><code>$TEST_SRCDIR/<i>workspace-name</i></code></td><td>recommended</td></tr>
208 <tr><td><code>SHLVL</code></td><td><code>2</code></td><td>recommended</td></tr>
209 <tr><td><code>TEST_PREMATURE_EXIT_FILE</code></td><td>absolute path to a private file in a writable directory (used for catching calls to exit())</td><td>optional</td></tr>
Ulf Adams5a7ea192017-01-20 09:35:32 +0000210 <tr>
211 <td><code>TEST_RANDOM_SEED</code></td>
212 <td colspan="2">If the <code class='flag'>--runs_per_test</code> option is used, TEST_RANDOM_SEED
213 is set to the <var>run number</var> (starting with 1) for each individual test run.</td>
214 <td>optional</td>
215 </tr>
David Chen8fe82a32016-08-24 10:55:41 +0000216 <tr><td><code>TEST_SIZE</code></td><td>The test <a href="#size"><code>size</code></a></td><td>optional</td></tr>
217 <tr><td><code>TEST_TIMEOUT</code></td><td>The test <a href="#timeout"><code>timeout</code></a></td><td>optional</td></tr>
218 <tr><td><code>TEST_SRCDIR</code></td><td>absolute path to the base of the runfiles tree</td><td>required</td></tr>
219 <tr><td><code>TEST_TMPDIR</code></td><td>absolute path to a private writable directory</td><td>required</td></tr>
David Chen8fe82a32016-08-24 10:55:41 +0000220 <tr><td><code>TEST_UNDECLARED_OUTPUTS_DIR</code></td><td>absolute path to a private writable directory (used to write undeclared test outputs)</td><td>optional</td></tr>
221 <tr><td><code>TEST_UNDECLARED_OUTPUTS_ANNOTATIONS_DIR</code></td><td>absolute path to a private writable directory (used to write undeclared test output annotation .part files).
222
223 </td><td>optional</td></tr>
224 <tr><td><code>TEST_WARNINGS_OUTPUT_FILE</code></td><td>absolute path to a private file in a writable directory (used to write test target warnings)</td><td>optional</td></tr>
225
226 <tr><td><code>TZ</code></td><td><code>UTC</code></td><td>required</td></tr>
David Chen8fe82a32016-08-24 10:55:41 +0000227 <tr><td><code>USER</code></td><td>value of <code>getpwuid(getuid())-&gt;pw_name</code></td><td>required</td></tr>
228
229 <tr><td><code>XML_OUTPUT_FILE</code></td><td>Location of the <code>ANT</code>-like XML output file</td><td>optional</td></tr>
230 <tr><td><code>TEST_WORKSPACE</code></td><td>the local repository's workspace name</td><td>optional</td></tr>
David Chen8fe82a32016-08-24 10:55:41 +0000231 </tbody>
232</table>
233<br>
234<p>The environment may contain additional entries. Tests should not depend on the
235presence, absence, or value of any environment variable not listed above.</p>
236
237<p>The initial working directory shall be <code>$TEST_SRCDIR/$TEST_WORKSPACE</code>.</p>
238<p> The current process id, process group id, session id, and parent process
239id are unspecified. The process may or may not be a process group leader or a
240session leader. The process may or may not have a controlling terminal. The
241process may have zero or more running or unreaped child processes. The process
242should not have multiple threads when the test code gains control.</p>
243
244<p>File descriptor 0 (stdin) shall be open for reading, but what it is attached
245to is unspecified. Tests must not read from it. File descriptors 1 (stdout)
246and 2 (stderr) shall be open for writing, but what they are attached to is
247unspecified. It could be a terminal, a pipe, a regular file, or anything else
248to which characters can be written. They may share an entry in the open file
249table (meaning that they cannot seek independently). Tests should not inherit
250any other open file descriptors.</p>
251
252<p>The initial umask shall be 022 or 027.</p>
253
254<p>No alarm or interval timer shall be pending.</p>
255
256<p>The initial mask of blocked signals shall be empty. All signals shall be set
257to their default action.</p>
258
259<p>The initial resource limits, both soft and hard, should be set as follows:</p>
260
261<table class="table table-bordered table-striped table-condensed">
262 <thead>
263 <tr><th>Resource</th><th>Limit</th></tr>
264 </thead>
265 <tbody>
266 <tr><td>RLIMIT_AS</td><td>unlimited</td></tr>
267 <tr><td>RLIMIT_CORE</td><td>unspecified</td></tr>
268 <tr><td>RLIMIT_CPU</td><td>unlimited</td></tr>
269 <tr><td>RLIMIT_DATA</td><td>unlimited</td></tr>
270 <tr><td>RLIMIT_FSIZE</td><td>unlimited</td></tr>
271 <tr><td>RLIMIT_LOCKS</td><td>unlimited</td></tr>
272 <tr><td>RLIMIT_MEMLOCK</td><td>unlimited</td></tr>
273 <tr><td>RLIMIT_MSGQUEUE</td><td>unspecified</td></tr>
274 <tr><td>RLIMIT_NICE</td><td>unspecified</td></tr>
275 <tr><td>RLIMIT_NOFILE</td><td>at least 1024</td></tr>
276 <tr><td>RLIMIT_NPROC</td><td>unspecified</td></tr>
277 <tr><td>RLIMIT_RSS</td><td>unlimited</td></tr>
278 <tr><td>RLIMIT_RTPRIO</td><td>unspecified</td></tr>
279 <tr><td>RLIMIT_SIGPENDING</td><td>unspecified</td></tr>
280 <tr><td>RLIMIT_STACK</td><td>unlimited, or 2044KB &lt;= rlim &lt;= 8192KB</td></tr>
281 </tbody>
282</table>
283
284<p>The initial process times (as returned by <code>times()</code>) and resource
285utilization (as returned by <code>getrusage()</code>) are unspecified.</p>
286
287<p>The initial scheduling policy and priority are unspecified.</p>
288
289<h3>Role of the Host System</h3>
290
291<p>In addition to the aspects of user context under direct control of the
292test runner, the operating system on which tests execute must satisfy certain
293properties for a test run to be valid.</p>
294
295<h4>Filesystem</h4>
296
297<p>
298The root directory observed by a test may or may not be the real root directory.<br>
299<code>/proc</code> shall be mounted.<br>
300All build tools shall be present at the absolute paths under <code>/usr</code> used by a local installation.<br>
301Paths starting with <code>/home</code> may not be available. Tests should not access any such paths.<br>
302<code>/tmp</code> and <code>/export/hda3/tmp</code> shall be writable, but tests should avoid using these paths.<br>
303
304Tests must not assume that any constant path is available for their exclusive use.<br>
305Tests must not assume that atimes are enabled for any mounted filesystem.<br>
306</p>
307
308<h4>Users and groups</h4>
309
310<p>The users root, nobody, and unittest must exist. The groups root, nobody,
311and eng must exist.</p>
312
313<p>Tests must be executed as a non-root user. The real and effective user ids
314must be equal; likewise for group ids. Beyond this, the current user id, group
315id, user name, and group name are unspecified. The set of supplementary group
316ids is unspecified.</p>
317
318<p>The current user id and group id must have corresponding names which can be
319retrieved with <code>getpwuid()</code> and <code>getgrgid()</code>. The same
320may not be true for supplementary group ids.</p>
321
322<p>The current user must have a home directory. It may not be writable. Tests
323must not attempt to write to it.</p>
324
325<h4>Networking</h4>
326
327<p>The hostname is unspecified. It may or may not contain a dot. Resolving
328the hostname must give an IP address of the current host. Resolving the
329hostname cut after the first dot must also work. The hostname localhost must
330resolve.</p>
331
332<h4>Other resources</h4>
333
334<p>Tests are granted at least one CPU core. Others may be available but this
philwoda21ba72017-05-02 20:25:12 +0200335is not guaranteed. Other performance aspects of this core are not specified.
336You can increase the reservation to a higher number of CPU cores by adding the
philwo8bd43122017-05-04 12:47:19 +0200337tag "cpu:n" (where n is a positive number) to a test rule. If a machine has
338less total CPU cores than requested, Bazel will still run the test. If a test
339uses sharding, each individual shard will reserve the number of CPU cores
340specified here.</p>
David Chen8fe82a32016-08-24 10:55:41 +0000341
342<p>Tests may create subprocesses, but not process groups or sessions.</p>
343
Googler57742cc2017-07-10 20:58:53 +0200344<p>There is a limit on the number of input files a test may consume. This limit
345is subject to change, but is currently in the range of tens of thousands of inputs.
346
347</p>
David Chen8fe82a32016-08-24 10:55:41 +0000348
349<h4>Time and date</h4>
350
351<p>The current time and date are unspecified. The system timezone is unspecified.
352
353</p>
354
355<p>X Windows may or may not be available. Tests that need an X server should
356start Xvfb.</p>
357
358<h3>Test interaction with the filesystem</h3>
359<p>All file paths specified in test environment variables point to
360somewhere on the local filesystem, unless otherwise specified.</p>
361
362<p>
363Tests should create files only within the directories specified by
364<code>$TEST_TMPDIR</code> and <code>$TEST_UNDECLARED_OUTPUTS_DIR</code>
365(if set).<br>
366These directories will be initially empty.<br>
367Tests must not attempt to remove, chmod, or otherwise alter these directories.<br>
368These directories may be a symbolic links.<br>
369The filesystem type of <code>$TEST_TMPDIR/.</code> remains unspecified.<br>
370Tests may also write .part files to the <code>$TEST_UNDECLARED_OUTPUTS_ANNOTATIONS_DIR</code>
371to annotate undeclared output files.</p>
372
373<p>Tests must access inputs through the <b>runfiles</b> mechanism, or other
374parts of the execution environment which are specifically intended to make
375input files available.
376
377Tests must not access other outputs of the
378build system at paths inferred from the location of their own executable.</p>
379
380<p>It is unspecified whether the runfiles tree contains regular files, symbolic
381links, or a mixture. The runfiles tree may contain symlinks to directories.
382Tests should avoid using paths containing <code>..</code> components within the
383runfiles tree.</p>
384
385<p>No directory, file, or symlink within the runfiles tree (including paths
386which traverse symlinks) should be writable. (It follows that the initial
387working directory should not be writable.) Tests must not assume that any part
388of the runfiles is writable, or owned by the current user (i.e. chmod and chgrp
389may fail).</p>
390
391<p>The runfiles tree (including paths which traverse symlinks) must not change
392during test execution. Parent directories and filesystem mounts must not change
393in any way which affects the result of resolving a path within the runfiles
394tree.</p>
395
396<p>In order to catch early exit, a test may create a file at the path specified by
397<code>TEST_PREMATURE_EXIT_FILE</code> upon start and remove it upon exit. If
398Bazel sees the file when the test finishes, it will assume that the test exited
399prematurely and mark it as having failed.</p>
400
401<h3>Tag conventions</h3>
402
403<p>
404 Some tags in the test rules have a special
405 meaning.
406</p>
407
408<table class="table table-bordered table-striped table-condensed">
409 <thead>
410 <tr><th>Tag</th><th>Meaning</th></tr>
411 </thead>
412 <tbody>
413 <tr>
414 <th><code>exclusive</code></th>
415
416 <td>run no other test at the same time</td>
417 </tr>
418 <tr>
419 <th><code>external</code></th>
420 <td>test has an external dependency; disable test caching</td>
421 </tr>
422 <tr>
423 <th><code>large</code></th>
424 <td><code>test_suite</code> convention; suite of large tests<br/>
425
426 </td>
427 </tr>
428
429 <tr>
430 <th><code>manual</code></th>
431
432 <td>don't include test target in wildcard target patterns like <code>:...</code>, <code>:*</code>, or <code>:all</code>)</td>
433 </tr>
434 <tr>
435 <th><code>medium</code></th>
436
437 <td><code>test_suite</code> convention; suite of medium tests
438 </tr>
439
440 <tr>
441 <th><code>small</code></th>
442
443 <td><code>test_suite</code> convention; suite of small tests</td>
444 </tr>
445
446 <tr>
447 <th><code>smoke</code></th>
448
449 <td>
450 <code>test_suite</code> convention; means it should be run before committing code changes
451 into the version control system
452 </td>
453 </tr>
454
455 </tbody>
456</table>
457
458<h3>Runfiles</h3>
459
460<p>In the following, assume there is a *_binary() rule labeled <code>//foo/bar:unittest</code>,
461with a run-time dependency on the rule labeled <code>//deps/server:server</code>.</p>
462
463<h4>Location</h4>
464<p>The runfiles directory for a target <code>//foo/bar:unittest</code> is the directory
465<code>$(WORKSPACE)/$(BINDIR)/foo/bar/unittest.runfiles</code>. This path is referred to as the
466<code>runfiles_dir</code>.</p>
467
468<h4>Dependencies</h4>
469<p>The runfiles directory is declared as a compile-time dependency of the *_binary() rule.
470The runfiles directory itself depends on the set of BUILD files that affect the *_binary() rule
471or any of its compile-time or run-time dependencies. Modifying source files does not affect the
472structure of the runfiles directory, and thus does not trigger any rebuilding.</p>
473
474<h4>Contents</h4>
475<p>The runfiles directory contains the following:</p>
476<ul>
477 <li><b>Symlinks to run-time dependencies</b>: each OutputFile and CommandRule that is a run-time
478dependency of the *_binary() rule is represented by one symlink in the runfiles directory.
479The name of the symlink is <code>$(WORKSPACE)/package_name/rule_name</code>. For example, the
480symlink for server would be named <code>$(WORKSPACE)/deps/server/server</code>, and the full path
481would be <code>$(WORKSPACE)/foo/bar/unittest.runfiles/$(WORKSPACE)/deps/server/server</code>.
482The destination of the symlink is the OutputFileName() of the OutputFile or CommandRule,
483expressed as an absolute path. Thus, the destination of the symlink might be
484<code>$(WORKSPACE)/linux-dbg/deps/server/42/server</code>.</li>
485 <li><b>Symlinks to sub-runfiles</b>: for every *_binary() Z that is a run-time depdendency of
486*_binary() C, there is a second link in the runfiles directory of C to the runfiles of Z.
487The name of the symlink is <code>$(WORKSPACE)/package_name/rule_name.runfiles</code>.
488The target of the symlink is the runfiles directory. I.e. all subprograms share a common
489runfiles directory.</li>
490</ul>
491