@@ -1012,17 +1012,16 @@ syntax tree.
1012
1012
A very special macro is ` @generated ` , which allows you to define so-called * generated functions* .
1013
1013
These have the capability to generate specialized code depending on the types of their arguments
1014
1014
with more flexibility and/or less code than what can be achieved with multiple dispatch. While
1015
- macros work with expressions at parsing- time and cannot access the types of their inputs, a generated
1015
+ macros work with expressions at parse time and cannot access the types of their inputs, a generated
1016
1016
function gets expanded at a time when the types of the arguments are known, but the function is
1017
1017
not yet compiled.
1018
1018
1019
1019
Instead of performing some calculation or action, a generated function declaration returns a quoted
1020
1020
expression which then forms the body for the method corresponding to the types of the arguments.
1021
- When called, the body expression is first evaluated and compiled, then the returned expression
1022
- is compiled and run. To make this efficient, the result is often cached. And to make this inferable,
1023
- only a limited subset of the language is usable. Thus, generated functions provide a flexible
1024
- framework to move work from run-time to compile-time, at the expense of greater restrictions on
1025
- the allowable constructs.
1021
+ When a generated function is called, the expression it returns is compiled and then run.
1022
+ To make this efficient, the result is usually cached. And to make this inferable, only a limited
1023
+ subset of the language is usable. Thus, generated functions provide a flexible way to move work from
1024
+ run time to compile time, at the expense of greater restrictions on allowed constructs.
1026
1025
1027
1026
When defining generated functions, there are four main differences to ordinary functions:
1028
1027
@@ -1038,7 +1037,7 @@ When defining generated functions, there are four main differences to ordinary f
1038
1037
This means they can only read global constants, and cannot have any side effects.
1039
1038
In other words, they must be completely pure.
1040
1039
Due to an implementation limitation, this also means that they currently cannot define a closure
1041
- or untyped generator.
1040
+ or generator.
1042
1041
1043
1042
It's easiest to illustrate this with an example. We can declare a generated function ` foo ` as
1044
1043
@@ -1053,9 +1052,8 @@ foo (generic function with 1 method)
1053
1052
Note that the body returns a quoted expression, namely ` :(x * x) ` , rather than just the value
1054
1053
of ` x * x ` .
1055
1054
1056
- From the caller's perspective, they are very similar to regular functions; in fact, you don't
1057
- have to know if you're calling a regular or generated function - the syntax and result of the
1058
- call is just the same. Let's see how ` foo ` behaves:
1055
+ From the caller's perspective, this is identical to a regular function; in fact, you don't
1056
+ have to know whether you're calling a regular or generated function. Let's see how ` foo ` behaves:
1059
1057
1060
1058
``` jldoctest generated
1061
1059
julia> x = foo(2); # note: output is from println() statement in the body
@@ -1199,7 +1197,7 @@ end and at the call site; however, *don't copy them*, for the following reasons:
1199
1197
when, how often or how many times these side-effects will occur
1200
1198
* the ` bar ` function solves a problem that is better solved with multiple dispatch - defining ` bar(x) = x `
1201
1199
and ` bar(x::Integer) = x ^ 2 ` will do the same thing, but it is both simpler and faster.
1202
- * the ` baz ` function is pathologically insane
1200
+ * the ` baz ` function is pathological
1203
1201
1204
1202
Note that the set of operations that should not be attempted in a generated function is unbounded,
1205
1203
and the runtime system can currently only detect a subset of the invalid operations. There are
@@ -1317,3 +1315,59 @@ the two tuples, multiplication and addition/subtraction. All the looping is perf
1317
1315
and we avoid looping during execution entirely. Thus, we only loop * once per type* , in this case
1318
1316
once per ` N ` (except in edge cases where the function is generated more than once - see disclaimer
1319
1317
above).
1318
+
1319
+ ### Optionally-generated functions
1320
+
1321
+ Generated functions can achieve high efficiency at run time, but come with a compile time cost:
1322
+ a new function body must be generated for every combination of concrete argument types.
1323
+ Typically, Julia is able to compile "generic" versions of functions that will work for any
1324
+ arguments, but with generated functions this is impossible.
1325
+ This means that programs making heavy use of generated functions might be impossible to
1326
+ statically compile.
1327
+
1328
+ To solve this problem, the language provides syntax for writing normal, non-generated
1329
+ alternative implementations of generated functions.
1330
+ Applied to the ` sub2ind ` example above, it would look like this:
1331
+
1332
+ ``` julia
1333
+ function sub2ind_gen (dims:: NTuple{N} , I:: Integer... ) where N
1334
+ if N != length (I)
1335
+ throw (ArgumentError (" Number of dimensions must match number of indices." ))
1336
+ end
1337
+ if @generated
1338
+ ex = :(I[$ N] - 1 )
1339
+ for i = (N - 1 ): - 1 : 1
1340
+ ex = :(I[$ i] - 1 + dims[$ i] * $ ex)
1341
+ end
1342
+ return :($ ex + 1 )
1343
+ else
1344
+ ind = I[N] - 1
1345
+ for i = (N - 1 ): - 1 : 1
1346
+ ind = I[i] - 1 + dims[i]* ind
1347
+ end
1348
+ return ind + 1
1349
+ end
1350
+ end
1351
+ ```
1352
+
1353
+ Internally, this code creates two implementations of the function: a generated one where
1354
+ the first block in ` if @generated ` is used, and a normal one where the ` else ` block is used.
1355
+ Inside the ` then ` part of the ` if @generated ` block, code has the same semantics as other
1356
+ generated functions: argument names refer to types, and the code should return an expression.
1357
+ Multiple ` if @generated ` blocks may occur, in which case the generated implementation uses
1358
+ all of the ` then ` blocks and the alternate implementation uses all of the ` else ` blocks.
1359
+
1360
+ Notice that we added an error check to the top of the function.
1361
+ This code will be common to both versions, and is run-time code in both versions
1362
+ (it will be quoted and returned as an expression from the generated version).
1363
+ That means that the values and types of local variables are not available at code generation
1364
+ time --- the code-generation code can only see the types of arguments.
1365
+
1366
+ In this style of definition, the code generation feature is essentially an optional
1367
+ optimization.
1368
+ The compiler will use it if convenient, but otherwise may choose to use the normal
1369
+ implementation instead.
1370
+ This style is preferred, since it allows the compiler to make more decisions and compile
1371
+ programs in more ways, and since normal code is more readable than code-generating code.
1372
+ However, which implementation is used depends on compiler implementation details, so it
1373
+ is essential for the two implementations to behave identically.
0 commit comments