Skip to content

Commit 687b63d

Browse files
authored
docs: update examples (numaproj#40)
Signed-off-by: Edward Lee <[email protected]>
1 parent dfcfdeb commit 687b63d

7 files changed

+40
-39
lines changed

README.md

+4-3
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,7 @@ NumaFlow installs in less than a minute and is easier and cheaper for simple dat
2424
* Data aggregation (e.g. group-by)
2525

2626
## Resources
27-
- Check out [QUICK START](docs/QUICK_START.md) to try it out.
28-
- Take a look at [DEVELOPMENT](docs/DEVELOPMENT.md) to set up development environment.
29-
- Refer to [CONTRIBUTING](https://github.com/numaproj/numaproj/blob/main/CONTRIBUTING.md) to contribute to the project.
27+
- [QUICK_START](docs/QUICK_START.md)
28+
- [EXAMPLES](examples)
29+
- [DEVELOPMENT](docs/DEVELOPMENT.md)
30+
- [CONTRIBUTING](https://github.com/numaproj/numaproj/blob/main/CONTRIBUTING.md)

examples/1-simple-pipeline.yaml

+4-4
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ metadata:
44
name: simple-pipeline
55
spec:
66
vertices:
7-
- name: input
7+
- name: in
88
source:
99
# A self data generating source
1010
generator:
@@ -14,12 +14,12 @@ spec:
1414
udf:
1515
builtin:
1616
name: cat # A builtin UDF which simply cats the message
17-
- name: output
17+
- name: out
1818
sink:
1919
# A simple log printing sink
2020
log: {}
2121
edges:
22-
- from: input
22+
- from: in
2323
to: cat
2424
- from: cat
25-
to: output
25+
to: out

examples/2-http-pipeline.yaml

+4-4
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ metadata:
44
name: http-pipeline
55
spec:
66
vertices:
7-
- name: input
7+
- name: in
88
source:
99
# An http endpoint
1010
http:
@@ -21,12 +21,12 @@ spec:
2121
udf:
2222
builtin:
2323
name: cat # A builtin UDF which simply cats the message
24-
- name: output
24+
- name: out
2525
sink:
2626
# A simple log printing sink
2727
log: {}
2828
edges:
29-
- from: input
29+
- from: in
3030
to: cat
3131
- from: cat
32-
to: output
32+
to: out

examples/3-pipeline-filter.yaml

+4-4
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ metadata:
44
name: filter-pipeline
55
spec:
66
vertices:
7-
- name: input
7+
- name: in
88
source:
99
# A self data generating source
1010
http:
@@ -22,12 +22,12 @@ spec:
2222
name: filter
2323
kwargs:
2424
expression: int(json(payload).id) < 100
25-
- name: output
25+
- name: out
2626
sink:
2727
# A simple log printing sink
2828
log: {}
2929
edges:
30-
- from: input
30+
- from: in
3131
to: filter
3232
- from: filter
33-
to: output
33+
to: out

examples/4-udsink-pipeline.yaml

+4-4
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ metadata:
44
name: udsink-pipeline
55
spec:
66
vertices:
7-
- name: input
7+
- name: in
88
source:
99
generator:
1010
rpu: 1
@@ -14,13 +14,13 @@ spec:
1414
udf:
1515
builtin:
1616
name: cat
17-
- name: output
17+
- name: out
1818
sink:
1919
udsink:
2020
container:
2121
image: my-sink:latest
2222
edges:
23-
- from: input
23+
- from: in
2424
to: p1
2525
- from: p1
26-
to: output
26+
to: out

examples/5-auto-scaling.yaml

+4-4
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ metadata:
44
name: auto-scaling-pipeline
55
spec:
66
vertices:
7-
- name: input
7+
- name: in
88
source:
99
# A self data generating source
1010
generator:
@@ -17,15 +17,15 @@ spec:
1717
udf:
1818
builtin:
1919
name: cat # A builtin UDF which simply cats the message
20-
- name: output
20+
- name: out
2121
scale:
2222
min: 3 # Minimal replicas
2323
max: 5 # Maximum replicas
2424
sink:
2525
# A simple log printing sink
2626
log: {}
2727
edges:
28-
- from: input
28+
- from: in
2929
to: cat
3030
- from: cat
31-
to: output
31+
to: out

examples/README.md

+16-16
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,18 @@
11
# NumaFlow by Example
22

3-
Welcome to the NumaFlow Community! This document provides an example-by example guide to using NumaFlow.
3+
Welcome to the NumaFlow Community! This document provides an example-by-example guide to using NumaFlow.
44

55
If you haven't already, install NumaFlow by following the [QUICK START](../docs/QUICK_START.md) instructions.
66

7-
The top-level abstraction in NumaFlow is the `Pipeline`. A `Pipeline` consists of a set of `vertices` connected by `edges`. A vertex can be a `source`, `sink`, or `processing` vertex. In the example below, we have a source vertex named *input* that generates messages at a specified rate, a sink vertex named *ouput* that logs messages, and a processing vertex named *cat* that produces any input message as output. Lastly, there are two edges, one connecting the *input* to the *cat* vertex and another connecting the *cat* to the *output* vertex. The resulting pipeline simply copies internally generated messages to the log.
7+
The top-level abstraction in NumaFlow is the `Pipeline`. A `Pipeline` consists of a set of `vertices` connected by `edges`. A vertex can be a `source`, `sink`, or `processing` vertex. In the example below, we have a source vertex named *in* that generates messages at a specified rate, a sink vertex named *out* that logs messages, and a processing vertex named *cat* that produces any input message as output. Lastly, there are two edges, one connecting the *in* to the *cat* vertex and another connecting the *cat* to the *out* vertex. The resulting pipeline simply copies internally generated messages to the log.
88
```yaml
99
apiVersion: numaflow.numaproj.io/v1alpha1
1010
kind: Pipeline
1111
metadata:
1212
name: simple-pipeline
1313
spec:
1414
vertices:
15-
- name: input
15+
- name: in
1616
source:
1717
# Generate 5 messages every second
1818
# xxx what does rpu stand for? how large are the messages by default, what data is contained in the message? rename duration to interval?
@@ -23,18 +23,18 @@ spec:
2323
udf: # A user defined function
2424
builtin: # Use a builtin function as the udf
2525
name: cat # cats the message
26-
- name: output
26+
- name: out
2727
sink:
2828
# Output message to the log
2929
# xxx which log? k8s events?
3030
log: {}
3131

32-
# input -> cat -> output
32+
# in -> cat -> out
3333
edges:
34-
- from: input
34+
- from: in
3535
to: cat
3636
- from: cat
37-
to: output
37+
to: out
3838
```
3939
4040
Below we have a simple variation on the above example that takes input from an http endpoint.
@@ -45,7 +45,7 @@ metadata:
4545
name: http-pipeline
4646
spec:
4747
vertices:
48-
- name: input
48+
- name: in
4949
source:
5050
http:
5151
# Whether to create a ClusterIP Service, defaults to false
@@ -62,26 +62,26 @@ spec:
6262
udf:
6363
builtin:
6464
name: cat # A builtin UDF which simply cats the message
65-
- name: output
65+
- name: out
6666
sink:
6767
# A simple log printing sink
6868
log: {}
6969
edges:
70-
- from: input
70+
- from: in
7171
to: cat
7272
- from: cat
73-
to: output
73+
to: out
7474
```
7575
76-
Let's modify the UDF in the first example to passthrough only messages with an *id* less than 100
76+
Let's modify the UDF in the first example to pass-through only messages with an *id* less than 100
7777
```yaml
7878
apiVersion: numaflow.numaproj.io/v1alpha1
7979
kind: Pipeline
8080
metadata:
8181
name: filter-pipeline
8282
spec:
8383
vertices:
84-
- name: input
84+
- name: in
8585
source:
8686
generator:
8787
rpu: 5
@@ -92,13 +92,13 @@ spec:
9292
name: filter
9393
kwargs:
9494
expression: int(json(payload).id) < 100
95-
- name: output
95+
- name: out
9696
sink:
9797
log: {}
9898
edges:
99-
- from: input
99+
- from: in
100100
to: filter
101101
- from: filter
102-
to: output
102+
to: out
103103
```
104104

0 commit comments

Comments
 (0)