Skip to content

Commit 3866054

Browse files
committed
Update README.md
1 parent eb1cc30 commit 3866054

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

README.md

+6-6
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ Here is an example of deep bi-directional RNN-CRF network. It contains 3 hidden
1818
Here is the inner structure of one bi-directional hidden layer.
1919
![](https://github.com/zhongkaifu/RNNSharp/blob/master/RNNSharpLayer.jpg)
2020

21-
Here is the neural network for sequence-to-sequence task. "TokenN" are from source sequence, and "ELayerX-Y" are auto-encoder's hidden layers. Auto-encoder is defined in feature configuration file. "<s>" is always the beginning of target sentence, and "DLayerX-Y" means the decoder's hidden layers. In decoder, it generates one token at one time until "</s>" is generated.
21+
Here is the neural network for sequence-to-sequence task. "TokenN" are from source sequence, and "ELayerX-Y" are auto-encoder's hidden layers. Auto-encoder is defined in feature configuration file. &lt;s&gt; is always the beginning of target sentence, and "DLayerX-Y" means the decoder's hidden layers. In decoder, it generates one token at one time until &lt;/s&gt; is generated.
2222
![](https://github.com/zhongkaifu/RNNSharp/blob/master/RNNSharpSeq2Seq.jpg)
2323

2424
## Supported Feature Types
@@ -47,7 +47,7 @@ U13:C%x[-1,0]/%x[-1,1]
4747
U14:C%x[0,0]/%x[0,1]
4848
U15:C%x[1,0]/%x[1,1]
4949

50-
The rule-string has two types, one is constant string, and the other is variable. The simplest variable format is {“%x[row,col]”}. Row specifies the offset between current focusing token and generate feature token in row. Col specifies the absolute column position in corpus. Moreover, variable combination is also supported, for example: {“%x[row1, col1]/%x[row2, col2]”}. When we build feature set, variable will be expanded to specific string. Here is an example in training data:
50+
The rule-string has two types, one is constant string, and the other is variable. The simplest variable format is {“%x[row,col]”}. Row specifies the offset between current focusing token and generate feature token in row. Col specifies the absolute column position in corpus. Moreover, variable combination is also supported, for example: {“%x[row1, col1]/%x[row2, col2]”}. When we build feature set, variable will be expanded to specific string. Here is an example in training data for named entity task.
5151

5252
Word | Pos | Tag
5353
-----------|------|----
@@ -204,10 +204,10 @@ your
204204
name
205205
?
206206

207-
I
208-
am
209-
Zhongkai
210-
Fu
207+
I
208+
am
209+
Zhongkai
210+
Fu
211211

212212
In above example, "What is your name ?" is the source sentence, and "I am Zhongkai Fu" is the target sentence generated by RNNSharp seq-to-seq model. In source sentence, beside word features, any other feautes can be added for it as well such as postag feature in sequence labeling task in above.
213213

0 commit comments

Comments
 (0)