what is the role of 'maxlen' parameter?

'maxlen' is one of the parameters in 'train_nmt.py', set to 50 by default.
I get the following message during the training process: **"Minibatch with zero sample under length  100"**
Investigating the source code shows that this message is appear [when there is a batch size that the length of the source **and** target is greater than 'maxlen'](https://github.com/nyu-dl/dl4mt-tutorial/blob/master/session2/nmt.py#L173-L184).
On the other hand, in 'data_iterator.py' [training samples have been skipped when the length of source and target is greater than 'maxlen'](https://github.com/nyu-dl/dl4mt-tutorial/blob/master/session2/data_iterator.py#L108-L109).
1. Why such a contradiction is exist? -passing samples in data_iterator and then filter them in 'prpare-data'
2. If I set maxlen to a large value (1000 for example), the updating time is significantly increase, would you describe why?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

what is the role of 'maxlen' parameter? #55

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

what is the role of 'maxlen' parameter? #55

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions