[Stop Sequences] support stop sequences #2712
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR is to support stop sequences, which is specific tokens or phrases that signal the model to terminate the current generation process.
Related environment variables
FD_USE_STOP_SEQ
: Whether to use stop sequences, default is 0FD_STOP_SEQS_MAX_LEN
: Maximum length of stop sequences, default is 8FD_MAX_STOP_SEQS_NUM
: Maximum number of stop sequences, default is 5Usage
online serving
And response is below, terminating at the stop sequence:
or give a
stop_token_ids
offline inference
run with
export FD_USE_STOP_SEQ=1