Skip to content

Conversation

tonywang10101
Copy link

Description

This PR addresses the issue where the VLLM tutorial cannot run in BLS mode. The problem was initially pointed out in the pb_stub.cc file, line 1426.

Changes

Moved parameters to the input field instead of using the parameter field in InferenceRequest.

@tonywang10101 tonywang10101 marked this pull request as draft September 18, 2023 03:07
@tonywang10101 tonywang10101 marked this pull request as ready for review September 18, 2023 03:07
@the-david-oy
Copy link
Contributor

the-david-oy commented Sep 21, 2023

Stellar contribution! Thanks for submitting this, Tony. Can you submit the Contributor License Agreement (instructions here) so that we can review this?

CC: @tanmayv25 @rmccorm4

@the-david-oy
Copy link
Contributor

the-david-oy commented Sep 21, 2023

Tested locally. This passes the vLLM example.

@tanmayv25
Copy link
Contributor

@nnshah1 Do you think moving the sampling parameters from request parameters to input should be fine? Looks like it should be useful for BLS as well as PA profiling? Note that perf_analyzer currently do not support request parameters.
cc @matthewkotila

@debermudez
Copy link

It is in active development so the decision here will impact our final PR.

@tonywang10101
Copy link
Author

Stellar contribution! Thanks for submitting this, Tony. Can you submit the Contributor License Agreement (instructions here) so that we can review this?

CC: @tanmayv25 @rmccorm4

Thanks @dyastremsky. I have signed the CLA and mailed it.

@the-david-oy
Copy link
Contributor

Thank you for getting that done so quickly, Tony! We had a discussion and would like to support both the parameter and tensor paths. Since this PR removes support for the parameter path, would you be able to amend it to support both?

Quoting @rmccorm4's suggestion:

So, we can add the tensor, but should include optional: true in the model config for that tensor.
In the model, if the tensor is provided, use it. If not provided, use the parameters.
If both are provided, either error or join the 2 dicts? Not sure. Joining is probably a better experience with a doc note somewhere mentioning it.
In the client, we can demonstrate both passing as parameters or tensor. Maybe we add a --mode {parameters, tensor} with default="tensor" to control it?

@nnshah1
Copy link
Contributor

nnshah1 commented Sep 28, 2023

@tonywang10101 to ease merging with support for parameters and tensors I created this PR on your original. Can you take a look and merge?

tonywang10101#1

@the-david-oy
Copy link
Contributor

Hi Tony! Thanks for all your work here.

This PR has been merged with you and Neelay as co-authors here: #52

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants