[Question]: Why can't I successfully invoke the rerank model deployed by vllm when using the API at /api/v1/retrieval? #7985

Open
@lovekarri

Self Checks

  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (Language Policy).
  • Non-English title submissions will be closed directly (Language Policy).
  • Please do not modify this template :) and fill in all the required fields.

Describe your problem

My rerank model was deployed successfully with vLLM, and I have used it successfully in a retrieval test against my knowledge base:
Image

But when I call the API, the request fails with this response:
{ "code": 100, "data": null, "message": "LookupError('Model(vllm/bge-reranker-v2-m3) not authorized')" }
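The error above is reported inside the JSON body, so if the server sends it with an HTTP 200 status, `raise_for_status()` would not raise. A minimal sketch of checking the body itself follows. It assumes (not confirmed in this issue) that a `code` of 0 means success; `RetrievalError` and `check_body` are hypothetical names used only for illustration.

```python
import json


class RetrievalError(Exception):
    """Hypothetical exception for an error reported inside the JSON body."""


def check_body(body: dict):
    """Return body["data"], or raise if the body reports an error.

    Assumption: a "code" of 0 means success; any other value is an error.
    """
    if body.get("code") != 0:
        raise RetrievalError(f"code={body.get('code')}: {body.get('message')}")
    return body.get("data")


# The response quoted above, parsed from its JSON text.
sample = json.loads(
    '{"code": 100, "data": null, '
    '"message": "LookupError(\'Model(vllm/bge-reranker-v2-m3) not authorized\')"}'
)

try:
    check_body(sample)
except RetrievalError as err:
    print(err)
```

With `requests`, the same check could be applied as `check_body(response.json())` right after `response.raise_for_status()`.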

This is my API code:

    import requests

    try:
        response = requests.post(
            url="http://192.168.191.101:8092/api/v1/retrieval",
            headers={
                "Authorization": "Bearer ragflow-kzNmJhYWE4MmRlODeXZjBhN2IyMGE2Nj",
                "Content-Type": "application/json",
            },
            json={
                # English: "How should a serious accident hazard found during
                # special equipment inspection and testing be handled?"
                "question": "特种设备检验检测过程中发现设备存在严重事故隐患时该如何处理?",
                "dataset_ids": ["9ec38706301b11f08abf0a66212dace0"],
                "top_k": 3,
                "page": 1,
                "page_size": 3,
                "keyword": False,
                "highlight": False,
                "similarity_threshold": 0.5,
                "vector_similarity_weight": 0.7,
                "rerank_id": "vllm/bge-reranker-v2-m3",
            },
            timeout=120,
        )
        response.raise_for_status()
    except requests.exceptions.RequestException as e:
        # Placeholder handler (the original body is not shown): report the failure.
        print(f"Request failed: {e}")

How can I use this rerank model through the API?
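One way to narrow this down is to send the same request with and without `rerank_id` and compare the `code` values, which would show whether the failure is tied to the rerank model identifier alone. The sketch below is only a diagnostic aid, not a fix. It assumes the endpoint accepts a request that omits `rerank_id`; `without_rerank`, `compare_rerank`, and `fake_post` are hypothetical helpers, and `fake_post` is a stand-in that mimics the response reported above instead of contacting a server.

```python
import copy
from typing import Callable

payload = {
    "question": "特种设备检验检测过程中发现设备存在严重事故隐患时该如何处理?",
    "dataset_ids": ["9ec38706301b11f08abf0a66212dace0"],
    "top_k": 3,
    "rerank_id": "vllm/bge-reranker-v2-m3",
}


def without_rerank(body: dict) -> dict:
    """Return a copy of the request body with "rerank_id" removed."""
    trimmed = copy.deepcopy(body)
    trimmed.pop("rerank_id", None)
    return trimmed


def compare_rerank(body: dict, post_json: Callable[[dict], dict]) -> dict:
    """Send the body with and without "rerank_id" and collect both codes.

    post_json is supplied by the caller: it takes a request body and
    returns the parsed JSON response.
    """
    return {
        "with_rerank": post_json(body).get("code"),
        "without_rerank": post_json(without_rerank(body)).get("code"),
    }


def fake_post(body: dict) -> dict:
    """Stand-in for the real HTTP call; mimics the reported behaviour."""
    if "rerank_id" in body:
        return {"code": 100, "data": None, "message": "not authorized"}
    return {"code": 0, "data": {}}


print(compare_rerank(payload, fake_post))
```

Against a real server, `post_json` could be `lambda body: requests.post(url, headers=headers, json=body, timeout=120).json()`, with `url` and `headers` taken from the request shown earlier.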

Labels: 🐞 bug, 🙋‍♀️ question
