Skip to main content

Create reader

POST 

/reader

Create reader

Takes in a ReaderConfig payload and loads up new python reader

Request​

Body

required

    access_tokenstringnullable
    constrained_decoding_backendstringnullable
    consumer_groupstringrequired
    cuda_graph_cache_capacityint32nullable
    cuda_visible_devicesstringnullable
    devicestringrequired
    disable_cuda_graphint32nullable
    enable_document_processingbooleannullable
    internal_gateway_ipstringnullable
    lmfe_max_consecutive_whitespacesint32nullable
    log_levelstringnullable
    lorasstringnullable
    max_batch_sizeint64nullable
    max_sequence_lengthint32nullable
    model_namestringrequired
    nvlink_unavailableint32nullable
    page_cache_sizestringnullable
    prefill_chunk_sizeint64nullable
    quant_typestringnullable
    quantize_cache_bitsint32nullable
    reader_log_as_jsonstringnullable
    ssd_cache_sizeintegernullable
    tensor_parallelint32nullable

Responses​

Takes in a JSON payload and loads up new model and/or backend

Schema

    _reader_config

    object

    required

    access_tokenstringnullable
    constrained_decoding_backendstringnullable
    consumer_groupstringrequired
    cuda_graph_cache_capacityint32nullable
    cuda_visible_devicesstringnullable
    devicestringrequired
    disable_cuda_graphint32nullable
    enable_document_processingbooleannullable
    internal_gateway_ipstringnullable
    lmfe_max_consecutive_whitespacesint32nullable
    log_levelstringnullable
    lorasstringnullable
    max_batch_sizeint64nullable
    max_sequence_lengthint32nullable
    model_namestringrequired
    nvlink_unavailableint32nullable
    page_cache_sizestringnullable
    prefill_chunk_sizeint64nullable
    quant_typestringnullable
    quantize_cache_bitsint32nullable
    reader_log_as_jsonstringnullable
    ssd_cache_sizeintegernullable
    tensor_parallelint32nullable
    _reader_idstringrequired