Write your own pipeline

Add a dataset

Inherit from retrieval_qa_benchmark.schema.BaseDataset.
Implement a build function that parses your data into a list of retrieval_qa_benchmark.schema.QARecord.
Register your dataset with retrieval_qa_benchmark.utils.registry.REGISTRY.register_dataset like this:

@retrieval_qa_benchmark.utils.registry.REGISTRY.register_dataset("type name you want")
class YourDataset(retrieval_qa_benchmark.schema.BaseDataset):
    pass

Create a PR on GitHub.
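The dataset steps above can be sketched as follows. To stay runnable without the library installed, the sketch uses local stand-ins: the QARecord fields, the BaseDataset shape, the build signature, and the toy register_dataset decorator are all assumptions made for illustration, not the actual retrieval_qa_benchmark definitions. In real code you would inherit from retrieval_qa_benchmark.schema.BaseDataset and decorate with REGISTRY.register_dataset.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Type


@dataclass
class QARecord:
    """Hypothetical stand-in for retrieval_qa_benchmark.schema.QARecord."""
    id: str
    question: str
    answer: str
    choices: Optional[List[str]] = None


class BaseDataset:
    """Hypothetical stand-in for retrieval_qa_benchmark.schema.BaseDataset."""

    @classmethod
    def build(cls) -> List[QARecord]:
        raise NotImplementedError


# Toy registry (assumption): maps a type name to the class registered under it.
DATASETS: Dict[str, Type[BaseDataset]] = {}


def register_dataset(name: str) -> Callable[[Type[BaseDataset]], Type[BaseDataset]]:
    def decorator(cls: Type[BaseDataset]) -> Type[BaseDataset]:
        DATASETS[name] = cls
        return cls
    return decorator


# Made-up raw data standing in for a file or a downloaded dataset.
RAW_ROWS = [
    {"q": "What is 2 + 2?", "a": "4", "options": ["3", "4", "5"]},
    {"q": "Capital of France?", "a": "Paris", "options": ["Paris", "Rome"]},
]


@register_dataset("toy-qa")
class ToyDataset(BaseDataset):
    @classmethod
    def build(cls) -> List[QARecord]:
        # Parse each raw row into one QARecord.
        return [
            QARecord(id=str(i), question=row["q"], answer=row["a"], choices=row["options"])
            for i, row in enumerate(RAW_ROWS)
        ]


# Look the class up by its registered type name, then build the records.
records = DATASETS["toy-qa"].build()
print(len(records), records[0].question)
```

Registering under a type name lets a pipeline configuration refer to the dataset by a string instead of importing the class directly.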

Add a transform

Inherit from retrieval_qa_benchmark.schema.BaseTransform.
Implement any of the following:
- retrieval_qa_benchmark.schema.BaseTransform.transform_question
- retrieval_qa_benchmark.schema.BaseTransform.transform_choices
- retrieval_qa_benchmark.schema.BaseTransform.transform_context
- any other QARecord fields you would like to change
Register your transform with retrieval_qa_benchmark.utils.registry.REGISTRY.register_transform like this:

@retrieval_qa_benchmark.utils.registry.REGISTRY.register_transform("type name you want")
class YourTransform(retrieval_qa_benchmark.schema.BaseTransform):
    pass

Create a PR on GitHub.
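A transform along the lines described above might look like the sketch below. The BaseTransform here is a local stand-in: the hook signatures (each hook receives the record as a dict and returns the new value for its field) and the __call__ wiring are assumptions for illustration, so check the real retrieval_qa_benchmark.schema.BaseTransform before copying them.

```python
from typing import Any, Dict, List


class BaseTransform:
    """Hypothetical stand-in for retrieval_qa_benchmark.schema.BaseTransform."""

    def transform_question(self, data: Dict[str, Any]) -> str:
        return data["question"]

    def transform_choices(self, data: Dict[str, Any]) -> List[str]:
        return data["choices"]

    def __call__(self, data: Dict[str, Any]) -> Dict[str, Any]:
        # Rebuild the record from the per-field hooks; other fields pass through.
        out = dict(data)
        out["question"] = self.transform_question(data)
        out["choices"] = self.transform_choices(data)
        return out


class LetteredChoices(BaseTransform):
    """Prefix each choice with a letter and ask for a one-letter answer."""

    def transform_question(self, data: Dict[str, Any]) -> str:
        return data["question"].strip() + " Answer with a single letter."

    def transform_choices(self, data: Dict[str, Any]) -> List[str]:
        return [f"{chr(ord('A') + i)}. {c}" for i, c in enumerate(data["choices"])]


record = {"question": " Capital of France? ", "choices": ["Paris", "Rome"]}
new_record = LetteredChoices()(record)
print(new_record)
```

Only the overridden hooks change their fields; everything else in the record is copied unchanged, and the input record is not mutated.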

Add an LLM

Inherit from retrieval_qa_benchmark.schema.BaseLLM.
Implement all of the following:
- retrieval_qa_benchmark.schema.BaseLLM.build
- retrieval_qa_benchmark.schema.BaseLLM.generate
Register your language model with retrieval_qa_benchmark.utils.registry.REGISTRY.register_model like this:

@retrieval_qa_benchmark.utils.registry.REGISTRY.register_model("type name you want")
class YourLLM(retrieval_qa_benchmark.schema.BaseLLM):
    pass

Create a PR on GitHub.
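As a rough illustration of the build/generate split, here is a toy model that always returns a canned reply. The BaseLLM below is a local stand-in, and the signatures (build as a classmethod taking keyword options, generate taking a prompt string and returning a string) are assumptions; the real retrieval_qa_benchmark.schema.BaseLLM may differ.

```python
class BaseLLM:
    """Hypothetical stand-in for retrieval_qa_benchmark.schema.BaseLLM."""

    @classmethod
    def build(cls, **kwargs) -> "BaseLLM":
        raise NotImplementedError

    def generate(self, text: str) -> str:
        raise NotImplementedError


class CannedLLM(BaseLLM):
    """Toy model that ignores the prompt and returns a fixed reply."""

    def __init__(self, reply: str) -> None:
        self.reply = reply

    @classmethod
    def build(cls, reply: str = "A") -> "CannedLLM":
        # A real implementation would load weights or create an API client here.
        return cls(reply)

    def generate(self, text: str) -> str:
        # A real implementation would send `text` to the model here.
        return self.reply


llm = CannedLLM.build(reply="B")
answer = llm.generate("Capital of France? A. Rome B. Paris")
print(answer)
```

A canned model like this is also a convenient way to test the rest of a pipeline without calling a real model.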

Add an evaluator

Inherit from retrieval_qa_benchmark.schema.BaseEvaluator.
Override the matcher function of retrieval_qa_benchmark.schema.BaseEvaluator with your own matching logic.
Register your evaluator with retrieval_qa_benchmark.utils.registry.REGISTRY.register_evaluator like this:

@retrieval_qa_benchmark.utils.registry.REGISTRY.register_evaluator("type name you want")
class YourEvaluator(retrieval_qa_benchmark.schema.BaseEvaluator):
    pass

Create a PR on GitHub.
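To make the matcher idea concrete, the sketch below swaps a strict exact-match matcher for a case-insensitive one. The BaseEvaluator here is a local stand-in: the matcher signature (prediction and reference answer in, score out) and the score helper are assumptions for illustration, not the library's actual interface.

```python
from typing import List, Tuple


class BaseEvaluator:
    """Hypothetical stand-in for retrieval_qa_benchmark.schema.BaseEvaluator."""

    def matcher(self, pred: str, answer: str) -> float:
        # Default: strict exact match.
        return float(pred == answer)

    def score(self, pairs: List[Tuple[str, str]]) -> float:
        # Mean matcher score over (prediction, answer) pairs.
        return sum(self.matcher(p, a) for p, a in pairs) / len(pairs)


class CaseInsensitiveEvaluator(BaseEvaluator):
    def matcher(self, pred: str, answer: str) -> float:
        # Ignore surrounding whitespace and letter case.
        return float(pred.strip().lower() == answer.strip().lower())


pairs = [("paris", "Paris"), (" 4 ", "4"), ("Rome", "Paris")]
strict = BaseEvaluator().score(pairs)
lenient = CaseInsensitiveEvaluator().score(pairs)
print(strict, lenient)
```

On this made-up data the strict matcher accepts none of the predictions, while the lenient one accepts the first two.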