LLMeBench issues

Results 58 LLMeBench issues

Sort by recently updated

Feat/dialect identification shami corpus

Added Zero-Shot assets for dialect identification on the SHAMI corpus

baselmousi

Tests for new model

Add tests for the new model VLLM.

MaramHasanain

Feat/ArAIEval23 tasks data

- We need to upload the dataset for download. The are currently on the main server location 'data_for_download' - The datasets comes with four different task definitions, (task 1 with...

firojalam

Issue with OSACT4SubtaskB dataset loading

File: llmebench/datasets/OSACT4SubtaskB.py Line: 44 If we take the first two as text and label, the labels are ["HS", "NOT_HS", "OFF", "NOT_OFF"]. The "OFF" and "NOT_OFF" labels are for subtaskA. the...

AridHasan

Add score Jais/sarcasm/ar sarcasm2

AridHasan

LLMeBench
LLMeBench copied to clipboard

Metadata

Feat/dialect identification shami corpus

Jais/demographic/location

Jais/sarcasm/ar sarcasm2

feat/subjectivity/jais13b zeroshot

Feat/demographic attributes/jais13b

Feat/news categorization/jais13b -- zeroshot

Tests for new model

Feat/ArAIEval23 tasks data

Issue with OSACT4SubtaskB dataset loading

Add score Jais/sarcasm/ar sarcasm2

← Metadata

Owner

Metadata

LLMeBench LLMeBench copied to clipboard

Metadata

← Metadata

Owner

Metadata

LLMeBench
LLMeBench copied to clipboard