New prompts for Shades FR/EN #742
Conversation
The template with id 493b3361-f28d-4489-87b3-b42ae90e5e3a does not have a target or output. You need to add the output as you did for the rest: ||| {{answer_choices[stereo_antistereo]}}
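For context, a minimal sketch of what the suggested target resolves to, assuming `answer_choices` is the list declared in the template and `stereo_antistereo` is an integer field of the dataset example (both names are taken from the snippet above; the function is hypothetical, not promptsource code):

```python
def render_target(answer_choices, stereo_antistereo):
    # The part after ||| becomes the target; {{answer_choices[stereo_antistereo]}}
    # simply indexes the declared answer choices with the example's label.
    return answer_choices[stereo_antistereo]

# illustrative values only
print(render_target(["sentence A", "sentence B"], 0))  # sentence A
print(render_target(["sentence A", "sentence B"], 1))  # sentence B
```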
Thank you for the suggestion. I made the change. However, I am not sure I understand whether this is going to have the desired effect for the planned metric ("other").
@aneveol, I have fixed this and also added your prompts to my PR.
…orkshop#778) * accelerate `get_infos` by caching the `DataseInfoDict`s * quality * consistency
@oskarvanderwal we have merge conflicts that have to be resolved. Also, I don't see the dataset listed on promptsource. I think this PR needs to be cleaned up before merging. For some reason the PR did not build.
@jzf2101 Sorry, I don't have access to this repository (yet). My PR for CrowS-Pairs can be found here. This PR is about the bias-shades dataset, but I agree we need to clean some things up. @aneveol Maybe you could rename this PR to "Adding prompts for French and English bias-shades"? Then also …
@jzf2101 I have cleaned this PR and fixed the prompt answer choices. Let me know if there is anything else required for this PR!
@oskarvanderwal there are merge conflicts, could you resolve?
…dataset metadatas
* fix empty documents - multi_news * fix test - unrecognized variable
* Added languages widget to UI. * Style fixes. * Added English tag to existing datasets. * Add languages to viewer mode. * Update language codes. * Update CONTRIBUTING.md. * Update screenshot. * Add "Prompt" to UI to clarify languages tag usage.
@jzf2101 Are there additional changes that need to be made to this PR or can it be merged?
* fix `get_dataset` * format
Closed for #837
This is a test PR for the BigScience eval hackathon. The goal is to use these three new prompts for CrowS-Pairs to the following ends:
1/ test the process/pipeline (I think that works)
2/ get feedback on prompts format: are they acceptable?
3/ how will the metric "other" be defined? Here, we essentially need a count of answers, to convert into the proportion of "sentence A" answers.
(Our goal this week is to get the bias eval done on the revised/upgraded crowS-pairs_multilingual dataset).
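A minimal sketch of the count described in point 3, assuming the "other" metric reduces to the proportion of model answers that pick "sentence A" (the function name and answer strings are illustrative, not from the actual eval code):

```python
def sentence_a_proportion(answers):
    """Fraction of answers equal to 'sentence A'; 0.0 for an empty list."""
    if not answers:
        return 0.0
    return sum(a == "sentence A" for a in answers) / len(answers)

# illustrative answers from a hypothetical eval run
answers = ["sentence A", "sentence B", "sentence A", "sentence A"]
print(sentence_a_proportion(answers))  # 0.75
```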