Text preprocessing V2 TODOs #1373

mfeurer · 2022-01-19T14:37:13Z

Louquinze · 2022-02-15T10:45:52Z

can not find Improve the way feature types are passed to the meta-feature computation (search for the following todo: Todo make this more cohesive to the overall structure (quick bug fix)) in the open to do's

edit: metafeatures.py:1089

mfeurer · 2022-02-15T16:23:07Z

I think the comment right now says: TODO make this more cohesive to the overall structure (quick bug fix)

automl#1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution

automl#1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24.

Louquinze · 2022-02-25T09:01:54Z

can we rename the point "Potentially move the text feature reduction to a different module" to "Potentially rename the text feature reduction to a different module" ?

mfeurer · 2022-02-25T09:16:12Z

Sure

automl#1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24.

* rename "ngram_range" to "ngram_upper_bound" this includes renaming it in all *csv and *json files for metalearning * rename "ngram_range" to "ngram_upper_bound" this includes renaming it in all *csv and *json files for metalearning * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * limit 20NG to 5 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 5 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 2 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 2 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity

…emain active.

* rename "ngram_range" to "ngram_upper_bound" this includes renaming it in all *csv and *json files for metalearning * rename "ngram_range" to "ngram_upper_bound" this includes renaming it in all *csv and *json files for metalearning * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * handle the following issue #1373 (comment) this commit fixes the first 3 bullet points on the to do list. 1. rename hyperparameter "ngram_range" --> "ngram_upper_bound" this includes changing all *csv and *json files 2. Create a new textpreprocessing example_text_preprocessing.py, this new example features the 20Newsgroups dataset import in example_text_preprocessing.py to long, but i can not come up with a good solution include feedback from 02.24. * limit 20NG to 5 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 5 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 2 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity * limit 20NG to 2 labels. automl.leaderboard has problems if the ensamble contains only one model. Therefore we reduced the problem complexity

mfeurer added the enhancement A new improvement or feature label Jan 19, 2022

mfeurer assigned Louquinze Jan 19, 2022

mfeurer mentioned this issue Jan 24, 2022

Text Processing #1300

Merged

Louquinze mentioned this issue Feb 21, 2022

Change HP Name & Include Text example #1410

Merged

Louquinze assigned eddiebergman Mar 15, 2022

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 17, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

a6002e0

…emain active.

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 17, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

045864e

…emain active.

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 17, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

2125985

…emain active.

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 17, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

273b6af

…emain active.

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 19, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

c3dcd93

…emain active.

Louquinze added a commit to Louquinze/auto-sklearn that referenced this issue May 19, 2022

fix issue automl#1373 automl#741 that non existing features classes r…

b46ac4a

…emain active.

eddiebergman linked a pull request Jun 10, 2022 that will close this issue

Fixing hps remain active & meta hp configuration old #1489

Closed

eddiebergman added this to the V0.15 milestone Jun 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Text preprocessing V2 TODOs #1373

Text preprocessing V2 TODOs #1373

mfeurer commented Jan 19, 2022 •

edited by Louquinze

Loading

Louquinze commented Feb 15, 2022 •

edited

Loading

Uh oh!

mfeurer commented Feb 15, 2022

Uh oh!

Louquinze commented Feb 25, 2022

Uh oh!

mfeurer commented Feb 25, 2022

Uh oh!

Text preprocessing V2 TODOs #1373

Text preprocessing V2 TODOs #1373

Comments

mfeurer commented Jan 19, 2022 • edited by Louquinze Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

General implementation

Hyperparameter space

Louquinze commented Feb 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mfeurer commented Feb 15, 2022

Uh oh!

Louquinze commented Feb 25, 2022

Uh oh!

mfeurer commented Feb 25, 2022

Uh oh!

mfeurer commented Jan 19, 2022 •

edited by Louquinze

Loading

Louquinze commented Feb 15, 2022 •

edited

Loading