Skip to content

TSV Dataset with commas doesn't work #327

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
beccamc opened this issue Nov 7, 2019 · 3 comments
Closed

TSV Dataset with commas doesn't work #327

beccamc opened this issue Nov 7, 2019 · 3 comments
Assignees
Milestone

Comments

@beccamc
Copy link
Contributor

beccamc commented Nov 7, 2019

AirBnb Dataset from this website and from this data source.

We can't process this data set. I think it may be because of commas in the data set (e.g. "East-West Highway, Silver Spring, MD 20910, United States").

Ask Becca for dataset

@JakeRadMSFT JakeRadMSFT added this to the December 2019 milestone Nov 8, 2019
@LittleLittleCloud
Copy link
Contributor

This bug also exists on AutoML, raised an issue here

@LittleLittleCloud LittleLittleCloud self-assigned this Nov 8, 2019
@LittleLittleCloud
Copy link
Contributor

It's a bug in TextLoader, (or a feature I would say)
A solution to bypass this restriction might be

after we get IDataView, we can go to the next step (to train the models)

@LittleLittleCloud
Copy link
Contributor

LittleLittleCloud commented Dec 23, 2019

Raise a fixing PR here in machinelearning

We should still try the bypass method though

UPDATE

ML.Net team has started a fix on TextLoader to make it work with newline in double quote

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants