-
Notifications
You must be signed in to change notification settings - Fork 1.9k
'Type not implemented or supported' exception message from TextLoader is not descriptive #128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Will datetime type be supported at some point ? |
@v-tsymbalistyi, Currently string, float and bool are supported. Can you please confirm if you used other than these type that cause this exception? if not it would be good if you can share your sample? @remware, can you please share any use case where datatime type will be useful? (just to make a case to support it) |
oh, yes I would need DateTime and int if possible. As of now DateTime would be useful. In my example if I input a value of 0.3 the system predicts different states in different runs so probably the values are quite close. Do you have a pointer to other trainer models? |
@remware, Thanks for sharing example. I mean how will you use datetime type in a learning algorithm. All the machine learning algorithms only operates on numbers. Some learning tasks only depend on year so in this case year part is extracted as a feature from the datetime field or some learning task require day, month, year as feature in that case these three are extracted from the datetime field. So, its not just supporting datetime type but also supporting appropriate transform that can operate on this datetime type and extract features from that. I wanted know if you have any end-to-end case where datetime will be read as proper type, how exactly you transform datatime into some features and used in a learning algorithm? PS: most of the time when data depends on time, the problem become time-series then simply a classification or regression problem. |
what leaner are you using? |
@zeahmed Same for me. I tried to use DateTime and int as well. I can live without it for now. I just couldn't figure out what was going on at the start. That is why I decided to create this issue. |
I guess the DateTime use case is bit complex. So we need something like when we mark a date, all the dates consecutive to marker are considered as state dependent. Not sure yet how to do the transformation but that would be next step. At the moment I am using StochasticDualCoordinateAscentClassifier but for my tests I would need Naive Bayes, Clojure, A1DE and MLP Classifier. Also notice that the predicted column in data model is always "Label" is this designed on purpose or can I use other name there ? Assuming you sync also the name in Dictionarizer |
StochasticDualCoordinateAscentClassifier is stochastic as name indicates. It initializes values through a stochastic process at the start similar to other linear algorithms. So, prediction are expected to be a little bit off after every training run. To get the deterministic results, there is a seed value parameter that is not exposed yet. The issue #9 has already been open against it. Here are other learners that you can try:
Once issue #34 is resolved you would be able to use bunch of binary classier for multi-class classification case. |
Great news @zeahmed |
(Correcting previous question: This issue is only when I have int). Why is it I should always use float/double? I had a simple row of data with headers in the csv as follows: When I use the following line of code: To make sure the file Test.csv is being read properly, I used I get: {System.Exception: Type not implemented or supported. |
@raghumuttana, Just looking at your example, I see that your file has header. However, in TextLoader you are setting "useHeader: false". It should be "true". I hope that will fix the issue. Otherwise, please post your code to get deeper understanding of your issue. |
@zeahmed , do you have any idea which types are going to be supported eventually? |
Here is the list of types currently supported.
|
What about using string as feature? I try with Dictionarizer but got an error. Is there any way to assign values to the ocurrence of certaing strings in a text field? Was thinking to do kind of sentiment analysis but for issues reported so I could assign automatically certain bugs/issues to corresponding team or component. Is that possible? |
Make a 'not supported field type' exception more readable, so the developer could figure out why he can't load the data This closes dotnet#128
Issue
The text was updated successfully, but these errors were encountered: