Skip to content

Commit 179e4df

Browse files
maziyarpanahijsl-modelsNaveen-004ahmedlone127prabod
authored
Models hub (#13876)
* Add model 2023-04-13-CyberbullyingDetection_ClassifierDL_tfhub_en (#13757) Co-authored-by: Naveen-004 <[email protected]> * 2023-04-20-distilbert_base_uncased_mnli_en (#13761) * Add model 2023-04-20-distilbert_base_uncased_mnli_en * Add model 2023-04-20-distilbert_base_turkish_cased_allnli_tr * Add model 2023-04-20-distilbert_base_turkish_cased_snli_tr * Add model 2023-04-20-distilbert_base_turkish_cased_multinli_tr * Update and rename 2023-04-20-distilbert_base_turkish_cased_allnli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr.md * Update and rename 2023-04-20-distilbert_base_turkish_cased_multinli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md * Update and rename 2023-04-20-distilbert_base_turkish_cased_snli_tr.md to 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md * Update and rename 2023-04-20-distilbert_base_uncased_mnli_en.md to distilbert_base_zero_shot_classifier_turkish_cased_snli * Rename distilbert_base_zero_shot_classifier_turkish_cased_snli to distilbert_base_zero_shot_classifier_turkish_cased_snli_en.md * Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md * Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md * Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr.md --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr (#13763) * Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr * Add model 2023-04-20-distilbert_base_zero_shot_classifier_uncased_mnli_en * Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr * Add model 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_allnli_tr * Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_multinli_tr.md * Update 2023-04-20-distilbert_base_zero_shot_classifier_turkish_cased_snli_tr.md --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-04-roberta_base_zero_shot_classifier_nli_en (#13781) * Add model 2023-05-04-roberta_base_zero_shot_classifier_nli_en * Fix Spark version to 3.0 --------- Co-authored-by: ahmedlone127 <[email protected]> Co-authored-by: Maziyar Panahi <[email protected]> * 2023-05-09-distilbart_xsum_6_6_en (#13788) * Add model 2023-05-09-distilbart_xsum_6_6_en * Add model 2023-05-09-distilbart_xsum_12_6_en * Add model 2023-05-09-distilbart_cnn_12_6_en * Add model 2023-05-09-distilbart_cnn_6_6_en * Add model 2023-05-09-bart_large_cnn_en * Update 2023-05-09-bart_large_cnn_en.md * Update 2023-05-09-distilbart_cnn_12_6_en.md * Update 2023-05-09-distilbart_cnn_6_6_en.md * Update 2023-05-09-distilbart_xsum_12_6_en.md * Update 2023-05-09-distilbart_xsum_6_6_en.md --------- Co-authored-by: prabod <[email protected]> Co-authored-by: Maziyar Panahi <[email protected]> * 2023-05-11-distilbart_cnn_12_6_en (#13795) * Add model 2023-05-11-distilbart_cnn_12_6_en * Add model 2023-05-11-distilbart_cnn_6_6_en * Add model 2023-05-11-distilbart_xsum_12_6_en * Add model 2023-05-11-distilbart_xsum_6_6_en * Add model 2023-05-11-bart_large_cnn_en * Update 2023-05-11-bart_large_cnn_en.md * Update 2023-05-11-distilbart_cnn_12_6_en.md * Update 2023-05-11-distilbart_cnn_6_6_en.md * Update 2023-05-11-distilbart_xsum_12_6_en.md * Update 2023-05-11-distilbart_xsum_6_6_en.md --------- Co-authored-by: prabod <[email protected]> Co-authored-by: Maziyar Panahi <[email protected]> * 2023-05-19-match_pattern_en (#13805) * Add model 2023-05-19-match_pattern_en * Add model 2023-05-19-dependency_parse_en * Add model 2023-05-20-explain_document_md_fr * Add model 2023-05-20-dependency_parse_en * Add model 2023-05-20-explain_document_md_it * Add model 2023-05-20-entity_recognizer_lg_fr * Add model 2023-05-20-entity_recognizer_md_fr * Add model 2023-05-20-entity_recognizer_lg_it * Add model 2023-05-20-entity_recognizer_md_it * Add model 2023-05-20-check_spelling_en * Add model 2023-05-20-match_datetime_en * Add model 2023-05-20-match_pattern_en * Add model 2023-05-20-clean_pattern_en * Add model 2023-05-20-clean_stop_en * Add model 2023-05-20-movies_sentiment_analysis_en * Add model 2023-05-20-explain_document_ml_en * Add model 2023-05-20-analyze_sentiment_en * Add model 2023-05-20-explain_document_dl_en * Add model 2023-05-20-recognize_entities_dl_en * Add model 2023-05-20-recognize_entities_bert_en * Add model 2023-05-20-explain_document_md_de * Add model 2023-05-21-entity_recognizer_lg_de * Add model 2023-05-21-entity_recognizer_md_de * Add model 2023-05-21-onto_recognize_entities_sm_en * Add model 2023-05-21-onto_recognize_entities_lg_en * Add model 2023-05-21-match_chunks_en * Add model 2023-05-21-explain_document_lg_es * Add model 2023-05-21-explain_document_md_es * Add model 2023-05-21-explain_document_sm_es * Add model 2023-05-21-entity_recognizer_lg_es * Add model 2023-05-21-entity_recognizer_md_es * Add model 2023-05-21-entity_recognizer_sm_es * Add model 2023-05-21-explain_document_lg_ru * Add model 2023-05-21-explain_document_md_ru * Add model 2023-05-21-explain_document_sm_ru * Add model 2023-05-21-entity_recognizer_lg_ru * Add model 2023-05-21-entity_recognizer_md_ru * Add model 2023-05-21-entity_recognizer_sm_ru * Add model 2023-05-21-text_cleaning_en * Add model 2023-05-21-explain_document_lg_pt * Add model 2023-05-21-explain_document_md_pt * Add model 2023-05-21-explain_document_sm_pt * Add model 2023-05-21-entity_recognizer_lg_pt * Add model 2023-05-21-entity_recognizer_md_pt * Add model 2023-05-21-entity_recognizer_sm_pt * Add model 2023-05-21-explain_document_lg_pl * Add model 2023-05-21-explain_document_md_pl * Add model 2023-05-21-explain_document_sm_pl * Add model 2023-05-21-entity_recognizer_lg_pl * Add model 2023-05-21-entity_recognizer_md_pl * Add model 2023-05-21-entity_recognizer_sm_pl * Add model 2023-05-21-explain_document_lg_nl * Add model 2023-05-21-explain_document_md_nl * Add model 2023-05-21-explain_document_sm_nl * Add model 2023-05-21-entity_recognizer_lg_nl * Add model 2023-05-21-entity_recognizer_md_nl * Add model 2023-05-21-entity_recognizer_sm_nl * Add model 2023-05-21-analyze_sentimentdl_glove_imdb_en * Add model 2023-05-21-explain_document_lg_no * Add model 2023-05-21-explain_document_md_no * Add model 2023-05-21-explain_document_sm_no * Add model 2023-05-21-entity_recognizer_lg_no * Add model 2023-05-21-entity_recognizer_md_no * Add model 2023-05-21-entity_recognizer_sm_no * Add model 2023-05-21-explain_document_lg_sv * Add model 2023-05-21-explain_document_md_sv * Add model 2023-05-21-explain_document_sm_sv * Add model 2023-05-21-entity_recognizer_lg_sv * Add model 2023-05-21-entity_recognizer_md_sv * Add model 2023-05-21-entity_recognizer_sm_sv * Add model 2023-05-21-explain_document_lg_da * Add model 2023-05-21-explain_document_md_da * Add model 2023-05-21-explain_document_sm_da * Add model 2023-05-21-entity_recognizer_lg_da * Add model 2023-05-21-entity_recognizer_md_da * Add model 2023-05-21-entity_recognizer_sm_da * Add model 2023-05-21-explain_document_lg_fi * Add model 2023-05-21-explain_document_md_fi * Add model 2023-05-21-explain_document_sm_fi * Add model 2023-05-21-entity_recognizer_lg_fi * Add model 2023-05-21-entity_recognizer_md_fi * Add model 2023-05-21-entity_recognizer_sm_fi * Add model 2023-05-21-onto_recognize_entities_bert_base_en * Add model 2023-05-21-onto_recognize_entities_bert_large_en * Add model 2023-05-21-onto_recognize_entities_bert_medium_en * Add model 2023-05-21-onto_recognize_entities_bert_mini_en * Add model 2023-05-21-onto_recognize_entities_bert_small_en * Add model 2023-05-21-onto_recognize_entities_bert_tiny_en * Add model 2023-05-21-onto_recognize_entities_electra_base_en * Add model 2023-05-21-onto_recognize_entities_electra_small_en * Add model 2023-05-21-onto_recognize_entities_electra_large_en * Add model 2023-05-21-recognize_entities_dl_fa * Add model 2023-05-21-nerdl_fewnerd_subentity_100d_pipeline_en * Add model 2023-05-21-nerdl_fewnerd_100d_pipeline_en * Add model 2023-05-21-pos_ud_bokmaal_nb * Add model 2023-05-21-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-21-bert_token_classifier_scandi_ner_pipeline_xx * Add model 2023-05-21-bert_sequence_classifier_trec_coarse_pipeline_en * Add model 2023-05-21-bert_sequence_classifier_age_news_pipeline_en * Add model 2023-05-21-distilbert_token_classifier_typo_detector_pipeline_is * Add model 2023-05-21-distilbert_base_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-21-nerdl_restaurant_100d_pipeline_en * Add model 2023-05-21-roberta_token_classifier_timex_semeval_pipeline_en * Add model 2023-05-21-bert_token_classifier_hi_en_ner_pipeline_hi * Add model 2023-05-21-xlm_roberta_large_token_classifier_hrl_pipeline_xx * Add model 2023-05-21-spellcheck_dl_pipeline_en * Add model 2023-05-21-bert_token_classifier_dutch_udlassy_ner_pipeline_nl * Add model 2023-05-21-xlm_roberta_large_token_classifier_conll03_pipeline_de * Add model 2023-05-21-roberta_token_classifier_bne_capitel_ner_pipeline_es * Add model 2023-05-21-roberta_token_classifier_icelandic_ner_pipeline_is * Add model 2023-05-21-longformer_base_token_classifier_conll03_pipeline_en * Add model 2023-05-21-longformer_large_token_classifier_conll03_pipeline_en * Add model 2023-05-21-xlnet_base_token_classifier_conll03_pipeline_en * Add model 2023-05-21-xlm_roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-21-xlm_roberta_base_token_classifier_conll03_pipeline_en * Add model 2023-05-21-xlnet_large_token_classifier_conll03_pipeline_en * Add model 2023-05-21-albert_base_token_classifier_conll03_pipeline_en * Add model 2023-05-21-albert_large_token_classifier_conll03_pipeline_en * Add model 2023-05-21-albert_xlarge_token_classifier_conll03_pipeline_en * Add model 2023-05-21-distilroberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-21-roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-21-roberta_large_token_classifier_conll03_pipeline_en * Add model 2023-05-21-distilbert_token_classifier_typo_detector_pipeline_en --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-22-explain_document_md_fr (#13811) * Add model 2023-05-22-explain_document_md_fr * Add model 2023-05-22-dependency_parse_en * Add model 2023-05-22-explain_document_md_it * Add model 2023-05-22-entity_recognizer_lg_fr * Add model 2023-05-22-entity_recognizer_md_fr * Add model 2023-05-22-entity_recognizer_lg_it * Add model 2023-05-22-entity_recognizer_md_it * Add model 2023-05-22-check_spelling_en * Add model 2023-05-22-match_datetime_en * Add model 2023-05-22-match_pattern_en * Add model 2023-05-22-clean_pattern_en * Add model 2023-05-22-clean_stop_en * Add model 2023-05-22-movies_sentiment_analysis_en * Add model 2023-05-22-explain_document_ml_en * Add model 2023-05-22-analyze_sentiment_en * Add model 2023-05-22-explain_document_dl_en * Add model 2023-05-22-recognize_entities_dl_en * Add model 2023-05-22-recognize_entities_bert_en * Add model 2023-05-22-explain_document_md_de * Add model 2023-05-22-entity_recognizer_lg_de * Add model 2023-05-22-entity_recognizer_md_de * Add model 2023-05-22-onto_recognize_entities_sm_en * Add model 2023-05-22-onto_recognize_entities_lg_en * Add model 2023-05-22-match_chunks_en * Add model 2023-05-22-explain_document_lg_es * Add model 2023-05-22-explain_document_md_es * Add model 2023-05-22-explain_document_sm_es * Add model 2023-05-22-entity_recognizer_lg_es * Add model 2023-05-22-entity_recognizer_md_es * Add model 2023-05-22-entity_recognizer_sm_es * Add model 2023-05-22-explain_document_lg_ru * Add model 2023-05-22-explain_document_md_ru * Add model 2023-05-22-explain_document_sm_ru * Add model 2023-05-22-entity_recognizer_lg_ru * Add model 2023-05-22-entity_recognizer_md_ru * Add model 2023-05-22-entity_recognizer_sm_ru * Add model 2023-05-22-text_cleaning_en * Add model 2023-05-22-explain_document_lg_pt * Add model 2023-05-22-explain_document_md_pt * Add model 2023-05-22-explain_document_sm_pt * Add model 2023-05-22-entity_recognizer_lg_pt * Add model 2023-05-22-entity_recognizer_md_pt * Add model 2023-05-22-entity_recognizer_sm_pt * Add model 2023-05-22-explain_document_lg_pl * Add model 2023-05-22-explain_document_md_pl * Add model 2023-05-22-explain_document_sm_pl * Add model 2023-05-22-entity_recognizer_lg_pl * Add model 2023-05-22-entity_recognizer_md_pl * Add model 2023-05-22-entity_recognizer_sm_pl * Add model 2023-05-22-explain_document_lg_nl * Add model 2023-05-22-explain_document_md_nl * Add model 2023-05-22-explain_document_sm_nl * Add model 2023-05-22-entity_recognizer_lg_nl * Add model 2023-05-22-entity_recognizer_md_nl * Add model 2023-05-22-entity_recognizer_sm_nl * Add model 2023-05-22-analyze_sentimentdl_glove_imdb_en * Add model 2023-05-22-explain_document_lg_no * Add model 2023-05-22-explain_document_md_no * Add model 2023-05-22-explain_document_sm_no * Add model 2023-05-22-entity_recognizer_md_no * Add model 2023-05-22-entity_recognizer_sm_no * Add model 2023-05-22-explain_document_lg_sv * Add model 2023-05-22-explain_document_md_sv * Add model 2023-05-22-explain_document_sm_sv * Add model 2023-05-22-entity_recognizer_lg_sv * Add model 2023-05-22-entity_recognizer_md_sv * Add model 2023-05-22-entity_recognizer_sm_sv * Add model 2023-05-22-explain_document_lg_da * Add model 2023-05-22-explain_document_md_da * Add model 2023-05-22-explain_document_sm_da * Add model 2023-05-22-entity_recognizer_lg_da * Add model 2023-05-22-entity_recognizer_md_da * Add model 2023-05-22-entity_recognizer_sm_da * Add model 2023-05-22-explain_document_lg_fi * Add model 2023-05-22-explain_document_md_fi * Add model 2023-05-22-explain_document_sm_fi * Add model 2023-05-22-entity_recognizer_lg_fi * Add model 2023-05-22-entity_recognizer_md_fi * Add model 2023-05-22-entity_recognizer_sm_fi * Add model 2023-05-22-onto_recognize_entities_bert_base_en * Add model 2023-05-22-onto_recognize_entities_bert_large_en * Add model 2023-05-22-onto_recognize_entities_bert_medium_en * Add model 2023-05-22-onto_recognize_entities_bert_mini_en * Add model 2023-05-22-onto_recognize_entities_bert_small_en * Add model 2023-05-22-onto_recognize_entities_bert_tiny_en * Add model 2023-05-22-onto_recognize_entities_electra_base_en * Add model 2023-05-22-onto_recognize_entities_electra_small_en * Add model 2023-05-22-onto_recognize_entities_electra_large_en * Add model 2023-05-22-recognize_entities_dl_fa * Add model 2023-05-22-nerdl_fewnerd_subentity_100d_pipeline_en * Add model 2023-05-22-nerdl_fewnerd_100d_pipeline_en * Add model 2023-05-22-pos_ud_bokmaal_nb * Add model 2023-05-22-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-22-bert_token_classifier_scandi_ner_pipeline_xx * Add model 2023-05-22-bert_sequence_classifier_trec_coarse_pipeline_en * Add model 2023-05-22-bert_sequence_classifier_age_news_pipeline_en * Add model 2023-05-22-distilbert_token_classifier_typo_detector_pipeline_is * Add model 2023-05-22-distilbert_base_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-22-nerdl_restaurant_100d_pipeline_en * Add model 2023-05-22-roberta_token_classifier_timex_semeval_pipeline_en * Add model 2023-05-22-bert_token_classifier_hi_en_ner_pipeline_hi * Add model 2023-05-22-xlm_roberta_large_token_classifier_hrl_pipeline_xx * Add model 2023-05-22-spellcheck_dl_pipeline_en * Add model 2023-05-22-bert_token_classifier_dutch_udlassy_ner_pipeline_nl * Add model 2023-05-22-xlm_roberta_large_token_classifier_conll03_pipeline_de * Add model 2023-05-22-roberta_token_classifier_bne_capitel_ner_pipeline_es * Add model 2023-05-22-roberta_token_classifier_icelandic_ner_pipeline_is * Add model 2023-05-22-longformer_base_token_classifier_conll03_pipeline_en * Add model 2023-05-22-longformer_large_token_classifier_conll03_pipeline_en * Add model 2023-05-22-xlnet_base_token_classifier_conll03_pipeline_en * Add model 2023-05-22-xlm_roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-22-xlm_roberta_base_token_classifier_conll03_pipeline_en * Add model 2023-05-22-xlnet_large_token_classifier_conll03_pipeline_en * Add model 2023-05-22-albert_base_token_classifier_conll03_pipeline_en * Add model 2023-05-22-albert_large_token_classifier_conll03_pipeline_en * Add model 2023-05-22-albert_xlarge_token_classifier_conll03_pipeline_en * Add model 2023-05-22-distilroberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-22-roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-22-roberta_large_token_classifier_conll03_pipeline_en * Add model 2023-05-22-distilbert_token_classifier_typo_detector_pipeline_en --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-24-explain_document_md_fr (#13821) * Add model 2023-05-24-explain_document_md_fr * Add model 2023-05-24-dependency_parse_en * Add model 2023-05-24-explain_document_md_it * Add model 2023-05-24-entity_recognizer_lg_fr * Add model 2023-05-24-entity_recognizer_md_fr * Add model 2023-05-24-entity_recognizer_lg_it * Add model 2023-05-24-entity_recognizer_md_it * Add model 2023-05-24-check_spelling_en * Add model 2023-05-24-match_datetime_en * Add model 2023-05-24-match_pattern_en * Add model 2023-05-24-clean_pattern_en * Add model 2023-05-24-clean_stop_en * Add model 2023-05-24-movies_sentiment_analysis_en * Add model 2023-05-24-explain_document_ml_en * Add model 2023-05-24-analyze_sentiment_en * Add model 2023-05-24-explain_document_dl_en * Add model 2023-05-24-recognize_entities_dl_en * Add model 2023-05-24-recognize_entities_bert_en * Add model 2023-05-24-explain_document_md_de * Add model 2023-05-24-entity_recognizer_lg_de * Add model 2023-05-24-entity_recognizer_md_de * Add model 2023-05-24-onto_recognize_entities_sm_en * Add model 2023-05-24-onto_recognize_entities_lg_en * Add model 2023-05-24-match_chunks_en * Add model 2023-05-24-explain_document_lg_es * Add model 2023-05-24-explain_document_md_es * Add model 2023-05-24-explain_document_sm_es * Add model 2023-05-24-entity_recognizer_lg_es * Add model 2023-05-24-entity_recognizer_md_es * Add model 2023-05-24-entity_recognizer_sm_es * Add model 2023-05-24-explain_document_lg_ru * Add model 2023-05-24-explain_document_md_ru * Add model 2023-05-24-explain_document_sm_ru * Add model 2023-05-24-entity_recognizer_lg_ru * Add model 2023-05-24-entity_recognizer_md_ru * Add model 2023-05-24-entity_recognizer_sm_ru * Add model 2023-05-24-text_cleaning_en * Add model 2023-05-24-explain_document_lg_pt * Add model 2023-05-24-explain_document_md_pt * Add model 2023-05-24-explain_document_sm_pt * Add model 2023-05-24-entity_recognizer_lg_pt * Add model 2023-05-24-entity_recognizer_md_pt * Add model 2023-05-24-entity_recognizer_sm_pt * Add model 2023-05-24-explain_document_lg_pl * Add model 2023-05-24-explain_document_md_pl * Add model 2023-05-24-explain_document_sm_pl * Add model 2023-05-24-entity_recognizer_lg_pl * Add model 2023-05-24-entity_recognizer_md_pl * Add model 2023-05-24-entity_recognizer_sm_pl * Add model 2023-05-24-explain_document_lg_nl * Add model 2023-05-24-explain_document_md_nl * Add model 2023-05-24-explain_document_sm_nl * Add model 2023-05-24-entity_recognizer_lg_nl * Add model 2023-05-24-entity_recognizer_md_nl * Add model 2023-05-24-entity_recognizer_sm_nl * Add model 2023-05-24-analyze_sentimentdl_glove_imdb_en * Add model 2023-05-24-explain_document_lg_no * Add model 2023-05-24-explain_document_md_no * Add model 2023-05-24-explain_document_sm_no * Add model 2023-05-24-entity_recognizer_lg_no * Add model 2023-05-24-entity_recognizer_md_no * Add model 2023-05-24-entity_recognizer_sm_no * Add model 2023-05-24-explain_document_lg_sv * Add model 2023-05-24-explain_document_md_sv * Add model 2023-05-24-explain_document_sm_sv * Add model 2023-05-24-entity_recognizer_lg_sv * Add model 2023-05-24-entity_recognizer_md_sv * Add model 2023-05-24-entity_recognizer_sm_sv * Add model 2023-05-25-explain_document_lg_da * Add model 2023-05-25-explain_document_md_da * Add model 2023-05-25-explain_document_sm_da * Add model 2023-05-25-entity_recognizer_lg_da * Add model 2023-05-25-entity_recognizer_md_da * Add model 2023-05-25-entity_recognizer_sm_da * Add model 2023-05-25-explain_document_lg_fi * Add model 2023-05-25-explain_document_md_fi * Add model 2023-05-25-explain_document_sm_fi * Add model 2023-05-25-entity_recognizer_lg_fi * Add model 2023-05-25-entity_recognizer_md_fi * Add model 2023-05-25-entity_recognizer_sm_fi * Add model 2023-05-25-onto_recognize_entities_bert_base_en * Add model 2023-05-25-onto_recognize_entities_bert_large_en * Add model 2023-05-25-onto_recognize_entities_bert_medium_en * Add model 2023-05-25-onto_recognize_entities_bert_mini_en * Add model 2023-05-25-onto_recognize_entities_bert_small_en * Add model 2023-05-25-onto_recognize_entities_bert_tiny_en * Add model 2023-05-25-onto_recognize_entities_electra_base_en * Add model 2023-05-25-onto_recognize_entities_electra_small_en * Add model 2023-05-25-onto_recognize_entities_electra_large_en * Add model 2023-05-25-recognize_entities_dl_fa * Add model 2023-05-25-nerdl_fewnerd_subentity_100d_pipeline_en * Add model 2023-05-25-nerdl_fewnerd_100d_pipeline_en * Add model 2023-05-25-pos_ud_bokmaal_nb * Add model 2023-05-25-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-25-bert_token_classifier_scandi_ner_pipeline_xx * Add model 2023-05-25-bert_sequence_classifier_trec_coarse_pipeline_en * Add model 2023-05-25-bert_sequence_classifier_age_news_pipeline_en * Add model 2023-05-25-distilbert_token_classifier_typo_detector_pipeline_is * Add model 2023-05-25-distilbert_base_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-25-nerdl_restaurant_100d_pipeline_en * Add model 2023-05-25-roberta_token_classifier_timex_semeval_pipeline_en * Add model 2023-05-25-bert_token_classifier_hi_en_ner_pipeline_hi * Add model 2023-05-25-xlm_roberta_large_token_classifier_hrl_pipeline_xx * Add model 2023-05-25-spellcheck_dl_pipeline_en * Add model 2023-05-25-bert_token_classifier_dutch_udlassy_ner_pipeline_nl * Add model 2023-05-25-xlm_roberta_large_token_classifier_conll03_pipeline_de * Add model 2023-05-25-roberta_token_classifier_bne_capitel_ner_pipeline_es * Add model 2023-05-25-roberta_token_classifier_icelandic_ner_pipeline_is * Add model 2023-05-25-longformer_base_token_classifier_conll03_pipeline_en * Add model 2023-05-25-longformer_large_token_classifier_conll03_pipeline_en * Add model 2023-05-25-xlnet_base_token_classifier_conll03_pipeline_en * Add model 2023-05-25-xlm_roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-25-xlm_roberta_base_token_classifier_conll03_pipeline_en * Add model 2023-05-25-xlnet_large_token_classifier_conll03_pipeline_en * Add model 2023-05-25-albert_base_token_classifier_conll03_pipeline_en * Add model 2023-05-25-albert_large_token_classifier_conll03_pipeline_en * Add model 2023-05-25-albert_xlarge_token_classifier_conll03_pipeline_en * Add model 2023-05-25-distilroberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-25-roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-25-roberta_large_token_classifier_conll03_pipeline_en * Add model 2023-05-25-distilbert_token_classifier_typo_detector_pipeline_en --------- Co-authored-by: ahmedlone127 <[email protected]> * Add model 2023-05-25-explain_document_md_fr (#13827) Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-25-dependency_parse_en (#13828) * Add model 2023-05-25-dependency_parse_en * Add model 2023-05-25-explain_document_md_it * Add model 2023-05-25-entity_recognizer_lg_fr * Add model 2023-05-25-entity_recognizer_md_fr * Add model 2023-05-25-entity_recognizer_lg_it * Add model 2023-05-25-entity_recognizer_md_it * Add model 2023-05-25-check_spelling_en * Add model 2023-05-25-match_datetime_en * Add model 2023-05-25-match_pattern_en * Add model 2023-05-25-clean_pattern_en * Add model 2023-05-25-clean_stop_en * Add model 2023-05-25-movies_sentiment_analysis_en * Add model 2023-05-25-explain_document_ml_en * Add model 2023-05-25-analyze_sentiment_en * Add model 2023-05-25-explain_document_dl_en * Add model 2023-05-25-recognize_entities_dl_en * Add model 2023-05-25-recognize_entities_bert_en * Add model 2023-05-25-explain_document_md_de * Add model 2023-05-25-entity_recognizer_lg_de * Add model 2023-05-25-entity_recognizer_md_de * Add model 2023-05-25-onto_recognize_entities_sm_en * Add model 2023-05-25-onto_recognize_entities_lg_en * Add model 2023-05-25-match_chunks_en * Add model 2023-05-25-explain_document_lg_es * Add model 2023-05-25-explain_document_md_es * Add model 2023-05-25-explain_document_sm_es * Add model 2023-05-25-entity_recognizer_lg_es * Add model 2023-05-25-entity_recognizer_md_es * Add model 2023-05-25-entity_recognizer_sm_es * Add model 2023-05-25-explain_document_lg_ru * Add model 2023-05-25-explain_document_md_ru * Add model 2023-05-25-explain_document_sm_ru * Add model 2023-05-25-entity_recognizer_lg_ru * Add model 2023-05-25-entity_recognizer_md_ru * Add model 2023-05-25-entity_recognizer_sm_ru * Add model 2023-05-25-text_cleaning_en * Add model 2023-05-25-explain_document_lg_pt * Add model 2023-05-25-explain_document_md_pt * Add model 2023-05-25-explain_document_sm_pt * Add model 2023-05-25-entity_recognizer_lg_pt * Add model 2023-05-25-entity_recognizer_md_pt * Add model 2023-05-25-entity_recognizer_sm_pt * Add model 2023-05-25-explain_document_lg_pl * Add model 2023-05-25-explain_document_md_pl * Add model 2023-05-25-explain_document_sm_pl * Add model 2023-05-25-entity_recognizer_lg_pl * Add model 2023-05-25-entity_recognizer_md_pl * Add model 2023-05-25-entity_recognizer_sm_pl * Add model 2023-05-25-explain_document_lg_nl * Add model 2023-05-25-explain_document_md_nl * Add model 2023-05-25-explain_document_sm_nl * Add model 2023-05-25-entity_recognizer_lg_nl * Add model 2023-05-25-entity_recognizer_md_nl * Add model 2023-05-25-entity_recognizer_sm_nl * Add model 2023-05-25-analyze_sentimentdl_glove_imdb_en * Add model 2023-05-25-explain_document_lg_no * Add model 2023-05-25-explain_document_md_no * Add model 2023-05-25-explain_document_sm_no * Add model 2023-05-25-entity_recognizer_lg_no * Add model 2023-05-25-entity_recognizer_md_no * Add model 2023-05-25-entity_recognizer_sm_no * Add model 2023-05-25-explain_document_lg_sv * Add model 2023-05-25-explain_document_md_sv * Add model 2023-05-25-explain_document_sm_sv * Add model 2023-05-25-entity_recognizer_lg_sv * Add model 2023-05-25-entity_recognizer_md_sv * Add model 2023-05-25-entity_recognizer_sm_sv * Add model 2023-05-25-explain_document_lg_da * Add model 2023-05-25-explain_document_md_da * Add model 2023-05-25-explain_document_sm_da * Add model 2023-05-25-entity_recognizer_lg_da * Add model 2023-05-25-entity_recognizer_md_da * Add model 2023-05-25-entity_recognizer_sm_da * Add model 2023-05-25-explain_document_lg_fi * Add model 2023-05-25-explain_document_md_fi * Add model 2023-05-25-explain_document_sm_fi * Add model 2023-05-25-entity_recognizer_lg_fi * Add model 2023-05-25-entity_recognizer_md_fi * Add model 2023-05-25-entity_recognizer_sm_fi * Add model 2023-05-25-onto_recognize_entities_bert_base_en * Add model 2023-05-25-onto_recognize_entities_bert_large_en * Add model 2023-05-25-onto_recognize_entities_bert_medium_en * Add model 2023-05-25-onto_recognize_entities_bert_mini_en * Add model 2023-05-25-onto_recognize_entities_bert_small_en * Add model 2023-05-25-onto_recognize_entities_bert_tiny_en * Add model 2023-05-25-onto_recognize_entities_electra_base_en * Add model 2023-05-25-onto_recognize_entities_electra_small_en * Add model 2023-05-25-onto_recognize_entities_electra_large_en * Add model 2023-05-26-recognize_entities_dl_fa * Add model 2023-05-26-nerdl_fewnerd_subentity_100d_pipeline_en * Add model 2023-05-26-nerdl_fewnerd_100d_pipeline_en * Add model 2023-05-26-pos_ud_bokmaal_nb * Add model 2023-05-26-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-26-bert_token_classifier_scandi_ner_pipeline_xx * Add model 2023-05-26-bert_sequence_classifier_trec_coarse_pipeline_en * Add model 2023-05-26-bert_sequence_classifier_age_news_pipeline_en * Add model 2023-05-26-distilbert_token_classifier_typo_detector_pipeline_is * Add model 2023-05-26-distilbert_base_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-26-nerdl_restaurant_100d_pipeline_en * Add model 2023-05-26-roberta_token_classifier_timex_semeval_pipeline_en * Add model 2023-05-26-bert_token_classifier_hi_en_ner_pipeline_hi * Add model 2023-05-26-xlm_roberta_large_token_classifier_hrl_pipeline_xx * Add model 2023-05-26-spellcheck_dl_pipeline_en * Add model 2023-05-26-bert_token_classifier_dutch_udlassy_ner_pipeline_nl * Add model 2023-05-26-xlm_roberta_large_token_classifier_conll03_pipeline_de * Add model 2023-05-26-roberta_token_classifier_bne_capitel_ner_pipeline_es * Add model 2023-05-26-roberta_token_classifier_icelandic_ner_pipeline_is * Add model 2023-05-26-longformer_base_token_classifier_conll03_pipeline_en * Add model 2023-05-26-longformer_large_token_classifier_conll03_pipeline_en * Add model 2023-05-26-xlnet_base_token_classifier_conll03_pipeline_en * Add model 2023-05-26-xlm_roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-26-xlm_roberta_base_token_classifier_conll03_pipeline_en * Add model 2023-05-26-xlnet_large_token_classifier_conll03_pipeline_en * Add model 2023-05-26-albert_base_token_classifier_conll03_pipeline_en * Add model 2023-05-26-albert_large_token_classifier_conll03_pipeline_en * Add model 2023-05-26-albert_xlarge_token_classifier_conll03_pipeline_en * Add model 2023-05-26-distilroberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-26-roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-26-roberta_large_token_classifier_conll03_pipeline_en * Add model 2023-05-26-distilbert_token_classifier_typo_detector_pipeline_en --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-25-distilcamembert_french_legal_fr (#13826) * Add model 2023-05-25-distilcamembert_french_legal_fr * Update 2023-05-25-distilcamembert_french_legal_fr.md * Update 2023-05-25-distilcamembert_french_legal_fr.md * Add model 2023-05-25-camembert_french_legal_fr * Update 2023-05-25-camembert_french_legal_fr.md * Update 2023-05-25-camembert_french_legal_fr.md * Update 2023-05-25-distilcamembert_french_legal_fr.md --------- Co-authored-by: Mary-Sci <[email protected]> Co-authored-by: Merve Ertas Uslu <[email protected]> * Update title for 2023-05-25-distilcamembert_french_legal_fr.md (#13831) * 2023-05-27-explain_document_md_fr (#13836) * Add model 2023-05-27-explain_document_md_fr * Add model 2023-05-27-dependency_parse_en * Add model 2023-05-27-explain_document_md_it * Add model 2023-05-27-entity_recognizer_lg_fr * Add model 2023-05-27-entity_recognizer_md_fr * Add model 2023-05-27-entity_recognizer_lg_it * Add model 2023-05-27-entity_recognizer_md_it * Add model 2023-05-27-check_spelling_en * Add model 2023-05-27-match_datetime_en * Add model 2023-05-27-match_pattern_en * Add model 2023-05-27-clean_pattern_en * Add model 2023-05-27-clean_stop_en * Add model 2023-05-27-movies_sentiment_analysis_en * Add model 2023-05-27-explain_document_ml_en * Add model 2023-05-27-analyze_sentiment_en * Add model 2023-05-27-explain_document_dl_en * Add model 2023-05-27-recognize_entities_dl_en * Add model 2023-05-27-recognize_entities_bert_en * Add model 2023-05-27-explain_document_md_de * Add model 2023-05-27-entity_recognizer_lg_de * Add model 2023-05-27-entity_recognizer_md_de * Add model 2023-05-27-onto_recognize_entities_sm_en * Add model 2023-05-27-onto_recognize_entities_lg_en * Add model 2023-05-27-match_chunks_en * Add model 2023-05-27-explain_document_lg_es * Add model 2023-05-27-explain_document_md_es * Add model 2023-05-27-explain_document_sm_es * Add model 2023-05-27-entity_recognizer_lg_es * Add model 2023-05-27-entity_recognizer_md_es * Add model 2023-05-27-entity_recognizer_sm_es * Add model 2023-05-27-explain_document_lg_ru * Add model 2023-05-27-explain_document_md_ru * Add model 2023-05-27-explain_document_sm_ru * Add model 2023-05-27-entity_recognizer_lg_ru * Add model 2023-05-27-entity_recognizer_md_ru * Add model 2023-05-27-entity_recognizer_sm_ru * Add model 2023-05-27-text_cleaning_en * Add model 2023-05-27-explain_document_lg_pt * Add model 2023-05-27-explain_document_md_pt * Add model 2023-05-27-explain_document_sm_pt * Add model 2023-05-27-entity_recognizer_lg_pt * Add model 2023-05-27-entity_recognizer_md_pt * Add model 2023-05-27-entity_recognizer_sm_pt * Add model 2023-05-27-explain_document_lg_pl * Add model 2023-05-27-explain_document_md_pl * Add model 2023-05-27-explain_document_sm_pl * Add model 2023-05-27-entity_recognizer_lg_pl * Add model 2023-05-27-entity_recognizer_md_pl * Add model 2023-05-27-entity_recognizer_sm_pl * Add model 2023-05-27-explain_document_lg_nl * Add model 2023-05-27-explain_document_md_nl * Add model 2023-05-27-explain_document_sm_nl * Add model 2023-05-27-entity_recognizer_lg_nl * Add model 2023-05-27-entity_recognizer_md_nl * Add model 2023-05-27-entity_recognizer_sm_nl * Add model 2023-05-27-analyze_sentimentdl_glove_imdb_en * Add model 2023-05-27-explain_document_lg_no * Add model 2023-05-27-explain_document_md_no * Add model 2023-05-27-explain_document_sm_no * Add model 2023-05-27-entity_recognizer_lg_no * Add model 2023-05-27-entity_recognizer_md_no * Add model 2023-05-27-entity_recognizer_sm_no * Add model 2023-05-27-explain_document_lg_sv * Add model 2023-05-27-explain_document_md_sv * Add model 2023-05-27-explain_document_sm_sv * Add model 2023-05-27-entity_recognizer_lg_sv * Add model 2023-05-27-entity_recognizer_md_sv * Add model 2023-05-27-entity_recognizer_sm_sv * Add model 2023-05-27-explain_document_lg_da * Add model 2023-05-27-explain_document_md_da * Add model 2023-05-27-explain_document_sm_da * Add model 2023-05-27-entity_recognizer_lg_da * Add model 2023-05-27-entity_recognizer_md_da * Add model 2023-05-27-entity_recognizer_sm_da * Add model 2023-05-27-explain_document_lg_fi * Add model 2023-05-27-explain_document_md_fi * Add model 2023-05-27-explain_document_sm_fi * Add model 2023-05-27-entity_recognizer_lg_fi * Add model 2023-05-27-entity_recognizer_md_fi * Add model 2023-05-27-entity_recognizer_sm_fi * Add model 2023-05-27-onto_recognize_entities_bert_base_en * Add model 2023-05-27-onto_recognize_entities_bert_large_en * Add model 2023-05-27-onto_recognize_entities_bert_medium_en * Add model 2023-05-27-onto_recognize_entities_bert_mini_en * Add model 2023-05-27-onto_recognize_entities_bert_small_en * Add model 2023-05-27-onto_recognize_entities_bert_tiny_en * Add model 2023-05-27-onto_recognize_entities_electra_base_en * Add model 2023-05-27-onto_recognize_entities_electra_small_en * Add model 2023-05-27-onto_recognize_entities_electra_large_en * Add model 2023-05-27-recognize_entities_dl_fa * Add model 2023-05-27-nerdl_fewnerd_subentity_100d_pipeline_en * Add model 2023-05-27-nerdl_fewnerd_100d_pipeline_en * Add model 2023-05-27-pos_ud_bokmaal_nb * Add model 2023-05-27-xlm_roberta_large_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-27-bert_token_classifier_scandi_ner_pipeline_xx * Add model 2023-05-27-bert_sequence_classifier_trec_coarse_pipeline_en * Add model 2023-05-27-bert_sequence_classifier_age_news_pipeline_en * Add model 2023-05-27-distilbert_token_classifier_typo_detector_pipeline_is * Add model 2023-05-27-distilbert_base_token_classifier_masakhaner_pipeline_xx * Add model 2023-05-27-nerdl_restaurant_100d_pipeline_en * Add model 2023-05-27-roberta_token_classifier_timex_semeval_pipeline_en * Add model 2023-05-27-bert_token_classifier_hi_en_ner_pipeline_hi * Add model 2023-05-27-xlm_roberta_large_token_classifier_hrl_pipeline_xx * Add model 2023-05-27-spellcheck_dl_pipeline_en * Add model 2023-05-27-bert_token_classifier_dutch_udlassy_ner_pipeline_nl * Add model 2023-05-27-xlm_roberta_large_token_classifier_conll03_pipeline_de * Add model 2023-05-27-roberta_token_classifier_bne_capitel_ner_pipeline_es * Add model 2023-05-27-roberta_token_classifier_icelandic_ner_pipeline_is * Add model 2023-05-27-longformer_base_token_classifier_conll03_pipeline_en * Add model 2023-05-27-longformer_large_token_classifier_conll03_pipeline_en * Add model 2023-05-27-xlnet_base_token_classifier_conll03_pipeline_en * Add model 2023-05-27-xlm_roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-27-xlm_roberta_base_token_classifier_conll03_pipeline_en * Add model 2023-05-27-xlnet_large_token_classifier_conll03_pipeline_en * Add model 2023-05-27-albert_base_token_classifier_conll03_pipeline_en * Add model 2023-05-27-albert_large_token_classifier_conll03_pipeline_en * Add model 2023-05-27-albert_xlarge_token_classifier_conll03_pipeline_en * Add model 2023-05-27-distilroberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-27-roberta_base_token_classifier_ontonotes_pipeline_en * Add model 2023-05-27-roberta_large_token_classifier_conll03_pipeline_en * Add model 2023-05-27-distilbert_token_classifier_typo_detector_pipeline_en --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-05-28-longformer_base_english_legal_en (#13838) * Add model 2023-05-28-longformer_base_english_legal_en * Update 2023-05-28-longformer_base_english_legal_en.md --------- Co-authored-by: Mary-Sci <[email protected]> Co-authored-by: Merve Ertas Uslu <[email protected]> * 2023-05-28-xlm_longformer_base_english_legal_en (#13839) * Add model 2023-05-28-xlm_longformer_base_english_legal_en * Update 2023-05-28-xlm_longformer_base_english_legal_en.md * Add model 2023-05-28-longformer_large_english_legal_en * Update 2023-05-28-longformer_large_english_legal_en.md --------- Co-authored-by: Mary-Sci <[email protected]> Co-authored-by: Merve Ertas Uslu <[email protected]> * 2023-06-21-bert_embeddings_distil_clinical_en (#13861) * Add model 2023-06-21-bert_embeddings_distil_clinical_en * Add model 2023-06-21-bert_embeddings_carlbert_webex_mlm_spatial_en * Add model 2023-06-21-bert_embeddings_chemical_uncased_finetuned_cust_c2_en * Add model 2023-06-21-bert_embeddings_lsg16k_Italian_Legal_it * Add model 2023-06-21-bert_embeddings_chemical_uncased_finetuned_cust_c1_cust_en * Add model 2023-06-21-bert_embeddings_legalbert_adept_en * Add model 2023-06-21-bert_embeddings_base_uncased_issues_128_en * Add model 2023-06-21-bert_embeddings_pretrain_ko * Add model 2023-06-21-bert_embeddings_olm_base_uncased_oct_2022_en * Add model 2023-06-21-legalectra_small_es * Add model 2023-06-21-biobert_pubmed_base_cased_v1.2_en * Add model 2023-06-21-bert_embeddings_jobbert_base_cased_en * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_700000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_800000_cased_generator_de * Add model 2023-06-21-legalectra_base_es * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_900000_cased_generator_de * Add model 2023-06-21-bert_embeddings_scibert_scivocab_finetuned_cord19_en * Add model 2023-06-21-bert_embeddings_InLegalBERT_en * Add model 2023-06-21-bert_embeddings_InCaseLawBERT_en * Add model 2023-06-21-bert_base_uncased_contracts_en * Add model 2023-06-21-electra_embeddings_electra_base_turkish_mc4_uncased_generator_tr * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_500000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_generator_en * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_200000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_italian_xxl_cased_generator_it * Add model 2023-06-21-bert_embeddings_bioclinicalbert_finetuned_covid_papers_en * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_1000000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_600000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_400000_cased_generator_de * Add model 2023-06-21-electra_embeddings_finance_koelectra_base_generator_ko * Add model 2023-06-21-electra_embeddings_koelectra_base_v2_generator_ko * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_300000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_turkish_mc4_cased_generator_tr * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_0_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_small_generator_en * Add model 2023-06-21-electra_embeddings_electra_large_generator_en * Add model 2023-06-21-electra_embeddings_electricidad_base_generator_es * Add model 2023-06-21-electra_embeddings_gelectra_large_generator_de * Add model 2023-06-21-electra_embeddings_koelectra_base_generator_ko * Add model 2023-06-21-electra_embeddings_koelectra_base_v3_generator_ko * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_0_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_100000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_400000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_base_gc4_64k_600000_cased_generator_de * Add model 2023-06-21-electra_embeddings_electra_tagalog_small_cased_generator_tl * Add model 2023-06-21-electra_embeddings_gelectra_base_generator_de * Add model 2023-06-21-electra_embeddings_electra_tagalog_base_cased_generator_tl * Add model 2023-06-21-bert_sentence_embeddings_financial_de * Add model 2023-06-21-electra_embeddings_electra_small_japanese_generator_ja * Add model 2023-06-21-electra_embeddings_electra_tagalog_base_uncased_generator_tl * Add model 2023-06-21-electra_embeddings_koelectra_small_generator_ko * Add model 2023-06-21-electra_embeddings_finance_koelectra_small_generator_ko * Add model 2023-06-21-bert_embeddings_sec_bert_base_en * Add model 2023-06-21-electra_embeddings_kr_electra_generator_ko * Add model 2023-06-21-bert_embeddings_sec_bert_sh_en * Add model 2023-06-21-bert_embeddings_german_financial_statements_bert_de * Add model 2023-06-21-electra_embeddings_electra_tagalog_small_uncased_generator_tl * Add model 2023-06-21-bert_embeddings_javanese_bert_small_jv * Add model 2023-06-21-bert_embeddings_finest_bert_en * Add model 2023-06-21-bert_embeddings_indic_transformers_te_bert_te * Add model 2023-06-21-bert_embeddings_gbert_base_de * Add model 2023-06-21-bert_embeddings_indic_transformers_hi_bert_hi * Add model 2023-06-21-bert_embeddings_hateBERT_en * Add model 2023-06-21-bert_embeddings_false_positives_scancode_bert_base_uncased_L8_1_en * Add model 2023-06-21-bert_embeddings_finbert_pretrain_yiyanghkust_en * Add model 2023-06-21-bert_embeddings_indic_transformers_te_bert_te * Add model 2023-06-21-bert_embeddings_hseBert_it_cased_it * Add model 2023-06-21-bert_embeddings_finbert_pretrain_yiyanghkust_en * Add model 2023-06-21-bert_embeddings_dpr_spanish_question_encoder_allqa_base_es * Add model 2023-06-21-bert_embeddings_dziribert_ar * Add model 2023-06-21-bert_embeddings_deberta_base_uncased_en * Add model 2023-06-21-bert_embeddings_dbert_ko * Add model 2023-06-21-bert_embeddings_javanese_bert_small_imdb_jv * Add model 2023-06-21-bert_embeddings_dpr_spanish_passage_encoder_squades_base_es * Add model 2023-06-21-bert_embeddings_dpr_spanish_question_encoder_squades_base_es * Add model 2023-06-21-bert_embeddings_crosloengual_bert_en * Add model 2023-06-21-bert_embeddings_clinical_pubmed_bert_base_512_en * Add model 2023-06-21-bert_embeddings_dpr_spanish_passage_encoder_allqa_base_es * Add model 2023-06-21-bert_embeddings_legal_bert_base_uncased_en * Add model 2023-06-21-biobert_embeddings_all_pt * Add model 2023-06-21-bert_embeddings_wineberto_italian_cased_it * Add model 2023-06-21-bert_embeddings_clinical_pubmed_bert_base_128_en * Add model 2023-06-21-biobert_embeddings_clinical_pt * Add model 2023-06-21-bert_embeddings_telugu_bertu_te * Add model 2023-06-21-bert_embeddings_wobert_chinese_plus_zh * Add model 2023-06-21-bert_embeddings_wineberto_italian_cased_it * Add model 2023-06-21-bert_embeddings_sikuroberta_zh * Add model 2023-06-21-biobert_embeddings_biomedical_pt * Add model 2023-06-21-bert_embeddings_sikubert_zh * Add model 2023-06-21-bert_embeddings_psych_search_en * Add model 2023-06-21-bert_embeddings_marathi_bert_mr * Add model 2023-06-21-bert_embeddings_netbert_en * Add model 2023-06-21-bert_embeddings_mbert_ar_c19_ar * Add model 2023-06-21-bert_embeddings_multi_dialect_bert_base_arabic_ar * Add model 2023-06-21-bert_embeddings_lic_class_scancode_bert_base_cased_L32_1_en * Add model 2023-06-21-bert_embeddings_MARBERTv2_ar * Add model 2023-06-21-bert_embeddings_bert_base_cased_pt_lenerbr_pt * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_half_ar * Add model 2023-06-21-bert_embeddings_bert_base_german_cased_oldvocab_de * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_ar * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_eighth_ar * Add model 2023-06-21-bert_embeddings_bert_base_german_uncased_de * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_quarter_ar * Add model 2023-06-21-bert_embeddings_bert_base_historical_german_rw_cased_de * Add model 2023-06-21-bert_embeddings_bert_base_italian_xxl_uncased_it * Add model 2023-06-21-bert_embeddings_bert_base_arabertv2_ar * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_msa_sixteenth_ar * Add model 2023-06-21-bert_embeddings_bert_base_arabic_camelbert_mix_ar * Add model 2023-06-21-bert_embeddings_bert_base_italian_xxl_cased_it * Add model 2023-06-21-bert_embeddings_bert_base_gl_cased_pt * Add model 2023-06-21-bert_embeddings_MARBERT_ar * Add model 2023-06-21-bert_embeddings_AraBertMo_base_V1_ar * Add model 2023-06-21-bert_embeddings_bert_base_arabic_ar * Add model 2023-06-21-bert_embeddings_DarijaBERT_ar * Add model 2023-06-21-bert_embeddings_Ara_DialectBERT_ar * Add model 2023-06-21-bert_embeddings_German_MedBERT_de * Add model 2023-06-21-bert_embeddings_bert_base_arabertv02_twitter_ar * Add model 2023-06-21-bert_embeddings_FinancialBERT_en * Add model 2023-06-21-bert_embeddings_ARBERT_ar * Add model 2023-06-21-bert_embeddings_COVID_SciBERT_en * Add model 2023-06-21-bert_embeddings_alberti_bert_base_multilingual_cased_es * Add model 2023-06-21-bert_embeddings_agriculture_bert_uncased_en * Add model 2023-06-21-bert_embeddings_bangla_bert_bn * Add model 2023-06-21-bert_embeddings_bert_kor_base_ko * Add model 2023-06-21-bert_embeddings_bert_base_arabertv02_ar * Add model 2023-06-21-bert_embeddings_arabert_c19_ar * Add model 2023-06-21-bert_embeddings_bert_base_5lang_cased_es * Add model 2023-06-21-bert_embeddings_bert_base_arabertv01_ar * Add model 2023-06-21-bert_embeddings_bangla_bert_base_bn * Add model 2023-06-21-bert_embeddings_bert_medium_arabic_ar * Add model 2023-06-21-bert_embeddings_bert_political_election2020_twitter_mlm_en * Add model 2023-06-21-bert_embeddings_bert_mini_arabic_ar * Add model 2023-06-21-bert_embeddings_bert_base_arabert_ar * Add model 2023-06-21-bert_embeddings_beto_gn_base_cased_es * Add model 2023-06-21-bert_embeddings_chemical_bert_uncased_en * Add model 2023-06-21-bert_embeddings_bert_base_ko * Add model 2023-06-21-bert_embeddings_chefberto_italian_cased_it * Add model 2023-06-21-bert_embeddings_childes_bert_en * Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_finetuned_peticoes_pt * Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_finetuned_tcu_acordaos_pt * Add model 2023-06-21-bert_embeddings_bert_base_portuguese_cased_pt * Add model 2023-06-21-bert_embeddings_bert_base_qarib60_1790k_ar * Add model 2023-06-21-bert_embeddings_bert_base_uncased_dstc9_en * Add model 2023-06-21-bert_embeddings_bert_base_uncased_mnli_sparse_70_unstructured_no_classifier_en * Add model 2023-06-21-bert_embeddings_bert_base_qarib_ar * Add model 2023-06-21-bert_embeddings_bert_base_uncased_sparse_70_unstructured_en * Add model 2023-06-21-ms_bluebert_base_uncased_en * Add model 2023-06-21-bert_embeddings_bert_base_qarib60_860k_ar * fixing wrong spark version and removing tensorflow --------- Co-authored-by: ahmedlone127 <[email protected]> Co-authored-by: MaziyarPanahi <[email protected]> * 2023-06-26-distilbert_embeddings_finetuned_sarcasm_classification_en (#13867) * Add model 2023-06-26-distilbert_embeddings_finetuned_sarcasm_classification_en * Add model 2023-06-26-distilbert_embeddings_distilbert_base_indonesian_id * Add model 2023-06-26-distilbert_embeddings_BERTino_it * Add model 2023-06-26-distilbert_embeddings_distilbert_base_uncased_sparse_85_unstructured_pruneofa_en * Add model 2023-06-26-distilbert_embeddings_malaysian_distilbert_small_ms * Add model 2023-06-26-distilbert_embeddings_distilbert_fa_zwnj_base_fa * Add model 2023-06-26-distilbert_embeddings_javanese_distilbert_small_jv * Add model 2023-06-26-distilbert_embeddings_javanese_distilbert_small_imdb_jv * Add model 2023-06-26-distilbert_embeddings_indic_transformers_hi_distilbert_hi * Add model 2023-06-26-distilbert_embeddings_marathi_distilbert_mr * Add model 2023-06-26-distilbert_embeddings_indic_transformers_bn_distilbert_bn * Add model 2023-06-26-distilbert_embeddings_distilbert_base_uncased_sparse_90_unstructured_pruneofa_en * Add model 2023-06-26-deberta_embeddings_xsmall_dapt_scientific_papers_pubmed_en * Add model 2023-06-26-deberta_embeddings_spm_vie_vie * Add model 2023-06-26-deberta_embeddings_vie_small_vie * Add model 2023-06-26-deberta_embeddings_tapt_nbme_v3_base_en * Add model 2023-06-26-deberta_embeddings_erlangshen_v2_chinese_sentencepiece_zh * Add model 2023-06-26-deberta_v3_xsmall_en * Add model 2023-06-26-deberta_embeddings_mlm_test_en * Add model 2023-06-26-deberta_v3_small_en * Add model 2023-06-26-roberta_base_swiss_legal_gsw --------- Co-authored-by: ahmedlone127 <[email protected]> * 2023-06-27-roberta_embeddings_robertinh_gl (#13868) * Add model 2023-06-27-roberta_embeddings_robertinh_gl * Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_german_de * Add model 2023-06-27-roberta_embeddings_roberta_base_russian_v0_ru * Add model 2023-06-27-roberta_embeddings_ruperta_base_finetuned_spa_constitution_en * Add model 2023-06-27-roberta_embeddings_robasqu_eu * Add model 2023-06-27-roberta_embeddings_roberta_ko_small_ko * Add model 2023-06-27-roberta_embeddings_hindi_hi * Add model 2023-06-27-roberta_embeddings_sundanese_roberta_base_su * Add model 2023-06-27-roberta_embeddings_roberta_pubmed_en * Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_f_en * Add model 2023-06-27-roberta_embeddings_roberta_urdu_small_ur * Add model 2023-06-27-roberta_embeddings_BR_BERTo_pt * Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_d_s_en * Add model 2023-06-27-roberta_embeddings_distilroberta_base_climate_d_en * Add model 2023-06-27-roberta_embeddings_ukr_roberta_base_uk * Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_french_fr * Add model 2023-06-27-roberta_embeddings_Bible_roberta_base_en * Add model 2023-06-27-roberta_embeddings_bertin_roberta_large_spanish_es * Add model 2023-06-27-roberta_embeddings_roberta_base_wechsel_chinese_zh * Add model 2023-06-27-roberta_embeddings_bertin_roberta_base_spanish_es * Add model 2023-06-27-roberta_embeddings_bertin_base_gaussian_es * Add model 2023-06-27-roberta_embeddings_bertin_base_random_exp_512seqlen_es * Add model 2023-06-27-roberta_embeddings_RuPERTa_base_es * Add model 2023-06-27-roberta_embeddings_roberta_base_bne_es * Add model 2023-06-27-roberta_embeddings_bertin_base_stepwise_exp_512seqlen_es * Add model 2023-06-27-roberta_embeddings_MedRoBERTa.nl_nl * Add model 2023-06-27-roberta_embeddings_bertin_base_random_es * Add model 2023-06-27-roberta_embeddings_RoBERTalex_es * Add model 2023-06-27-roberta_embeddings_SecRoBERTa_en * Add model 2023-06-27-roberta_embeddings_KanBERTo_kn * Add model 2023-06-27-roberta_embeddings_distilroberta_base_finetuned_jira_qt_issue_title_en * Add model 2023-06-27-roberta_embeddings_MedRoBERTa.nl_nl * Add model 2023-06-27-roberta_embeddings_distilroberta_base_finetuned_jira_qt_issue_titles_and_bodies_en * Add model 2023-06-27-roberta_embeddings_bertin_base_stepwise_es * Add model 2023-06-27-roberta_embeddings_KanBERTo_kn * Add model 2023-06-27-roberta_embeddings_bertin_base_gaussian_exp_512seqlen_es * Add model 2023-06-27-roberta_embeddings_mlm_spanish_roberta_base_es * Add model 2023-06-27-roberta_embeddings_KNUBert_kn * Add model 2023-06-27-roberta_embeddings_javanese_roberta_small_jv * Add model 2023-06-27-roberta_embeddings_indonesian_roberta_base_id * Add model 2023-06-27-roberta_embeddings_indic_transformers_hi_roberta_hi * Add model 2023-06-27-roberta_embeddings_indo_roberta_small_id * Add model 2023-06-27-roberta_embeddings_fairlex_scotus_minilm_en * Add model 2023-06-27-roberta_embeddings_indic_transformers_te_roberta_te * Add model 2023-06-27-roberta_embeddings_javanese_roberta_small_imdb_jv * Add model 2023-06-27-roberta_embeddings_jurisbert_es * Add model 2023-06-27-roberta_embeddings_roberta_base_indonesian_522M_id * Add model 2023-06-27-roberta_embeddings_fairlex_ecthr_minilm_en * Add model 2023-06-27-roberta_embeddings_muppet_roberta_base_en --------- Co-authored-by: ahmedlone127 <[email protected]> * Add model 2023-06-29-xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx (#13872) Co-authored-by: Damla-Gurbaz <[email protected]> * 2023-06-08-instructor_base_en (#13850) * Add model 2023-06-08-instructor_base_en * Update 2023-06-08-instructor_base_en.md * Add model 2023-06-21-e5_base_v2_en * Add model 2023-06-21-e5_base_en * Add model 2023-06-21-e5_large_v2_en * Add model 2023-06-21-e5_large_en * Add model 2023-06-21-e5_small_v2_en * Add model 2023-06-21-e5_small_en * Add model 2023-06-21-instructor_large_en --------- Co-authored-by: prabod <[email protected]> Co-authored-by: Maziyar Panahi <[email protected]> * 2023-06-28-roberta_base_en (#13871) * Add model 2023-06-28-roberta_base_en * Add model 2023-06-28-roberta_base_opt_en * Add model 2023-06-28-roberta_base_quantized_en * Add model 2023-06-28-small_bert_L2_768_en * Add model 2023-06-28-small_bert_L2_768_opt_en * Add model 2023-06-28-small_bert_L2_768_quantized_en * Add model 2023-06-28-distilbert_base_cased_en * Add model 2023-06-28-distilbert_base_cased_opt_en * Add model 2023-06-28-distilbert_base_cased_quantized_en * Add model 2023-06-28-deberta_v3_base_en * Add model 2023-06-28-deberta_v3_base_opt_en * Add model 2023-06-28-deberta_v3_base_quantized_en * Add model 2023-06-28-distilbert_base_uncased_en * Add model 2023-06-28-distilbert_base_uncased_opt_en * Add model 2023-06-28-distilbert_base_uncased_quantized_en * Add model 2023-06-28-distilbert_base_multilingual_cased_xx * Add model 2023-06-28-distilbert_base_multilingual_cased_xx * Add model 2023-06-28-distilbert_base_multilingual_cased_opt_xx * Add model 2023-06-28-distilbert_base_multilingual_cased_quantized_xx * Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_de * Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_opt_de * Add model 2023-06-28-distilbert_embeddings_distilbert_base_german_cased_quantized_de * Add model 2023-06-29-bert_base_cased_en * Add model 2023-06-29-bert_base_cased_opt_en * Add model 2023-06-29-bert_base_cased_quantized_en --------- Co-authored-by: ahmedlone127 <[email protected]> --------- Co-authored-by: jsl-models <[email protected]> Co-authored-by: Naveen-004 <[email protected]> Co-authored-by: ahmedlone127 <[email protected]> Co-authored-by: prabod <[email protected]> Co-authored-by: Mary-Sci <[email protected]> Co-authored-by: Merve Ertas Uslu <[email protected]> Co-authored-by: Damla-Gurbaz <[email protected]>
1 parent d732eaa commit 179e4df

File tree

246 files changed

+34769
-0
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

246 files changed

+34769
-0
lines changed
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,100 @@
1+
---
2+
layout: model
3+
title: Multilingual XLMRoBerta Embeddings Cased Model
4+
author: John Snow Labs
5+
name: xlmroberta_embeddings_paraphrase_mpnet_base_v2
6+
date: 2023-06-29
7+
tags: [xx, embeddings, xlmroberta, open_source, transformer, tensorflow]
8+
task: Embeddings
9+
language: xx
10+
edition: Spark NLP 4.4.4
11+
spark_version: 3.0
12+
supported: true
13+
engine: tensorflow
14+
annotator: XlmRoBertaEmbeddings
15+
article_header:
16+
type: cover
17+
use_language_switcher: "Python-Scala-Java"
18+
---
19+
20+
## Description
21+
22+
Pretrained XLMRoberta Embeddings model is a multilingual embedding model adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.
23+
24+
## Predicted Entities
25+
26+
27+
28+
{:.btn-box}
29+
<button class="button button-orange" disabled>Live Demo</button>
30+
<button class="button button-orange" disabled>Open in Colab</button>
31+
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx_4.4.4_3.0_1688073546075.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
32+
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/xlmroberta_embeddings_paraphrase_mpnet_base_v2_xx_4.4.4_3.0_1688073546075.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
33+
34+
## How to use
35+
36+
37+
38+
<div class="tabs-box" markdown="1">
39+
{% include programmingLanguageSelectScalaPythonNLU.html %}
40+
```python
41+
documentAssembler = DocumentAssembler() \
42+
.setInputCol("text") \
43+
.setOutputCol("document")
44+
45+
tokenizer = Tokenizer() \
46+
.setInputCols("document") \
47+
.setOutputCol("token")
48+
49+
embeddings = XlmRoBertaEmbeddings.pretrained("xlmroberta_embeddings_paraphrase_mpnet_base_v2","xx") \
50+
.setInputCols(["document", "token"]) \
51+
.setOutputCol("embeddings") \
52+
.setCaseSensitive(True)
53+
54+
pipeline = Pipeline(stages=[documentAssembler,
55+
tokenizer,
56+
embeddings])
57+
58+
data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")
59+
result = pipeline.fit(data).transform(data)
60+
```
61+
```scala
62+
val documentAssembler = new DocumentAssembler()
63+
.setInputCol("text")
64+
.setOutputCol("document")
65+
66+
val tokenizer = new Tokenizer()
67+
.setInputCols("document")
68+
.setOutputCol("token")
69+
70+
val embeddings = XlmRoBertaEmbeddings.pretrained("xlmroberta_embeddings_paraphrase_mpnet_base_v2", "xx")
71+
.setInputCols(Array("document", "token"))
72+
.setOutputCol("embeddings")
73+
74+
val pipeline = new Pipeline().setStages(Array(documentAssembler,
75+
tokenizer,
76+
embeddings))
77+
78+
val data = Seq("I love Spark NLP").toDS.toDF("text")
79+
val result = pipeline.fit(data).transform(data)
80+
```
81+
</div>
82+
83+
{:.model-param}
84+
## Model Information
85+
86+
{:.table-model}
87+
|---|---|
88+
|Model Name:|xlmroberta_embeddings_paraphrase_mpnet_base_v2|
89+
|Compatibility:|Spark NLP 4.4.4+|
90+
|License:|Open Source|
91+
|Edition:|Official|
92+
|Input Labels:|[sentence, token]|
93+
|Output Labels:|[embeddings]|
94+
|Language:|xx|
95+
|Size:|1.0 GB|
96+
|Case sensitive:|true|
97+
98+
## References
99+
100+
https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,97 @@
1+
---
2+
layout: model
3+
title: English Legal Longformer Base Embeddings Model
4+
author: John Snow Labs
5+
name: longformer_base_english_legal
6+
date: 2023-05-28
7+
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
8+
task: Embeddings
9+
language: en
10+
edition: Spark NLP 4.4.2
11+
spark_version: 3.0
12+
supported: true
13+
engine: tensorflow
14+
annotator: LongformerEmbeddings
15+
article_header:
16+
type: cover
17+
use_language_switcher: "Python-Scala-Java"
18+
---
19+
20+
## Description
21+
22+
Pretrained Legal Longformer Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-longformer-base` is a English model originally trained by `lexlms`.
23+
24+
{:.btn-box}
25+
<button class="button button-orange" disabled>Live Demo</button>
26+
<button class="button button-orange" disabled>Open in Colab</button>
27+
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/longformer_base_english_legal_en_4.4.2_3.0_1685282124579.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
28+
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/longformer_base_english_legal_en_4.4.2_3.0_1685282124579.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
29+
30+
## How to use
31+
32+
33+
34+
<div class="tabs-box" markdown="1">
35+
{% include programmingLanguageSelectScalaPythonNLU.html %}
36+
37+
```python
38+
documentAssembler = DocumentAssembler() \
39+
.setInputCols("text") \
40+
.setOutputCols("document")
41+
42+
tokenizer = Tokenizer() \
43+
.setInputCols("document") \
44+
.setOutputCol("token")
45+
46+
embeddings = LongformerEmbeddings.pretrained("longformer_base_english_legal","en") \
47+
.setInputCols(["document", "token"]) \
48+
.setOutputCol("embeddings") \
49+
.setCaseSensitive(True)
50+
51+
pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])
52+
53+
data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")
54+
55+
result = pipeline.fit(data).transform(data)
56+
```
57+
```scala
58+
val documentAssembler = new DocumentAssembler()
59+
.setInputCols(Array("text"))
60+
.setOutputCols(Array("document"))
61+
62+
val tokenizer = new Tokenizer()
63+
.setInputCols("document")
64+
.setOutputCol("token")
65+
66+
val embeddings = LongformerEmbeddings.pretrained("longformer_base_english_legal","en")
67+
.setInputCols(Array("document", "token"))
68+
.setOutputCol("embeddings")
69+
.setCaseSensitive(True)
70+
71+
val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))
72+
73+
val data = Seq("I love Spark NLP").toDS.toDF("text")
74+
75+
val result = pipeline.fit(data).transform(data)
76+
```
77+
</div>
78+
79+
{:.model-param}
80+
## Model Information
81+
82+
{:.table-model}
83+
|---|---|
84+
|Model Name:|longformer_base_english_legal|
85+
|Compatibility:|Spark NLP 4.4.2+|
86+
|License:|Open Source|
87+
|Edition:|Official|
88+
|Input Labels:|[sentence, token]|
89+
|Output Labels:|[embeddings]|
90+
|Language:|en|
91+
|Size:|561.6 MB|
92+
|Case sensitive:|true|
93+
|Max sentence length:|4096|
94+
95+
## References
96+
97+
https://huggingface.co/lexlms/legal-longformer-base
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,97 @@
1+
---
2+
layout: model
3+
title: English Legal Longformer Large Embeddings Model
4+
author: John Snow Labs
5+
name: longformer_large_english_legal
6+
date: 2023-05-28
7+
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
8+
task: Embeddings
9+
language: en
10+
edition: Spark NLP 4.4.2
11+
spark_version: 3.0
12+
supported: true
13+
engine: tensorflow
14+
annotator: LongformerEmbeddings
15+
article_header:
16+
type: cover
17+
use_language_switcher: "Python-Scala-Java"
18+
---
19+
20+
## Description
21+
22+
Pretrained Legal Longformer Large Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-longformer-large` is a English model originally trained by `lexlms`.
23+
24+
{:.btn-box}
25+
<button class="button button-orange" disabled>Live Demo</button>
26+
<button class="button button-orange" disabled>Open in Colab</button>
27+
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/longformer_large_english_legal_en_4.4.2_3.0_1685289330980.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
28+
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/longformer_large_english_legal_en_4.4.2_3.0_1685289330980.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
29+
30+
## How to use
31+
32+
33+
34+
<div class="tabs-box" markdown="1">
35+
{% include programmingLanguageSelectScalaPythonNLU.html %}
36+
37+
```python
38+
documentAssembler = DocumentAssembler() \
39+
.setInputCols("text") \
40+
.setOutputCols("document")
41+
42+
tokenizer = Tokenizer() \
43+
.setInputCols("document") \
44+
.setOutputCol("token")
45+
46+
embeddings = LongformerEmbeddings.pretrained("longformer_large_english_legal","en") \
47+
.setInputCols(["document", "token"]) \
48+
.setOutputCol("embeddings") \
49+
.setCaseSensitive(True)
50+
51+
pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])
52+
53+
data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")
54+
55+
result = pipeline.fit(data).transform(data)
56+
```
57+
```scala
58+
val documentAssembler = new DocumentAssembler()
59+
.setInputCols(Array("text"))
60+
.setOutputCols(Array("document"))
61+
62+
val tokenizer = new Tokenizer()
63+
.setInputCols("document")
64+
.setOutputCol("token")
65+
66+
val embeddings = LongformerEmbeddings.pretrained("longformer_large_english_legal","en")
67+
.setInputCols(Array("document", "token"))
68+
.setOutputCol("embeddings")
69+
.setCaseSensitive(True)
70+
71+
val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))
72+
73+
val data = Seq("I love Spark NLP").toDS.toDF("text")
74+
75+
val result = pipeline.fit(data).transform(data)
76+
```
77+
</div>
78+
79+
{:.model-param}
80+
## Model Information
81+
82+
{:.table-model}
83+
|---|---|
84+
|Model Name:|longformer_large_english_legal|
85+
|Compatibility:|Spark NLP 4.4.2+|
86+
|License:|Open Source|
87+
|Edition:|Official|
88+
|Input Labels:|[sentence, token]|
89+
|Output Labels:|[embeddings]|
90+
|Language:|en|
91+
|Size:|1.6 GB|
92+
|Case sensitive:|true|
93+
|Max sentence length:|4096|
94+
95+
## References
96+
97+
https://huggingface.co/lexlms/legal-longformer-large
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,97 @@
1+
---
2+
layout: model
3+
title: English Legal XLM-Longformer Base Embeddings Model
4+
author: John Snow Labs
5+
name: xlm_longformer_base_english_legal
6+
date: 2023-05-28
7+
tags: [en, longformerformaskedlm, transformer, open_source, legal, tensorflow]
8+
task: Embeddings
9+
language: en
10+
edition: Spark NLP 4.4.2
11+
spark_version: 3.0
12+
supported: true
13+
engine: tensorflow
14+
annotator: LongformerEmbeddings
15+
article_header:
16+
type: cover
17+
use_language_switcher: "Python-Scala-Java"
18+
---
19+
20+
## Description
21+
22+
Pretrained Legal XLM-Longformer Embeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `legal-xlm-longformer-base` is a English model originally trained by `joelito`.
23+
24+
{:.btn-box}
25+
<button class="button button-orange" disabled>Live Demo</button>
26+
<button class="button button-orange" disabled>Open in Colab</button>
27+
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/xlm_longformer_base_english_legal_en_4.4.2_3.0_1685286936656.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
28+
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/xlm_longformer_base_english_legal_en_4.4.2_3.0_1685286936656.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
29+
30+
## How to use
31+
32+
33+
34+
<div class="tabs-box" markdown="1">
35+
{% include programmingLanguageSelectScalaPythonNLU.html %}
36+
37+
```python
38+
documentAssembler = DocumentAssembler() \
39+
.setInputCols("text") \
40+
.setOutputCols("document")
41+
42+
tokenizer = Tokenizer() \
43+
.setInputCols("document") \
44+
.setOutputCol("token")
45+
46+
embeddings = LongformerEmbeddings.pretrained("xlm_longformer_base_english_legal","en") \
47+
.setInputCols(["document", "token"]) \
48+
.setOutputCol("embeddings") \
49+
.setCaseSensitive(True)
50+
51+
pipeline = Pipeline(stages=[documentAssembler, tokenizer, embeddings])
52+
53+
data = spark.createDataFrame([["I love Spark NLP"]]).toDF("text")
54+
55+
result = pipeline.fit(data).transform(data)
56+
```
57+
```scala
58+
val documentAssembler = new DocumentAssembler()
59+
.setInputCols(Array("text"))
60+
.setOutputCols(Array("document"))
61+
62+
val tokenizer = new Tokenizer()
63+
.setInputCols("document")
64+
.setOutputCol("token")
65+
66+
val embeddings = LongformerEmbeddings.pretrained("xlm_longformer_base_english_legal","en")
67+
.setInputCols(Array("document", "token"))
68+
.setOutputCol("embeddings")
69+
.setCaseSensitive(True)
70+
71+
val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))
72+
73+
val data = Seq("I love Spark NLP").toDS.toDF("text")
74+
75+
val result = pipeline.fit(data).transform(data)
76+
```
77+
</div>
78+
79+
{:.model-param}
80+
## Model Information
81+
82+
{:.table-model}
83+
|---|---|
84+
|Model Name:|xlm_longformer_base_english_legal|
85+
|Compatibility:|Spark NLP 4.4.2+|
86+
|License:|Open Source|
87+
|Edition:|Official|
88+
|Input Labels:|[sentence, token]|
89+
|Output Labels:|[embeddings]|
90+
|Language:|en|
91+
|Size:|788.6 MB|
92+
|Case sensitive:|true|
93+
|Max sentence length:|4096|
94+
95+
## References
96+
97+
https://huggingface.co/joelito/legal-xlm-longformer-base

0 commit comments

Comments
 (0)