数据科学是关于数据的。网络上有各种来源可以为您的数据分析或机器学习项目获取数据。最受欢迎的来源之一是 Kaggle,我相信我们每个人都必须在我们的数据旅程中使用它。 最近,我遇到了一个新的来源来为我的 NLP …
最受欢迎的来源之一是 Kaggle,我相信我们每个人都必须在我们的数据旅程中使用它。 最近,我遇到了一个新的来源来为我的 NLP 项目获取数据,我很想谈谈它。这是 Hugging Face 的数据集库,一个快速高效的库,可以轻松共享和加载数据集和评估指标。因此,如果您从事自然语言理解 (NLP) 工作并希望为下一个项目提供数据...
Huggingface Datasets – A Python library for loading NLP datasets Link:https://github.com/huggingface/datasets A tool that makes NLP datasets directly available in Python IBM’s Data Asset Exchange – A collection datasets relevant for enterprise applications Link:https://developer.ibm.com/exchanges/...
That being said, the following list is what we recommend as some of the best open-source datasets to start learning NLP, or you can try out the various models and follow those steps. 1.Quora Question Insincerity Dataset This dataset is pretty fun. In this NLP Challenge on Kaggle, we are...
数据科学是关于数据的。网络上有各种来源可以为您的数据分析或机器学习项目获取数据。最受欢迎的来源之一是 Kaggle,我相信我们每个人都必须在我们的数据旅程中使用它。 最近,我遇到了一个新的来源来为我的 NLP 项目获取数据,我很想谈谈它。这是 Hugging Face 的数据集库,一个快速高效的库,可以轻松共享和加载数据...
kaggle:https://www.kaggle.com 天池:https://tianchi.aliyun.com/dataset 飞桨:https://aistudio.baidu.com/aistudio/datasetoverview 讯飞:http://challenge.xfyun.cn/ 搜狗实验室:http://www.sogou.com/labs/resource/list_pingce.php DC竞赛:https://www.pkb...
This dataset can be downloaded from Kaggle, or loaded from Keras: tf.keras.datasets.fashion_mnist.load_data() Fashion-MNIST images 5. IMDB The IMDB dataset is commonly used for sentiment analysis tasks, where the goal is to classify the reviews as positive or negative based on their con...
POS/NER/Chunk annotated data Open Museums Twitter NLP Tools. Contribute to aritter/twitter_nlp development by creating an account on GitHub. PRAD-CA-Prostate-Adenocarcinoma-Canada Open Healthcare The ICGC Data Portal provides tools for visualizing, querying and downloading the data released quarterly ...
DrivenData Competitions for Social Good [Meta] ICWSM Data Challenge (since 2009) [Meta] KDD Cup by Tencent 2012 [Meta] Kaggle Competition Data [Meta] Localytics Data Visualization Challenge [Meta] Netflix Prize [Meta] Space Apps Challenge [Meta] Telecom Italia Big Data Challenge [Meta] Travis...
Colab Compatible FastAI notebooks for NLP and Computer Vision Datasets nlp classifier computer-vision embeddings kaggle jokes transfer-learning recommender-systems movie-reviews fastai smote collab colab-notebook google-colab-notebook colab-fastai computer-vision-datasets Updated May 26, 2022 Jupyter Note...