Recently, [1] presented the results of Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition. According to this paper, most of researchers used CRF. However, several researchers in this workshop described new methods, such as [...
Named entity recognitionVietnamese spoken text processingMobile virtual assistantNamed entity recognition (NER) for written documents has been studied intensively during the past decades. However, NER for spoken texts is still at its early stage. There are several challenges behind this: spoken texts ...
- We prepare a ready-to-use Vietnamese X-ray image dataset annotated with 13 symptoms of tuberculosis. - We design and evaluate the convolutional neural network model to diagnose tuberculosis. - We visualize the predicted results and analyze them in comparison with the locations of tuberculosis ...
A named entity recognition dataset for Vietnamese with 10 newly-defined entity types in the context of the COVID-19 pandemic. Data is extracted from news articles and manually annotated. In total, there are 34 984 entities over 10 027 sentences. Training set: 5027 sentences Development set: ...
Vietnamese end-to-end speech recognition using wav2vec 2.0Model descriptionOur models are pre-trained on 13k hours of Vietnamese youtube audio (un-label data) and fine-tuned on 250 hours labeled of VLSP ASR dataset on 16kHz sampled speech audio....
Vietnamese Named Entity Recognition using Token Regular Expressions and Bidirectional Inference, Phuong Le-Hong, Proceedings of Vietnamese Speech and Language Processing (VLSP), Hanoi, Vietnam, 2016. As thetagmodule, thenermodule is also an Apache Spark application, you run it by submitting the ...
Named entity recognitionVietnamese spoken text processingMobile virtual assistantNamed entity recognition (NER) for written documents has been studied intensively during the past decades. However, NER for spoken texts is still at its early stage. There are several challenges behind this: spoken texts ...
The article has applied artificial intelligence algorithms in asset pricing through text descriptions of the assets in Vietnamese. The proposed method uses Named Entity Recognition technique with a Recurrent Neural Network model in combination with Conditional Random Field model to extract asset features,...
We also don't apply any named entity recognition mechanisms within the tokenizer and have few rare cases where we fail to solve ambiguity correctly. We thus didn't want to provide exact quality comparison results as probably the goals and potential use cases of this library and of those simila...
The experimental results in the recognition module show that our system achieves 2.94% of character error rate (CER), which helps improve CER of another previous approach on the VNOnDB-Word dataset of the Vietnamese online handwritten text recognition competition (HANDS-VNOnDB or VOHTR2018). ...