Text-Annotation

One basic unit of metadata for text processing is the annotation. Every annotation has a type (string), a begin offset (int), and an end offset (int). The begin and end offsets give the position of the annotation in the original text. Specific annotation types may also carry additional attributes. For example, the annotation `<type=PERSON, begin=0, end=4, gender=MALE>` may be associated with the string "John loves Mary", indicating that the first four characters of the string (offsets 0 up to, but not including, 4) represent a PERSON, with the additional feature "gender=MALE".
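The description above can be sketched as a small data class. This is only an illustrative sketch, not the code in this repository's `Annotation.py`: the field names, the `attributes` dictionary, and the `covered_text` helper are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class Annotation:
    """Hypothetical annotation: a typed span over the original text."""
    type: str
    begin: int  # offset of the first covered character
    end: int    # offset just past the last covered character (exclusive)
    attributes: Dict[str, Any] = field(default_factory=dict)

    def covered_text(self, text: str) -> str:
        # Slice the original text with the annotation's offsets.
        return text[self.begin:self.end]


text = "John loves Mary"
person = Annotation("PERSON", 0, 4, {"gender": "MALE"})
print(person.covered_text(text))  # John
```

Treating the end offset as exclusive matches the example in the text (begin=0, end=4 covers four characters) and lines up with Python slicing.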

An annotated string contains a text string (the original raw data) and a (possibly empty) set of all annotations produced for that text string. In the UIMA framework, the set of annotations for a text is referred to as the annotation index. Annotators may read the annotation index to examine pre-existing annotations. The index may be accessed by type, which returns all existing annotations of the given type, or by span (begin, end), which returns all annotations of any type that are contained in the given span.
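A minimal sketch of the two access paths (by type and by span) might look like the following. The class names `SimpleAnnotationIndex` and `SimpleAnnotatedString`, their methods, and the `VERB` type are hypothetical and chosen for illustration; they are not the API of this repository's `AnnotationIndex.py` or `AnnotatedString.py`, nor of UIMA. The sketch assumes "contained in the span" means the annotation lies entirely within it.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Annotation:
    type: str
    begin: int
    end: int  # exclusive
    attributes: Dict[str, Any] = field(default_factory=dict)


class SimpleAnnotationIndex:
    """Hypothetical list-backed index over a text's annotations."""

    def __init__(self) -> None:
        self._annotations: List[Annotation] = []

    def add(self, annotation: Annotation) -> None:
        self._annotations.append(annotation)

    def by_type(self, type_name: str) -> List[Annotation]:
        # All annotations of the given type.
        return [a for a in self._annotations if a.type == type_name]

    def by_span(self, begin: int, end: int) -> List[Annotation]:
        # Annotations of any type lying entirely inside [begin, end).
        return [a for a in self._annotations
                if begin <= a.begin and a.end <= end]


@dataclass
class SimpleAnnotatedString:
    """The raw text plus its (possibly empty) annotation index."""
    text: str
    index: SimpleAnnotationIndex = field(default_factory=SimpleAnnotationIndex)


doc = SimpleAnnotatedString("John loves Mary")
doc.index.add(Annotation("PERSON", 0, 4, {"gender": "MALE"}))
doc.index.add(Annotation("VERB", 5, 10))
doc.index.add(Annotation("PERSON", 11, 15))

people = doc.index.by_type("PERSON")     # the two PERSON annotations
first_half = doc.index.by_span(0, 10)    # PERSON(0, 4) and VERB(5, 10)
```

A linear scan keeps the sketch short; an index over long documents would more likely keep annotations sorted by offset so that span queries avoid visiting every entry.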
