Table 1 N-Gram description

From: Comparison of named entity recognition methodologies in biomedical documents

Feature    Description
Unigram    \( w_{i-2},\ w_{i-1},\ w_{i},\ w_{i+1},\ w_{i+2} \)
Bigram     \( w_{i-2}|w_{i-1},\ w_{i-1}|w_{i},\ w_{i}|w_{i+1},\ w_{i+1}|w_{i+2} \)
Trigram    \( w_{i-2}|w_{i-1}|w_{i},\ w_{i-1}|w_{i}|w_{i+1},\ w_{i}|w_{i+1}|w_{i+2} \)
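
As an illustration of how the features in Table 1 could be computed for the token at position i, here is a minimal Python sketch. It is not taken from the paper: the function name `ngram_features`, the feature-key format, and the `<PAD>` token used for positions outside the sentence are all assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical padding token for window positions outside the sentence;
# the table does not say how sentence boundaries are handled.
PAD = "<PAD>"


def ngram_features(tokens: List[str], i: int) -> Dict[str, str]:
    """Build unigram, bigram, and trigram features in a +/-2 window around token i."""

    def w(offset: int) -> str:
        # Return the token at position i + offset, or PAD if out of range.
        j = i + offset
        return tokens[j] if 0 <= j < len(tokens) else PAD

    feats: Dict[str, str] = {}

    # Unigrams: w[i-2], w[i-1], w[i], w[i+1], w[i+2]
    for k in range(-2, 3):
        feats[f"w[{k}]"] = w(k)

    # Bigrams: the four adjacent pairs inside the window
    for k in range(-2, 2):
        feats[f"w[{k}]|w[{k + 1}]"] = f"{w(k)}|{w(k + 1)}"

    # Trigrams: the three adjacent triples inside the window
    for k in range(-2, 1):
        feats[f"w[{k}]|w[{k + 1}]|w[{k + 2}]"] = "|".join((w(k), w(k + 1), w(k + 2)))

    return feats


if __name__ == "__main__":
    sentence = ["the", "BRCA1", "gene", "is", "mutated"]
    for name, value in ngram_features(sentence, 2).items():
        print(name, "=", value)
```

Each token yields 12 features (5 unigrams, 4 bigrams, 3 trigrams). Offsets in the keys are relative to i, so `w[-1]|w[0]` corresponds to the bigram \( w_{i-1}|w_{i} \) in the table.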