tags - Tagging and Training NER dataset -
i have data set , want tag named entity recognition. dataset in persian. want know how should tag expressions :
*** آقای مهدی کاظمی = mr mehdi kazemi / mr smith. >>> (names titles) should tag person or first name , last name should tagged? (i mean should tag "mr")
mr >> b_per || mr >> o
mehdi >> i_per || mehdi >> b_per
kazemi >> i_per || kazemi >> i_per
*** بیمارستان نور = noor hospital >>> should tag name or name , hospital both named entity?
*** eiffel tower / ministry of defense (i mean dod example) >>> in persian called : وزارت دفاع (vezarate defa) should tag defense ? or together?
there many more examples schools, movies, cities, countries and.... since use entity class before named entity.
i appreciate if can me tagging dataset.
i'll give examples conll 2003 training data:
"mr." not tagged part of person, titles ignored.
"columbia presbyterian hospital" tagged (loc, loc, loc)
"a new york hospital" (o, loc, loc, o)
"ministry of commerce" (org, org, org)
i think "eiffel tower" should (loc, loc)
Comments
Post a Comment