tags - Tagging and Training NER dataset -


i have data set , want tag named entity recognition. dataset in persian. want know how should tag expressions :

*** آقای مهدی کاظمی = mr mehdi kazemi / mr smith. >>> (names titles) should tag person or first name , last name should tagged? (i mean should tag "mr")

mr >> b_per || mr >> o

mehdi >> i_per || mehdi >> b_per

kazemi >> i_per || kazemi >> i_per

*** بیمارستان نور = noor hospital >>> should tag name or name , hospital both named entity?

*** eiffel tower / ministry of defense (i mean dod example) >>> in persian called : وزارت دفاع (vezarate defa) should tag defense ? or together?

there many more examples schools, movies, cities, countries and.... since use entity class before named entity.

i appreciate if can me tagging dataset.

i'll give examples conll 2003 training data:

"mr." not tagged part of person, titles ignored.

"columbia presbyterian hospital" tagged (loc, loc, loc)

"a new york hospital" (o, loc, loc, o)

"ministry of commerce" (org, org, org)

i think "eiffel tower" should (loc, loc)


Comments

Popular posts from this blog

javascript - Clear button on addentry page doesn't work -

c# - Selenium Authentication Popup preventing driver close or quit -

tensorflow when input_data MNIST_data , zlib.error: Error -3 while decompressing: invalid block type -