Kurdipedia is the largest multilingual sources for Kurdish information!
About Kurdipedia
Kurdipedia Archivists
 Search
 Send
 Tools
 Languages
 My account
 Search for
 Appearance
  Dark Mode
 Default settings
 Search
 Send
 Tools
 Languages
 My account
        
 kurdipedia.org 2008 - 2026
Library
 
Send
   Advanced Search
Contact
کوردیی ناوەند
Kurmancî
کرمانجی
هەورامی
English
Français
Deutsch
عربي
فارسی
Türkçe
עברית

 More...
 More...
 
 Dark Mode
 Slide Bar
 Font Size


 Default settings
About Kurdipedia
Random item!
Terms of Use
Kurdipedia Archivists
Your feedback
User Favorites
Chronology of events
 Activities - Kurdipedia
Help
 More
 Kurdish names
 Search Click
Statistics
Articles
  586,161
Images
  124,416
Books
  22,121
Related files
  126,534
Video
  2,193
Language
کوردیی ناوەڕاست - Central Kurdish 
317,317
Kurmancî - Upper Kurdish (Latin) 
95,685
هەورامی - Kurdish Hawrami 
67,750
عربي - Arabic 
44,095
کرمانجی - Upper Kurdish (Arami) 
26,711
فارسی - Farsi 
15,883
English - English 
8,533
Türkçe - Turkish 
3,836
Deutsch - German 
2,037
لوڕی - Kurdish Luri 
1,785
Pусский - Russian 
1,145
Français - French 
359
Nederlands - Dutch 
131
Zazakî - Kurdish Zazaki 
92
Svenska - Swedish 
79
Español - Spanish 
61
Italiano - Italian 
61
Polski - Polish 
60
Հայերեն - Armenian 
57
لەکی - Kurdish Laki 
39
Azərbaycanca - Azerbaijani 
35
日本人 - Japanese 
24
Norsk - Norwegian 
22
中国的 - Chinese 
21
עברית - Hebrew 
20
Ελληνική - Greek 
19
Fins - Finnish 
14
Português - Portuguese 
14
Catalana - Catalana 
14
Esperanto - Esperanto 
10
Ozbek - Uzbek 
9
Тоҷикӣ - Tajik 
9
Srpski - Serbian 
6
ქართველი - Georgian 
6
Čeština - Czech 
5
Lietuvių - Lithuanian 
5
Hrvatski - Croatian 
5
балгарская - Bulgarian 
4
Kiswahili سَوَاحِلي -  
3
हिन्दी - Hindi 
2
Cebuano - Cebuano 
1
қазақ - Kazakh 
1
ترکمانی - Turkman (Arami Script) 
1
Group
English
Biography 
3,197
Places 
9
Parties & Organizations 
36
Publications (magazines, newspapers, websites and media, etc.) 
50
Miscellaneous 
4
Image and Description 
78
Artworks 
17
Dates & Events 
1
Maps 
26
Quotes 
1
Archaeological places 
44
Library 
2,164
Articles 
2,538
Martyrs 
65
Genocide 
21
Documents 
251
Clan - the tribe - the sect 
18
Statistics and Surveys 
5
Video 
2
Environment of Kurdistan 
1
Poem 
2
Womens Issues 
1
Offices 
2
Repository
MP3 
1,499
PDF 
34,764
MP4 
3,993
IMG 
234,717
∑   Total 
274,973
Content search
Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish
Group: Articles
Articles language: English
Kurdipedia guarantees the right to public information for every Kurdish individual!
Share
Copy Link0
E-Mail0
Facebook0
LinkedIn0
Messenger0
Pinterest0
SMS0
Telegram0
Twitter0
Viber0
WhatsApp0
Ranking item
Excellent
Very good
Average
Poor
Bad
Add to my favorites
Write your comment about this item!
Items history
Metadata
RSS
Search in Google for images related to the selected item!
Search in Google for selected item!
کوردیی ناوەڕاست - Central Kurdish0
Kurmancî - Upper Kurdish (Latin)0
عربي - Arabic0
فارسی - Farsi0
Türkçe - Turkish0
עברית - Hebrew0
Deutsch - German0
Español - Spanish0
Français - French0
Italiano - Italian0
Nederlands - Dutch0
Svenska - Swedish0
Ελληνική - Greek0
Azərbaycanca - Azerbaijani0
Catalana - Catalana0
Čeština - Czech0
Esperanto - Esperanto0
Fins - Finnish0
Hrvatski - Croatian0
Lietuvių - Lithuanian0
Norsk - Norwegian0
Ozbek - Uzbek0
Polski - Polish0
Português - Portuguese0
Pусский - Russian0
Srpski - Serbian0
балгарская - Bulgarian0
қазақ - Kazakh0
Тоҷикӣ - Tajik0
Հայերեն - Armenian0
हिन्दी - Hindi0
ქართველი - Georgian0
中国的 - Chinese0
日本人 - Japanese0
Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish
Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish
Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish.
Roshna Omer Abdulrahman, Hossein Hassani, Sina Ahmadi.
2019.
Kurdish is a less-resourced language consisting of different dialects written in various scripts. Approximately 30 million people in different countries speak the language. The lack of corpora is one of the main obstacles in Kurdish language processing. In this paper, we present KTC-the Kurdish Textbooks Corpus, which is composed of 31 K-12 textbooks in Sorani dialect. The corpus is normalized and categorized into 12 educational subjects containing 693,800 tokens (110,297 types). Our resource is publicly available for non-commercial use under the CC BY-NC-SA 4.0 license. [1]
=KTML_Link_External_Begin=https://www.kurdipedia.org/docviewer.aspx?id=445060&document=0001.PDF=KTML_Link_External_Between= Click to read the article: Developing a Fine-grained Corpus for a Less-resourced Language: the case of Kurdish=KTML_Link_External_End=

Kurdipedia is not responsible for the content of this item. We recorded it for archival purposes.
This item has been viewed 936 times
Write your comment about this item!
HashTag
Sources
[1] Website | English | academia.edu
Related files: 1
Linked items: 1
Group: Articles
Articles language: English
Content category: Linguistic
Country - Province: Kurdistan
Document Type: Original language
Language - Dialect: English
Publication Type: Born-digital
Technical Metadata
Item Quality: 99%
99%
Added by ( Rapar Osman Uzery ) on 13-11-2022
This article has been reviewed and released by ( Ziryan Serchinari ) on 14-11-2022
This item recently updated by ( Rozhgar Kerkuki ) on: 17-08-2024
Title
This item according to Kurdipedia's Standards is not finalized yet!
This item has been viewed 936 times
QR Code
  New Item
  Random item! 
  Exclusively for women 
  
  Kurdipedia's Publication 

Kurdipedia.org (2008 - 2026) version: 17.17
| Contact | CSS3 | HTML5

| Page generation time: 0.329 second(s)!