Data Base “Languages of the World”




DATA BASE “LANGUAGES OF THE WORLD”

  • DB JM SOFTWARE SURVEY: 2010
  • Vladimir Polyakov
  • (Institute of Linguistics of RAS)

Software Products related to DB JM

  • Versions of Data Base
      • DOS Version
      • Windows Version
      • Web Version (www.dblang.ru)
      • Includes comparison of two languages as a function
  • Quantitative And Other Research Products
      • Similarity – software for similarity measure calculations
      • LangFam – software for calculating language family portraits, revealing genetic markers, filtering rare features, investigating typological shift, etc.
      • Special software for modeling of evolution
      • Special software for clustering tasks
      • Special software for phylogeny with different metrics of feature space
      • BiCoTree – software for easy tree building on the DB
      • Some other research programs, developed for different aims during individual investigations in areal, historical and typological linguistics (Gusareva, Loginova, Faskhutdinov, Omlin, Polyakov, Solovyev)
  • Reference and Educational Products
      • Living Diagrams – reference software that can integrate source data and quantitative diagrams
      • EduDBLANG – educational version of the DB with a full spectrum of reference possibilities
      • The Web version of “Living Diagrams” is prepared.
  • Outer tools applicable to JM data
      • R – statistical software tools
      • Phylogeny tools: …

Kernel versions of DB JM

  • DOS Version
      • Program language or environment / Data Base Engine and data format: Clipper / dBase compatible, DBF
      • Programmers, year of issue: Skokan †, 1997 (*)
      • Main functions: correction of model, adding new languages, browse, export, import, save, search, comparison
      • Compatibility: with Win version via essay export/import files
  • Windows Version
      • Program language or environment / Data Base Engine and data format: Pascal Delphi / Borland Database Engine, DBF
      • Programmers, year of issue: Logunov, Polyakov, 2002 (*)
      • English interface and content: yes, but the content is not synchronized with the RUS version
      • Main functions: correction of model, adding new languages, browse, navigation, export, import, save, simple and complex search, comparison, alphabetic and thematic indices
  • Web Version
      • Programmers, year of issue: Goncharov (1st var.), 2005; Khanukaev (2nd var.), 2006 (**)
      • There is also a Linux version (at KSU).
      • English interface and content: the content is completed (Yaroslavtceva, Makarova); the interface is completed (Khanukaev). We are finishing the work.
      • Main functions: browse, tree navigation, comparison
      • Compatibility: loads data from Win version via direct conversion of data base files
  • (*) Task formalization was done by Novikov †
  • (**) Task formalization was done by Polyakov
Source of Data for DB JM

  • Encyclopedic edition “Jaziki Mira” (Languages of the World): 14 volumes, published by the Institute of Linguistics of the Russian Academy of Sciences from 1993 to 2006.
  • Large Encyclopedic Dictionary. Linguistics (edited by Yarceva V.N.): includes interpretation of all terms of the DB model.
  • The main work on language description in DB format was carried out by Yelena Yaroslavceva, DSc.

List of Encyclopedic Publications “Jaziki Mira”(Languages of the World)

  • Languages of the world: Uralic (1993).
  • Languages of the world: Paleoasiatic languages. Moscow: Publ. “Indrik”. (1996). - 231 p.
  • Languages of the world: Turkic. Moscow: Publ. “Indrik”. (1997). - 544 p.
  • Languages of the world: Mongolic languages. Manchu-Tungus languages. Japanese. Korean. (Ed.: Kibrik A.A., Rogova N.B., Romanova O.I.). Moscow: Publ. “Indrik”. (1997). - 408 p.
  • Languages of the world: Iranian languages. I. South-Western Iranian languages. Moscow: Publ. “Indrik”. (1997). - 207 p.
  • Languages of the world: Iranian languages. II. North-Western Iranian languages. Moscow: Publ. “Indrik”. (1999). - 302 p.
  • Languages of the world: Dardic and Nuristani languages. Moscow: Publ. “Indrik”. (1998). - 143 p.
  • Languages of the world: Iranian languages. III. East Iranian languages. Moscow: Publ. “Indrik”. (1999). - 343 p.
  • Languages of the world: Germanic languages. Celtic languages. Moscow: Publ. “Academia”. (1999). - 472 p.
  • Languages of the world: Caucasian languages. RAS. Institute of Linguistics. Moscow: Publ. “Academia”. (2001). - 480 p.
  • Languages of the world: Romance languages. Moscow: Publ. “Academia”. (2001). - 720 p.
  • Languages of the world: Indo-Aryan languages of Ancient and Middle Period. Moscow: Publ. “Academia”. (2004). - 160 p.
  • Languages of the world: Slavonic languages. RAS. Institute of Linguistics. /Ed. A.M. Moldovan, S.S. Skorvid, A.A. Kibrik/ Moscow: Publ. “Academia”. (2005). - 656 p.
  • Languages of the world: Baltic languages. RAS. Institute of Linguistics. /Ed. V.N. Toporov, M.V. Zavyalov, A.A. Kibrik/ Moscow: Publ. “Academia”. (2006). - 224 p.
  • A new volume on Semitic languages has also been issued.

Characteristics of Data Base “Languages of the World” Content

    • The Data Base “Languages of the World” has the following quantitative characteristics.
  • - contains more than 3800 features
  • - covers 313 languages of Eurasia
  • - describes the following spheres of language: phonetics, morphology, syntax
  • - representation of data: binary
  • The following language families and unities are represented in the Data Base “Languages of the World”: Austroasiatic, Austronesian, Altaic, Afro-Asiatic, Indo-European, Caucasian, Paleoasiatic, Sino-Tibetan, Uralic, Hurro-Urartian. The DB also describes the language isolates Ainu, Nivkh, Burushaski, Sumerian and Elamite. A unique peculiarity of the Data Base “Languages of the World” is its large collection of descriptions of extinct languages, which includes 55 essays. There are no analogues of such a detailed and systematic description of extinct languages.
  • The main principles that form the model of language description are binarity, hierarchicity and paradigmaticity.
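A minimal sketch of what a binary representation makes possible, including the comparison of two languages that the DB versions offer as a function. The feature names, language names and values below are invented for illustration and are not taken from the DB; the real model has more than 3800 features arranged hierarchically, which this flat toy list does not try to reproduce.

```python
# Hypothetical feature inventory and language data, for illustration only;
# none of these values come from the database itself.
FEATURES = ["vowel harmony", "grammatical gender", "case marking", "definite article"]

LANGUAGES = {
    "Language A": {"vowel harmony", "case marking"},
    "Language B": {"grammatical gender", "case marking", "definite article"},
}


def to_binary_vector(present, features=FEATURES):
    """Return a 0/1 list: 1 if the language has the feature, else 0."""
    return [1 if f in present else 0 for f in features]


def compare(lang_a, lang_b, languages=LANGUAGES):
    """Split the features of two languages into shared and unshared sets."""
    a, b = languages[lang_a], languages[lang_b]
    return {"shared": a & b, "only_a": a - b, "only_b": b - a}


vec_a = to_binary_vector(LANGUAGES["Language A"])
vec_b = to_binary_vector(LANGUAGES["Language B"])
report = compare("Language A", "Language B")
```

Storing each language as the set of features it has keeps the data sparse, while the 0/1 vector form is convenient for export to tools such as Excel or R.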

Quantitative And Other Research Products

  • Similarity
      • Program language or environment: VBA, Excel
      • Programmers, year of issue: Polyakov, 2006
      • Main functions: software for similarity measure calculations
  • LangFam
      • Program language or environment: VBA, Excel
      • Programmers, year of issue: Polyakov, 2006
      • Main functions: software for calculating language family portraits, revealing genetic markers, filtering rare features, investigating typological shift, etc.
  • Special software for modeling of evolution
      • Program language or environment: Pascal Delphi
      • Programmers, year of issue: Yuzhikov, 2006 (*)
      • Main functions: modeling of the process of appearance, borrowing and extinction of features. Uses different parameters of the model and gives different quantitative values.
  • Special software for clustering tasks
      • Program language or environment: Pascal Delphi
      • Programmers, year of issue: Dvoenosova (1st var.), 2006; Zheleznovsky (2nd var.), 2008 (*)
      • Main functions: clustering of languages and features by different techniques of classic cluster analysis
  • Special software for phylogeny with different metrics of feature space
      • Program language or environment: Visual C
      • Programmers, year of issue: Faskhutdinov, 2008 (*)
      • Main functions: uses two heuristic ideas, the L- and S-metrics, to calculate the distance between languages
  • BiCoTree – software for easy tree building on the DB
      • Program language or environment: Pascal Delphi
      • Programmers, year of issue: Sarvarov, 2010 (*)
  • Some other research programs, developed for different aims during individual investigations in areal, historical and typological linguistics (Gusareva, Loginova, Faskhutdinov, Omlin, Polyakov, Solovyev)
      • Program language or environment: C, Pascal Delphi
      • Main functions: solve different tasks:
          • calculate a core of relevant features for different language families;
          • calculate a homeland for different language families using grammar features;
          • calculate a stability index using different metrics;
          • etc.
  • (*) Task formalization was done by Valery Solovyev
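A hedged sketch of how a distance between languages and a clustering step can be computed on binary feature data. The survey does not define the L- and S-metrics or say which techniques of cluster analysis the programs implement, so this example swaps in the standard Jaccard distance and average-linkage agglomerative clustering. The function names and the toy data are assumptions made for illustration only.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """1 - |A & B| / |A | B| for two sets of present features."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def average_linkage(c1, c2, data):
    """Mean pairwise distance between the members of two clusters."""
    total = sum(jaccard_distance(data[x], data[y]) for x in c1 for y in c2)
    return total / (len(c1) * len(c2))


def cluster(data, n_clusters):
    """Merge the two closest clusters until n_clusters remain."""
    clusters = [frozenset([name]) for name in sorted(data)]
    while len(clusters) > n_clusters:
        c1, c2 = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], data),
        )
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 | c2]
    return clusters


# Hypothetical toy data: IDs of the features present in each language.
DATA = {
    "L1": {1, 2, 3},
    "L2": {1, 2, 4},
    "L3": {7, 8, 9},
    "L4": {7, 8, 10},
}

result = cluster(DATA, 2)
```

With this toy data, L1 and L2 end up in one cluster and L3 and L4 in the other, because each pair shares two of its four distinct features. Recording the order of merges instead of stopping at a fixed number of clusters would give a tree, which is the kind of output a tree-building tool needs.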

Reference and Educational Products (under constr.)

  • Living Diagrams
      • Program language or environment: C# and .NET
      • Data Base Engine and data format: MS SQL Server, Excel
      • Programmers: Khanukaev (*)
      • Main functions: reference software that can integrate source data and quantitative diagrams. Allows drawing quantitative pictures or tables and querying the source data directly from the picture. Intended to improve linguists' confidence in quantitative results.
  • EduDBLANG
      • Program language or environment: C# and .NET
      • Data Base Engine and data format: MS SQL Server, Excel
      • Programmers: Belyaev (*)
      • Main functions: educational version of the DB with a full spectrum of reference possibilities. Includes genetic and geographic indices, annotations and examples for features, and full texts of papers, in the best WALS traditions. New concept of user interface.
  • (*) Task formalization was done by Polyakov

Specific problems, related to the software development

  • Problem
  • Solution
  • Problem of compatibility
  • Special data converters are needed. Partly solved in the Kernel versions of the DB, not solved in related products.
  • The life cycle of the product is longer than the “lifetime” of the OS, the program environment and even the programmers.
  • Solved by keeping the key members of the team and organizing permanent knowledge inheritance
  • English interface
  • Easily solved
  • English content
  • Very hard problem because of the enormous volume of source data and the difficulty of translating terms correctly
  • May be solved by careful testing and choice of data format
  • Content adding and support
  • Solved by a highly qualified team of content developers

Dictionary and source books

  • Dictionary
  • Two of 14 source books

Screenshots. Win Version

Screenshots. Living Diagrams. #1 to #5

Web-version

  • www.dblang.ru

  • THANK YOU!
  • Contacts:
  • Vladimir Polyakov (pvn-65@mail.ru)

