Video analytics for compliance and quality monitoring in MoRD skill development centres
Deen Dayal Upadhyaya Grameen Kaushalya Yojana (DDU-GKY) is a placement-linked skill development scheme for rural poor youth, run by the Ministry of Rural Development (MoRD). It is an important component of the National Skill Development Policy. The DDU-GKY skilling ecosystem consists of MoRD, State missions, project implementing agencies or training partners, and technical support agencies. Both generic training (soft skills, English and Information Technology) and trade-specific training are offered to youth at various training centres in partnership with third-party training agencies.
Assessment of farmer producer organisations using Rubric methodology
A farmer producer organisation (FPO) is a type of producer organisation (PO), which is a legal entity formed by primary producers, viz. farmers, milk producers, fishermen, weavers, rural artisans, craftsmen and the like. The primary objective of an FPO is to mobilise small and marginal farmers (~80% of total farmers in India) into member-owned producer organisations. It helps to foster technology penetration, improve productivity and access to inputs, investments, markets and services, and increase farmer incomes, thereby strengthening their sustainable agriculture-based livelihoods.
Search across knowledge sources: Components of building semantic search systems
Semantic search can be described as the effort to improve the accuracy of the search process by understanding context and limiting ambiguity. Semantic search engines try to understand the meanings hidden in retrieved documents and users’ queries by adding semantic tags to texts, which brings structure to the documents and conceptualises the objects within them. The primary components of the semantic web, ontologies and knowledge graphs (populated ontologies), are rich sources of domain knowledge.
IndoWordNet
Wordnets are lexical structures composed of sets of synonyms called synsets, and semantic relations between these synsets. Wordnets help in various natural language processing (NLP) tasks such as word sense disambiguation and machine translation. The unavailability of a crucial lexical resource like a wordnet has impeded the development of NLP technologies for Indian languages.
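The structure described above (synsets plus relations between them) can be sketched as a small data structure. This is a minimal illustration only, not the IndoWordNet schema or API: the class names `Synset` and `ToyWordnet`, the synset identifiers, the glosses and the English sample entries are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """One concept: a set of synonymous lemmas plus links to other synsets."""
    synset_id: str
    lemmas: tuple
    gloss: str
    # relation name (e.g. "hypernym") -> list of target synset ids
    relations: dict = field(default_factory=dict)


class ToyWordnet:
    """Hypothetical in-memory wordnet: synsets indexed by id and by lemma."""

    def __init__(self):
        self.synsets = {}
        self.lemma_index = {}

    def add(self, synset):
        self.synsets[synset.synset_id] = synset
        for lemma in synset.lemmas:
            self.lemma_index.setdefault(lemma, []).append(synset.synset_id)

    def relate(self, source_id, relation, target_id):
        """Record a directed semantic relation between two synsets."""
        self.synsets[source_id].relations.setdefault(relation, []).append(target_id)

    def senses(self, lemma):
        """All synsets that contain the given lemma."""
        return [self.synsets[i] for i in self.lemma_index.get(lemma, [])]

    def related(self, synset_id, relation):
        """Synsets reachable from synset_id through one relation link."""
        ids = self.synsets[synset_id].relations.get(relation, [])
        return [self.synsets[i] for i in ids]


# Illustrative entries only (assumed, not taken from any real wordnet).
wn = ToyWordnet()
wn.add(Synset("car.n.1", ("car", "automobile", "motorcar"),
              "a motor vehicle with four wheels"))
wn.add(Synset("vehicle.n.1", ("vehicle",),
              "a conveyance that transports people or objects"))
wn.relate("car.n.1", "hypernym", "vehicle.n.1")

print([s.lemmas for s in wn.senses("automobile")])
print([s.synset_id for s in wn.related("car.n.1", "hypernym")])
```

Looking up a lemma returns every synset it belongs to, so a polysemous word would simply appear in more than one synset.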
Machine translation
Machine translation (MT) deals with automatic translation of text from one natural language to another. It is one of the most challenging problems in natural language processing (NLP), requiring knowledge from all sub-areas of NLP. In an increasingly connected world, human interaction requires crossing language barriers in the government, business, social and cultural spheres. MT is a key technology to overcome these language barriers.
Cognitive NLP
Cognitive NLP research at the Centre for Indian Language Technology (CFILT) attempts to gain insights into the cognitive underpinnings of human language processing and understanding. The insights are then translated into methods and models that contribute to the field of NLP by achieving the following objectives: (1) optimising human annotation effort for better annotation management, and (2) improving existing NLP systems by introducing cognitive features.
Information extraction and retrieval
Information extraction (IE) is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents. Information retrieval (IR) is the task of using queries to find material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers). The Centre for Indian Language Technology (CFILT) at IIT Bombay has a long-standing research record in both information extraction and retrieval.
Word sense disambiguation
Word sense disambiguation (WSD) deals with computationally resolving ambiguities in a text. Languages have many polysemous words, i.e. words with more than one meaning or sense. WSD is the process of identifying the correct sense of a word in a particular context. For instance, consider the sentence ‘Ram is playing cricket in the park’, where the word ‘cricket’ is ambiguous between two senses: ‘a game’ and ‘an insect’.
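One simple way to illustrate the ‘cricket’ example is a gloss-overlap heuristic in the spirit of the simplified Lesk algorithm: pick the sense whose dictionary gloss shares the most words with the sentence. The sketch below is an assumption-laden toy, not the method used in the project: the sense inventory `SENSES`, its glosses, the stopword list and the crude suffix-stripping `stem` helper are all hypothetical.

```python
import re

# Hypothetical toy sense inventory; the glosses are written for illustration only.
SENSES = {
    "cricket": {
        "game": "a game played with a bat and ball by two teams on a field",
        "insect": "a leaping insect that makes a chirping sound at night",
    }
}

STOPWORDS = {"a", "an", "the", "is", "was", "in", "on", "at", "by",
             "of", "and", "with", "that", "two"}


def stem(word):
    """Very crude suffix stripping so that 'playing' and 'played' match."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def content_words(text):
    """Lowercase, drop stopwords, and stem the remaining words."""
    words = re.findall(r"[a-z]+", text.lower())
    return {stem(w) for w in words if w not in STOPWORDS}


def disambiguate(word, sentence):
    """Return (best_sense, scores) by counting gloss/context word overlap."""
    context = content_words(sentence) - {stem(word)}
    scores = {
        sense: len(context & content_words(gloss))
        for sense, gloss in SENSES[word].items()
    }
    return max(scores, key=scores.get), scores


print(disambiguate("cricket", "Ram is playing cricket in the park"))
print(disambiguate("cricket", "A cricket was chirping in the grass at night"))
```

With these toy glosses the first sentence overlaps only with the ‘game’ gloss (through ‘play’), and the second overlaps only with the ‘insect’ gloss. Practical systems rely on much richer sense inventories and context models than this word-counting heuristic.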
Development and evolution of Sanskrit Wordnet
This project is funded by the Linguistic Data Consortium for Indian Languages (LDC-IL). Sanskrit Wordnet is an online, multipurpose, multi-linked thesaurus. It records the different relations between synsets (synonym sets), each of which represents a unique concept. We have created synsets across various parts of speech (POS), namely nouns, verbs, adjectives and adverbs.
Towards a critical edition of the Kasikavrtti
This project aims to bring out a critical edition of the Kasikavrtti, a text written in the 7th century CE. It is the oldest complete commentary on the celebrated Sanskrit grammar of Panini. We are preparing the edition from more than 200 manuscripts of the text, written in more than 8 Indian scripts such as grantha, sharada and nagari, and found across the Indian subcontinent as well as abroad (in the US, UK, Germany, France, etc.).