Natural Language Processing and Information Retrieval; Principles and Applications 9781032154923, 9781032154930, 9781003244332

This book presents the basics and recent advancements in natural language processing and information retrieval in a sing

128 50 7MB

English Pages 252 Year 2023

Report DMCA / Copyright

DOWNLOAD FILE

Natural Language Processing and Information Retrieval; Principles and Applications
 9781032154923, 9781032154930, 9781003244332

Table of contents :
1 Federated learning for natural language processing

Sergei Ternovykh and Anastasia Nikiforova

1.1 Introduction

1.1.1 Centralized (standard) federated learning

1.1.2 Aggregation algorithms

1.1.3 DSSGD

1.1.4 FedAvg

1.1.5 FedProx

1.1.6 SCAFFOLD

1.1.7 FedOpt

1.1.8 Other algorithms

1.1.9 The cross-device setting

1.1.10 The cross-silo setting

1.1.11 Horizontal federated learning

1.1.12 Vertical federated learning

1.1.13 Federated transfer learning

1.2 Split learning and split federated learning 15

1.2.1 Vanilla split learning

1.2.2 Configurations of split learning

1.2.3 SplitFed learning

1.3 Decentralized (peer-to-peer) federated learning

1.4 Summary

1.5 NLP through FL

1.6 Security and privacy protection

1.7 FL platforms and datasets

1.8 Conclusion

References

2 Utility-based recommendation system for large datasets using EAHUIM

Vandna Dahiya

2.1 Introduction

2.2 Related work

2.2.1 Recommendation system

2.2.2 Recommendation systems in E-commerce

2.2.3 Recommendation system with utility itemset mining

2.2.4 Prerequisites of utility itemset mining

2.2.5 Problem definition

2.3 Proposed model

2.3.1 Design of the model – recommendation system with EAHUIM

2.4 Results and discussion

2.4.1 Setup

2.4.2 Data collection and preprocessing

2.4.3 Performance evaluation

2.4.4 Limitations of the model

2.4.5 Discussions

2.5 Conclusion

References

3 Anaphora resolution: A complete view with case study

Kalpana B. Khandale and C. Namrata Mahender

3.1 Introduction

3.1.1 Issues and challenges of an Anaphora

3.1.2 Need for anaphora in NLP applications

3.1.3 Anaphora

3.1.4 Discourse anaphora

3.2 Approaches to Anaphora resolution

3.2.1 Knowledge-rich approaches

3.2.2 Corpus based approaches

3.2.3 Knowledge-poor approaches

3.3 Case study of anaphora resolution in Marathi text

3.3.1 Development of POS tagger

3.3.2 Anaphora resolution system architecture

3.4 Case Study of Anaphora Resolution in Marathi with Python

3.4.1 Database

3.4.2 Preprocessing

3.4.3 Post-processing

3.5 Conclusion

References

4 A review of the approaches to neural machine translation

Preetpal Kaur Buttar and Manoj Kumar Sachan

4.1 Introduction

4.2 Machine translation approaches

4.3 Formulation of the NMT task

4.4 The encoder-decoder model

4.4.1 Encoder

4.4.2 Decoder

4.5 RNNs as encoder-decoder models

4.5.1 One-hot encoding

4.5.2 Variations of RNNs

4.5.3 Discussion and inferences

4.6 LSTMs: dealing with long-term dependencies and vanishing gradients

4.6.1 GRUs

4.6.2 Limitations of LSTMs

4.7 NMT with attention

4.8 Recent developments in NMT

4.8.1 Word embeddings

4.8.2 CNN-based NMT

4.8.3 Fully attention-based NMT

4.8.4 Transformer-based pre-trained models

4.8.5 Improved transformer models

4.9 NMT in low-resource languages

4.10 Vocabulary coverage problem

4.11 Datasets for machine translation

4.12 Challenges and future scope

References

5 Evolution of question-answering system from information retrieval: A scientific time travel for Bangla

Arijit Das and Diganta Saha

5.1 Introduction

5.2 The meaning and various ways of research done in the field of semantics

5.3 Semantic text retrieval: automatic question-answering system or query system

5.4 State-of-the-art performance

5.5 Latest research works on question-answering systems in related major global languages

5.6 Latest research works on question-answering system in major Indian languages

5.7 Backend database or repository used in the question-answering system

5.8 Different approaches of algorithms used in Bangla QA system research

5.9 Different algorithms used in Bangla QA system research

5.10 Results achieved by various Bangla QA systems

5.11 Conclusion

References

6 Recent advances in textual code-switching

Sergei Ternovykh and Anastasia Nikiforova

6.1 Introduction

6.2 Background

6.2.1 Theoretical approaches and types of code-switching

6.2.2 Measuring code-switching complexity

6.3 Code-switching datasets

6.3.1 Language identification

6.3.2 POS tagging

6.3.3 Named entity recognition

6.3.4 Chunking and dependency parsing

6.3.5 Sentiment analysis

6.3.6 Question-answering

6.3.7 Conversational systems

6.3.8 Machine translation

6.3.9 Natural language inference

6.4 NLP techniques for textual code-switching

6.4.1 Language modeling

6.4.2 Language identification

6.4.3 POS tagging

6.4.4 Named entity recognition

6.4.5 Dependency parsing

6.4.6 Sentiment analysis

6.4.7 Natural language inference

6.4.8 Machine translation

6.4.9 Question-answering

6.5 Evaluation of code-switched systems

6.6 Current limitations and future work

References

7 Legal document summarization using hybrid model

Deekshitha and Nandhini K.

7.1 Introduction

7.1.1 Background

7.1.2 Motivation

7.1.3 Problem definition

7.1.4 Objectives and scopes

7.1.5 Organization

7.2 Literature review

7.2.1 Automatic text summarization in the legal domain

7.3 Methodology

7.3.1 Legal document summary

7.3.2 Evaluation

7.4 Experiments and results

7.4.1 Evaluating extractive model

7.4.2 Effects of different K values (summary length)

7.4.3 Evaluating abstractive summary model

7.4.4 Comparison with extractive summarization models

7.4.5 Comparison with abstractive model

7.5 Conclusion

References

8 Concept network using network text analysis

Md Masum Billah, Dipanita Saha, Farzana Bhuiyan, and Mohammed Kaosar

8.1 Introduction

8.2 Literature review

8.3 The concept network

8.3.1 Concept-based information retrieval

8.3.2 Concept networks and extended fuzzy concept networks

8.3.3 Applications for fuzzy concept knowledge

8.3.4 Building WikiNet: using Wikipedia as the source

8.4 Network text analysis

8.4.1 Extracting context words from training documents

8.4.2 Building bigram frequency for text classification

8.4.3 Detecting related articles by using bigrams

8.5 Conclusion and future direction

References

9 Question-answering versus machine reading comprehension: Neural machine reading comprehension using transformer models

Nisha Varghese and M. Punithavalli

9.1 Introduction

9.2 Architecture of machine reading comprehension

9.2.1 Word embedding

9.2.2 Feature extraction

9.2.3 Context-question interaction

9.2.4 Answer prediction

9.3 Machine reading comprehension tasks and classification

9.3.1 Cloze tests

9.3.2 Multiple-choice questions

9.3.3 Span extraction

9.3.4 Free-form answering

9.3.5 Attribute-based classification

9.4 Datasets

9.5 Performance evaluation metrics

9.5.1 Accuracy

9.5.2 Exact match

9.5.3 Precision and recall

9.5.4 F1 score

9.5.6 ROUGE (recall-oriented understudy for gisting evaluation)

9.5.7 BLEU (bilingual evaluation understudy)

9.5.8 METEOR (Metric for Evaluation of Translation with Explicit ORdering)

9.5.9 HEQ (human equivalence score)

9.6 Transformer and BERT

9.6.1 BERT-based models

9.7 Results and discussion

9.8 Conclusion and future enhancement

References

10 Online subjective question-answering system necessity of education system

Madhav A. Kankhar, Bharat A. Shelke, and C. Namrata Mahender

10.1 Introduction

10.1.1 Brief on NLP (natural language processing)

10.1.2 Question

10.1.3 Answer

10.1.4 Question-answering system

10.1.5 Types of question-answering

10.1.6 Approaches used for developing QA system

10.1.7 Components of question-answering

10.1.8 Need for question-answering system

10.2 Question types follow into two categories

10.2.1 Objective examination

10.2.2 Subjective examination

10.2.3 Subjective examination-related work

10.2.4 Why is subjective examination important

10.2.5 Online education system

10.3 Proposed model

10.3.1 Text document

10.3.2 Preprocessing

10.3.3 POS tagging

10.3.4 Question generation

10.3.5 User (students) and user (teacher)

10.3.6 Model answer

10.3.7 Answer

10.3.8 Evaluation

10.3.9 Result

10.4 Conclusion

References

Index

Polecaj historie