Query Processing over Incomplete Databases

Author : Yunjun Gao
Publisher : Springer Nature
Total Pages : 106
ISBN-10 : 303101863X
ISBN-13 : 9783031018633

Synopsis Query Processing over Incomplete Databases by : Yunjun Gao

Incomplete data is a part of life and arises in almost all areas of scientific study. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is deliberately left incomplete in order to protect sensitive attribute values. Query processing is a fundamental problem in computer science and is useful in a variety of applications. This book focuses on query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate on the three general approaches to handling incomplete data: (i) discarding the data with missing values, (ii) imputing the missing values, and (iii) relying only on the observed data values. For the third approach, we introduce the semantics of k-nearest neighbor (kNN) search, skyline queries, and top-k dominating queries on incomplete data. For these three representative queries, we investigate advanced processing techniques, including indexing, pruning, and crowdsourcing.
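The three general approaches named in the synopsis can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the book: the toy data, the function names, the use of column means for imputation, and the choice of a squared Euclidean distance restricted to the dimensions observed in both objects.

```python
from statistics import mean

# Toy dataset (hypothetical): each row is an object, None marks a missing value.
rows = [
    (1.0, 2.0),
    (None, 4.0),
    (3.0, None),
    (5.0, 6.0),
]

def discard_incomplete(data):
    """(i) Keep only the rows that have no missing values."""
    return [r for r in data if None not in r]

def impute_with_column_mean(data):
    """(ii) Replace each missing value with the mean of the observed
    values in the same column (one simple imputation choice among many)."""
    cols = list(zip(*data))
    means = [mean(v for v in col if v is not None) for col in cols]
    return [tuple(m if v is None else v for v, m in zip(r, means)) for r in data]

def observed_distance(a, b):
    """(iii) Squared Euclidean distance over the dimensions observed in
    both a and b. Returns None when the two objects share no dimension."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return None
    return sum((x - y) ** 2 for x, y in shared)

complete_only = discard_incomplete(rows)
imputed = impute_with_column_mean(rows)
dist = observed_distance(rows[1], rows[3])
```

With this toy data, approach (i) keeps two of the four rows, approach (ii) keeps all four but fills the gaps with estimated values, and approach (iii) compares objects directly on whatever they have in common, which is the setting the synopsis associates with kNN, skyline, and top-k dominating queries.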

Query Processing over Incomplete Databases

Author : Yunjun Gao
Publisher : Morgan & Claypool Publishers
Total Pages : 124
ISBN-10 : 1681734214
ISBN-13 : 9781681734217

Synopsis Query Processing over Incomplete Databases by : Yunjun Gao

Incomplete data is a part of life and arises in almost all areas of scientific study. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is deliberately left incomplete in order to protect sensitive attribute values. Query processing is a fundamental problem in computer science and is useful in a variety of applications. This book focuses on query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate on the three general approaches to handling incomplete data: (i) discarding the data with missing values, (ii) imputing the missing values, and (iii) relying only on the observed data values. For the third approach, we introduce the semantics of k-nearest neighbor (kNN) search, skyline queries, and top-k dominating queries on incomplete data. For these three representative queries, we investigate advanced processing techniques, including indexing, pruning, and crowdsourcing.

Proceedings of the International Conference on Big Data, IoT, and Machine Learning

Author : Mohammad Shamsul Arefin
Publisher : Springer Nature
Total Pages : 784
ISBN-10 : 9811666369
ISBN-13 : 9789811666360

Synopsis Proceedings of the International Conference on Big Data, IoT, and Machine Learning by : Mohammad Shamsul Arefin

This book gathers a collection of high-quality peer-reviewed research papers presented at the International Conference on Big Data, IoT and Machine Learning (BIM 2021), held in Cox’s Bazar, Bangladesh, during 23–25 September 2021. The book covers research papers in the field of big data, IoT and machine learning. The book will be helpful for active researchers and practitioners in the field.

Database Systems for Advanced Applications

Author : Hwanjo Yu
Publisher : Springer
Total Pages : 357
ISBN-10 : 364229023X
ISBN-13 : 9783642290237

Synopsis Database Systems for Advanced Applications by : Hwanjo Yu

This book constitutes the workshop proceedings of the 17th International Conference on Database Systems for Advanced Applications, DASFAA 2012, held in Busan, South Korea, in April 2012. The volume covers five workshops, each focusing on a specific area that contributes to the main themes of the DASFAA conference: the Second International Workshop on Flash-based Database Systems (FlashDB 2012), the First International Workshop on Information Technologies for Maritime and Logistics (ITEMS 2012), the Third International Workshop on Social Networks and Social Media Mining on the Web (SNSMW 2012), the Second International Workshop on Spatial Information Modeling, Management and Mining (SIM3 2012), and the Fifth International Workshop on Data Quality in Integration Systems (DQIS 2012).

Knowledge Graphs and Big Data Processing

Author : Valentina Janev
Publisher : Springer Nature
Total Pages : 212
ISBN-10 : 3030531996
ISBN-13 : 9783030531997

Synopsis Knowledge Graphs and Big Data Processing by : Valentina Janev

This open access book is part of the LAMBDA Project (Learning, Applying, Multiplying Big Data Analytics), funded by the European Union, GA No. 809965. Data Analytics involves applying algorithmic processes to derive insights. Nowadays it is used in many industries to allow organizations and companies to make better decisions as well as to verify or disprove existing theories or models. The term data analytics is often used interchangeably with intelligence, statistics, reasoning, data mining, knowledge discovery, and others. The goal of this book is to introduce some of the definitions, methods, tools, frameworks, and solutions for big data processing, starting from the process of information extraction and knowledge representation, via knowledge processing and analytics to visualization, sense-making, and practical applications. Each chapter in this book addresses some pertinent aspect of the data processing chain, with a specific focus on understanding Enterprise Knowledge Graphs, Semantic Big Data Architectures, and Smart Data Analytics solutions. This book is addressed to graduate students from technical disciplines, to professional audiences following continuous education short courses, and to researchers from diverse areas following self-study courses. Basic skills in computer science, mathematics, and statistics are required.

Scalable Processing of Spatial-Keyword Queries

Author : Ahmed R. Mahmood
Publisher : Springer Nature
Total Pages : 98
ISBN-10 : 3031018672
ISBN-13 : 9783031018671

Synopsis Scalable Processing of Spatial-Keyword Queries by : Ahmed R. Mahmood

Text data that is associated with location data has become ubiquitous. A tweet is an example of this type of data, where the text in a tweet is associated with the location where the tweet has been issued. We use the term spatial-keyword data to refer to this type of data. Spatial-keyword data is being generated at massive scale. Almost all online transactions have an associated spatial trace. The spatial trace is derived from GPS coordinates, IP addresses, or cell-phone-tower locations. Hundreds of millions or even billions of spatial-keyword objects are being generated daily. Spatial-keyword data has numerous applications that require efficient processing and management of massive amounts of spatial-keyword data. This book starts by overviewing some important applications of spatial-keyword data, and demonstrates the scale at which spatial-keyword data is being generated. Then, it formalizes and classifies the various types of queries that execute over spatial-keyword data. Next, it discusses important and desirable properties of spatial-keyword query languages that are needed to express queries over spatial-keyword data. As will be illustrated, existing spatial-keyword query languages vary in the types of spatial-keyword queries that they can support. There are many systems that process spatial-keyword queries. Systems differ from each other in various aspects, e.g., whether the system is batch-oriented or stream-based, and whether the system is centralized or distributed. Moreover, spatial-keyword systems vary in the types of queries that they support. Finally, systems vary in the types of indexing techniques that they adopt. This book provides an overview of the main spatial-keyword data-management systems (SKDMSs), and classifies them according to their features. Moreover, the book describes the main approaches adopted when indexing spatial-keyword data in the centralized and distributed settings. 
Several case studies of SKDMSs are presented, along with the applications and query types that these SKDMSs target and the indexing techniques they use to process their queries. Optimizing the performance and query processing of SKDMSs still poses many research challenges and open problems. The book concludes with a discussion of several important open research problems in the domain of scalable spatial-keyword processing.

Advanced Database Systems For Integration Of Media And User Environments '98: Advanced Database Research

Author : Yahiko Kambayashi
Publisher : World Scientific
Total Pages : 366
ISBN-10 : 9814545031
ISBN-13 : 9789814545037

Synopsis Advanced Database Systems For Integration Of Media And User Environments '98: Advanced Database Research by : Yahiko Kambayashi

This volume is a progress report on the project Research and Development of Advanced Database Systems for Integration of Media and User Environments, supported by the Ministry of Education, Science, Sports and Culture of Japan. It examines research on new database systems prompted by recent developments in network technology; as a result, a clearer picture of integration through database technology emerges.

Pattern Analysis, Intelligent Security and the Internet of Things

Author : Ajith Abraham
Publisher : Springer
Total Pages : 356
ISBN-10 : 3319173987
ISBN-13 : 9783319173986

Synopsis Pattern Analysis, Intelligent Security and the Internet of Things by : Ajith Abraham

This volume presents selected papers from the five parallel symposiums of the 2014 Fourth World Congress on Information and Communication Technologies (WICT 2014), held in Malacca, Malaysia. The theme of WICT 2014 was 'Innovating ICT for Social Revolutions'. WICT 2014 was co-organized by Machine Intelligence Research Labs (MIR Labs), USA, and Universiti Teknikal Malaysia Melaka, Malaysia. It was technically co-sponsored by the IEEE Systems, Man & Cybernetics Society Malaysia and Spain Chapters and technically supported by the IEEE Systems, Man and Cybernetics Society Technical Committee on Soft Computing.

Transaction Processing on Modern Hardware

Author : Mohammad Sadoghi
Publisher : Springer Nature
Total Pages : 122
ISBN-10 : 3031018702
ISBN-13 : 9783031018701

Synopsis Transaction Processing on Modern Hardware by : Mohammad Sadoghi

The last decade has brought groundbreaking developments in transaction processing. This resurgence of an otherwise mature research area has been spurred by the diminishing cost per GB of DRAM, which allows many transaction processing workloads to be entirely memory-resident. This shift demanded a pause to fundamentally rethink the architecture of database systems. The data storage lexicon has now expanded beyond spinning disks and RAID levels to include the cache hierarchy, memory consistency models, cache coherence and write invalidation costs, NUMA regions, and coherence domains. New memory technologies promise fast non-volatile storage and expose uncharted trade-offs for transactional durability, such as exploiting byte-addressable hot and cold storage through persistent programming that promotes simpler recovery protocols. In the meantime, plateauing single-threaded processor performance has brought massive concurrency within a single node, first in the form of multi-core, and now with many-core and heterogeneous processors. The exciting possibility of reshaping the storage, transaction, logging, and recovery layers of next-generation systems on emerging hardware has prompted the database research community to vigorously debate the trade-offs between specialized kernels that narrowly focus on transaction processing performance and designs that permit transactionally consistent data accesses from decision support and analytical workloads. In this book, we aim to classify and distill the new body of work on transaction processing that has surfaced in the last decade, in order to guide researchers and practitioners through this intricate research subject.