Modern Algorithms of Cluster Analysis

Author : Slawomir Wierzchoń
Publisher : Springer
Total Pages : 433
ISBN-10 : 3319693085
ISBN-13 : 9783319693088

Synopsis of Modern Algorithms of Cluster Analysis by Slawomir Wierzchoń

This book provides the reader with a basic understanding of the formal concepts of cluster, clustering, partition, and cluster analysis. It explains feature-based, graph-based and spectral clustering methods and discusses their formal similarities and differences. Understanding the related formal concepts is particularly vital in the epoch of Big Data: because of the volume and characteristics of the data, it is no longer feasible to rely predominantly on visual inspection when facing a clustering problem. Clustering usually involves choosing similar objects and grouping them together. To facilitate the choice of similarity measures for complex and big data, the book describes various measures of object similarity based on quantitative features (such as numerical measurement results), qualitative features (such as text), and combinations of the two, as well as graph-based similarity measures for (hyper)linked objects and measures for multilayered graphs. Numerous variants demonstrating how such similarity measures can be exploited when defining clustering cost functions are also presented. In addition, the book provides an overview of approaches to handling large collections of objects in a reasonable time. In particular, it addresses grid-based methods, sampling methods, parallelization via Map-Reduce, usage of tree structures, random projections and various heuristic approaches, especially those used for community detection.
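As a rough illustration of a similarity measure that combines quantitative and qualitative features, the sketch below uses a Gower-style average. This is not taken from the book: the function name `mixed_similarity` and the `numeric_ranges` parameter are hypothetical, and the equal weighting of all features is an assumption of this example.

```python
def mixed_similarity(x, y, numeric_ranges):
    """Gower-style similarity in [0, 1] for records mixing numbers and categories.

    numeric_ranges maps a feature index to that feature's (max - min) over the
    dataset; every index not in the mapping is treated as categorical.
    """
    scores = []
    for i, (a, b) in enumerate(zip(x, y)):
        if i in numeric_ranges:
            rng = numeric_ranges[i]
            # Numeric feature: 1 minus the range-normalized absolute difference.
            scores.append(1.0 - abs(a - b) / rng if rng > 0 else 1.0)
        else:
            # Categorical feature: exact match scores 1, anything else 0.
            scores.append(1.0 if a == b else 0.0)
    # Assumption of this sketch: all features carry equal weight.
    return sum(scores) / len(scores)


# Two records with one numeric and one categorical feature.
sim = mixed_similarity((10.0, "red"), (20.0, "red"), {0: 40.0})
```

Here the numeric feature contributes 1 - 10/40 = 0.75 and the matching category contributes 1, so the two records have similarity 0.875.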

Data Clustering: Theory, Algorithms, and Applications, Second Edition

Author : Guojun Gan
Publisher : SIAM
Total Pages : 430
ISBN-10 : 1611976332
ISBN-13 : 9781611976335

Synopsis of Data Clustering: Theory, Algorithms, and Applications, Second Edition by Guojun Gan

Data clustering, also known as cluster analysis, is an unsupervised process that divides a set of objects into homogeneous groups. Since the publication of the first edition of this monograph in 2007, development in the area has exploded, especially in clustering algorithms for big data and open-source software for cluster analysis. This second edition reflects these new developments, covers the basics of data clustering, includes a list of popular clustering algorithms, and provides program code that helps users implement clustering algorithms. Data Clustering: Theory, Algorithms and Applications, Second Edition will be of interest to researchers, practitioners, and data scientists as well as undergraduate and graduate students.

Spectral Algorithms

Author : Ravindran Kannan
Publisher : Now Publishers Inc
Total Pages : 153
ISBN-10 : 1601982747
ISBN-13 : 9781601982742

Synopsis of Spectral Algorithms by Ravindran Kannan

Spectral methods refer to the use of eigenvalues, eigenvectors, singular values and singular vectors. They are widely used in Engineering, Applied Mathematics and Statistics. More recently, spectral methods have found numerous applications in Computer Science to "discrete" as well as "continuous" problems. Spectral Algorithms describes modern applications of spectral methods, and novel algorithms for estimating spectral parameters. The first part of the book presents applications of spectral methods to problems from a variety of topics including combinatorial optimization, learning and clustering. The second part of the book is motivated by efficiency considerations. A feature of many modern applications is the massive amount of input data. While sophisticated algorithms for matrix computations have been developed over a century, a more recent development is algorithms based on "sampling on the fly" from massive matrices. Good estimates of singular values and low rank approximations of the whole matrix can be provably derived from a sample. The main emphasis in the second part of the book is to present these sampling methods with rigorous error bounds. It also presents recent extensions of spectral methods from matrices to tensors and their applications to some combinatorial optimization problems.
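To make the notion of a spectral parameter concrete, here is a minimal sketch that estimates the largest singular value of a small matrix by power iteration on A^T A. This is a standard textbook technique, not the sampling-based method the book emphasizes. The function name `top_singular_value`, the iteration count, and the all-ones start vector are assumptions of this example.

```python
import math


def top_singular_value(a, iters=200):
    """Estimate the largest singular value of `a` (a list of equal-length rows)
    by power iteration on A^T A."""
    n_rows, n_cols = len(a), len(a[0])
    # Deterministic start vector (an assumption of this sketch); the method
    # fails if it is orthogonal to the top right singular vector.
    v = [1.0] * n_cols
    for _ in range(iters):
        # u = A v
        u = [sum(row[j] * v[j] for j in range(n_cols)) for row in a]
        # w = A^T u
        w = [sum(a[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in w]
    # With v a unit vector, ||A v|| approximates the top singular value.
    u = [sum(row[j] * v[j] for j in range(n_cols)) for row in a]
    return math.sqrt(sum(x * x for x in u))


sigma = top_singular_value([[3.0, 0.0], [0.0, 4.0]])
```

For this diagonal matrix the singular values are 3 and 4, so the estimate converges to 4. Convergence slows as the two largest singular values get closer together.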

Handbook of Cluster Analysis

Author : Christian Hennig
Publisher : CRC Press
Total Pages : 753
ISBN-10 : 1466551895
ISBN-13 : 9781466551893

Synopsis of Handbook of Cluster Analysis by Christian Hennig

Handbook of Cluster Analysis provides a comprehensive and unified account of the main research developments in cluster analysis. Written by active, distinguished researchers in this area, the book helps readers make informed choices of the most suitable clustering approach for their problem and make better use of existing cluster analysis tools.

Classification, Clustering, and Data Analysis

Author : Krzysztof Jajuga
Publisher : Springer Science & Business Media
Total Pages : 468
ISBN-10 : 3642561810
ISBN-13 : 9783642561818

Synopsis of Classification, Clustering, and Data Analysis by Krzysztof Jajuga

The book presents a long list of useful methods for classification, clustering and data analysis. By combining theoretical aspects with practical problems, it is designed for researchers as well as for applied statisticians and will support the fast transfer of new methodological advances to a wide range of applications.

Mining Text Data

Author : Charu C. Aggarwal
Publisher : Springer Science & Business Media
Total Pages : 527
ISBN-10 : 1461432235
ISBN-13 : 9781461432234

Synopsis of Mining Text Data by Charu C. Aggarwal

Text mining applications have experienced tremendous advances because of Web 2.0 and social networking applications. Recent advances in hardware and software technology have led to a number of unique scenarios where text mining algorithms are learned. Mining Text Data introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks and data mining. The book covers a wide swath of topics across social networks and data mining. Each chapter contains a comprehensive survey including the key research content on the topic and the future directions of research in the field. There is a special focus on text embedded with heterogeneous and multimedia data, which makes the mining process much more challenging. A number of methods, such as transfer learning and cross-lingual mining, have been designed for such cases. Mining Text Data simplifies the content so that advanced-level students, practitioners and researchers in computer science can benefit from this book. Academic and corporate libraries, as well as ACM, IEEE, and Management Science focused on information security, electronic commerce, databases, data mining, machine learning, and statistics are the primary buyers for this reference book.

Clustering Algorithms

Author : John A. Hartigan
Publisher : John Wiley & Sons
Total Pages : 374
Identifier : UOM:39015016356829

Feature Selection and Enhanced Krill Herd Algorithm for Text Document Clustering

Author : Laith Mohammad Qasim Abualigah
Publisher : Springer
Total Pages : 186
ISBN-10 : 3030106748
ISBN-13 : 9783030106744

Synopsis of Feature Selection and Enhanced Krill Herd Algorithm for Text Document Clustering by Laith Mohammad Qasim Abualigah

This book puts forward a new method for solving the text document (TD) clustering problem, which is established in two main stages: (i) A new feature selection method based on a particle swarm optimization algorithm with a novel weighting scheme is proposed, as well as a detailed dimension reduction technique, in order to obtain a new subset of more informative features in a low-dimensional space. This new subset is subsequently used to improve the performance of the text clustering (TC) algorithm and reduce its computation time. The k-means clustering algorithm is used to evaluate the effectiveness of the obtained subsets. (ii) Four krill herd algorithms (KHAs), namely, the (a) basic KHA, (b) modified KHA, (c) hybrid KHA, and (d) multi-objective hybrid KHA, are proposed to solve the TC problem; each algorithm represents an incremental improvement on its predecessor. For the evaluation process, seven benchmark text datasets with different characteristics and complexities are used. Text document (TD) clustering is a new trend in text mining in which the TDs are separated into several coherent clusters, where all documents in the same cluster are similar. The findings presented here confirm that the proposed methods and algorithms delivered the best results in comparison with other, similar methods to be found in the literature.

Machine Learning Techniques for Multimedia

Author : Matthieu Cord
Publisher : Springer Science & Business Media
Total Pages : 297
ISBN-10 : 3540751718
ISBN-13 : 9783540751717

Synopsis of Machine Learning Techniques for Multimedia by Matthieu Cord

Processing multimedia content has emerged as a key area for the application of machine learning techniques, where the objectives are to provide insight into the domain from which the data is drawn, to organize that data, and to improve the performance of the processes manipulating it. Arising from the EU MUSCLE network, this multidisciplinary book provides comprehensive coverage of the most important machine learning techniques used and their application in this domain.

Finding Groups in Data

Author : Leonard Kaufman
Publisher : Wiley-Interscience
Total Pages : 376
Identifier : UCSD:31822005118112

Synopsis of Finding Groups in Data by Leonard Kaufman

Partitioning around medoids (Program PAM). Clustering large applications (Program CLARA). Fuzzy analysis (Program FANNY). Agglomerative Nesting (Program AGNES). Divisive analysis (Program DIANA). Monothetic analysis (Program MONA). Appendix.