ML p(r)ior | Biclustering Readings and Manuscripts via Non-negative Matrix Factorization, with Application to the Text of Jude

Biclustering Readings and Manuscripts via Non-negative Matrix Factorization, with Application to the Text of Jude

2016-02-03
1602.01323 | cs.LG
The text-critical practice of grouping witnesses into families or texttypes often faces two obstacles: Contamination in the manuscript tradition, and co-dependence in identifying characteristic readings and manuscripts. We introduce non-negative matrix factorization (NMF) as a simple, unsupervised, and efficient way to cluster large numbers of manuscripts and readings simultaneously while summarizing contamination using an easy-to-interpret mixture model. We apply this method to an extensive collation of the New Testament epistle of Jude and show that the resulting clusters correspond to human-identified textual families from existing research.
PDF

Highlights - Most important sentences from the article

Login to like/save this paper, take notes and configure your recommendations

Related Articles

2015-09-30
1510.00012 | stat.CO

In a variety of research areas, the weighted bag of vectors and the histogram are widely used descri… show more
PDF

Highlights - Most important sentences from the article

2016-12-27

Clustering is a widely used unsupervised learning method for finding structure in the data. However,… show more
PDF

Highlights - Most important sentences from the article

2017-02-23

Topic models can provide us with an insight into the underlying latent structure of a large corpus o… show more
PDF

Highlights - Most important sentences from the article

2012-07-08
1207.1847 | cs.CL

The statistical methods derived and described in this thesis provide new ways to elucidate the struc… show more
PDF

Highlights - Most important sentences from the article

2018-04-25

Parameter pruning is a promising approach for CNN compression and acceleration by eliminating redund… show more
PDF

Highlights - Most important sentences from the article

2014-04-16

Topic modeling refers to the task of discovering the underlying thematic structure in a text corpus,… show more
PDF

Highlights - Most important sentences from the article

2018-04-08
1804.02744 | stat.ML

Gaussian Mixture Models are one of the most studied and mature models in unsupervised learning. Howe… show more
PDF

Highlights - Most important sentences from the article

2017-01-05

The problem of outlier detection is extremely challenging in many domains such as text, in which the… show more
PDF

Highlights - Most important sentences from the article

2015-04-01

Unsupervised clustering of curves according to their shapes is an important problem with broad scien… show more
PDF

Highlights - Most important sentences from the article

2019-04-15
1904.07701 | stat.ML

Diverse applications - particularly in tumour subtyping - have demonstrated the importance of integr… show more
PDF

Highlights - Most important sentences from the article

2018-11-06
1811.02456 | cs.CL

The abundance of text data being produced in the modern age makes it increasingly important to intui… show more
PDF

Highlights - Most important sentences from the article

2018-10-27

Time series clustering is the process of grouping time series with respect to their similarity or ch… show more
PDF

Highlights - Most important sentences from the article

2019-01-13
1901.03919 | stat.ML

In this paper, we solve a semi-supervised regression problem. Due to the lack of knowledge about the… show more
PDF

Highlights - Most important sentences from the article

2019-04-21
1904.09609 | stat.ML

The $K$-means algorithm is extended to allow for partitioning of skewed groups. Our algorithm is cal… show more
PDF

Highlights - Most important sentences from the article

2019-04-17

Cluster analysis plays a very important role in data analysis. In these years, cluster ensemble, as … show more
PDF

Highlights - Most important sentences from the article

2018-11-26

For more than a decade, graphs have been used to model the voting behavior taking place in parliamen… show more
PDF

Highlights - Most important sentences from the article