NII Technical Report (NII-2006-008E)：A Generic Query-Based Model for Scalable Clustering

Houle, Michael E.


https://doi.org/10.20736/0000001226

File: NII Technical Report (NII-2006-008E)：A Generic Query-Based Model for Scalable Clustering (304.1 kB)

Item type

Report

Release date

2019-03-12

Title

NII Technical Report (NII-2006-008E)：A Generic Query-Based Model for Scalable Clustering

Language

eng

Keywords

Subject scheme

Other

Subject

Technical Report

Resource type

Resource

http://purl.org/coar/resource_type/c_6501

Type

departmental bulletin paper

ID registration

10.20736/0000001226

ID registration type

JaLC

Author

Houle, Michael E.

Abstract

This paper presents a generic model for clustering that requires no direct knowledge of the nature or representation of the data. In lieu of such knowledge, the relevant-set clustering (RSC) model relies solely on the existence of an oracle that accepts a query in the form of a data item, and returns a ranked set of items relevant to the query. In principle, the role of the oracle could be played by any similarity search structure, or even a commercial search engine whose ranking function and relevancy scores are kept secret. The quality of cluster candidates, the degree of association between pairs of cluster candidates, and the degree of association between clusters and data items are all assessed according to the statistical significance of a form of correlation among pairs of relevant sets and/or candidate cluster sets. A scalable clustering heuristic based on the RSC model is also presented, and demonstrated for very large, high-dimensional datasets using a fast approximate similarity search structure as the oracle.
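The oracle-based setup described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the report: the names `make_knn_oracle`, `set_correlation`, and `candidate_quality` are hypothetical, the oracle is a brute-force nearest-neighbor ranking over points, and the Pearson correlation of set-membership vectors stands in for the "form of correlation" the abstract mentions. The statistical significance testing that the abstract refers to is not modeled here.

```python
import math
from typing import Callable, List, Sequence, Set

Point = Sequence[float]
Oracle = Callable[[int, int], List[int]]


def make_knn_oracle(data: List[Point]) -> Oracle:
    """Build a hypothetical oracle: given an item index and k, return the
    indices of the k items ranked closest to it (brute force, Euclidean)."""
    def oracle(query: int, k: int) -> List[int]:
        q = data[query]
        ranked = sorted(range(len(data)), key=lambda i: math.dist(q, data[i]))
        return ranked[:k]
    return oracle


def set_correlation(a: Set[int], b: Set[int], n: int) -> float:
    """Pearson correlation of the 0/1 membership vectors of sets a and b
    over a universe of n items (an assumed stand-in measure)."""
    denom = math.sqrt(len(a) * (n - len(a)) * len(b) * (n - len(b)))
    if denom == 0:
        return 0.0  # undefined when a set is empty or covers all items
    return (n * len(a & b) - len(a) * len(b)) / denom


def candidate_quality(candidate: Set[int], oracle: Oracle, n: int) -> float:
    """Average correlation between a candidate cluster and the relevant
    sets (of the same size) returned by the oracle for its members."""
    k = len(candidate)
    total = sum(set_correlation(candidate, set(oracle(v, k)), n)
                for v in candidate)
    return total / k


# Two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
oracle = make_knn_oracle(points)
coherent = candidate_quality({0, 1, 2}, oracle, len(points))
mixed = candidate_quality({0, 1, 3}, oracle, len(points))
```

Note that the clustering side of the sketch only ever sees item indices and the ranked lists returned by the oracle, never the coordinates themselves, which is the property the abstract emphasizes. A candidate drawn from a single group scores higher than one that mixes the two groups.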

Bibliographic information

NII Technical Report (ja: NIIテクニカル・レポート)

p. 1-21, issued 2006-05-19

Publisher

National Institute of Informatics (国立情報学研究所)

ISSN

1346-5597


Versions

Ver.1

2021-03-01 06:05:34.419480
