Vantage-point tree

A vantage-point tree is a metric tree that segregates data in a metric space by choosing a position in the space and partitioning the data points into two parts: those points that are nearer to the vantage point than a threshold, and those points that are not. By recursively applying this procedure to partition the data into smaller and smaller sets, a tree data structure is created where neighbors in the tree are likely to be neighbors in the space.
One generalization is called a multi-vantage-point tree, or MVP tree: a data structure for indexing objects from large metric spaces for similarity search queries. It uses more than one point to partition each level.

History

Peter Yianilos claimed that the vantage-point tree was discovered independently by him
and by Jeffrey Uhlmann.
Yet, Uhlmann published this method before Yianilos in 1991.
Uhlmann called the data structure a metric tree, the name VP-tree was
proposed by Yianilos.
Vantage-point trees have been generalized to non-metric spaces using Bregman divergences by Nielsen et al.
This iterative partitioning process is similar to that of a -d tree, but uses circular rather than rectilinear partitions. In two-dimensional Euclidean space, this can be visualized as a series of circles segregating the data.
The vantage-point tree is particularly useful in dividing data in a non-standard metric space into a metric tree.

Understanding a vantage-point tree

The way a vantage-point tree stores data can be represented by a circle. First, understand that each node of this tree contains an input point and a radius. All the left children of a given node are the points inside the circle and all the right children of a given node are outside of the circle. The tree itself does not need to know any other information about what is being stored. All it needs is the distance function that satisfies the properties of the metric space.

Searching through a vantage-point tree

A vantage-point tree can be used to find the nearest neighbor of a point. The search algorithm is recursive. At any given step we are working with a node of the tree that has a vantage point and a threshold distance . The point of interest will be some distance from the vantage point. If that distance is less than then use the algorithm recursively to search the subtree of the node that contains the points closer to the vantage point than the threshold ; otherwise recurse to the subtree of the node that contains the points that are farther than the vantage point than the threshold. If the recursive use of the algorithm finds a neighboring point with distance to that is less than then it cannot help to search the other subtree of this node; the discovered node is returned. Otherwise, the other subtree also needs to be searched recursively.
A similar approach works for finding the nearest neighbors of a point. In the recursion, the other subtree is searched for nearest neighbors of the point whenever only of the nearest neighbors found so far have distance that is less than.

Advantages of a vantage-point tree

Instead of inferring multidimensional points for domain before the index being built, we build the index directly based on the distance. Doing this, avoids pre-processing steps.
Updating a vantage-point tree is relatively easy compared to the fast-map approach. For fast maps, after inserting or deleting data, there will come a time when fast-map will have to rescan itself. That takes up too much time and it is unclear to know when the rescanning will start.
Distance based methods are flexible. It is “able to index objects that are represented as feature vectors of a fixed number of dimensions."
Complexity

The time cost to build a Vantage-Point tree is approximately. For each element, the tree is descended by levels to find its placement. However there is a constant factor where is the number of vantage points per tree node.
The time cost to search a Vantage-Point tree to find a single nearest neighbor is. There are levels, each involving distance calculations, where is the number of vantage points at that position in the tree.
The time cost to search a Vantage-Point tree for a range, which may be the most important attribute, can vary greatly depending on the specifics of the algorithm used and parameters. Brin's paper gives the result of experiments with several vantage point algorithms with various parameters to investigate the cost, measured in number of distance calculations.
The space cost for a Vantage-Point tree is approximately. Each element is stored, and each tree element in each non-leaf node requires a pointer to its descendant nodes.
Note that some metric space tools require a matrix of pair-wise distance values, costing, but that is not required with Vantage-Point trees.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Vantage-point tree

History

Understanding a vantage-point tree

Searching through a vantage-point tree

Advantages of a vantage-point tree

Complexity