“Lowest common ancestor”的意思、由来-开放百科全书

词条

Lowest common ancestor

释义

History
Extension to directed acyclic graphs
Applications
See also
References
External links

{{about|lowest common ancestors in graph theory and computer science|the common ancestor of a set of species in evolutionary trees|most recent common ancestor|a common ancestor of all life forms|last universal common ancestor}}

In graph theory and computer science, the lowest common ancestor (LCA) of two nodes {{mvar|v}} and {{mvar|w}} in a tree or directed acyclic graph (DAG) {{mvar|T}} is the lowest (i.e. deepest) node that has both {{mvar|v}} and {{mvar|w}} as descendants, where we define each node to be a descendant of itself (so if {{mvar|v}} has a direct connection from {{mvar|w}}, {{mvar|w}} is the lowest common ancestor).

The LCA of {{mvar|v}} and {{mvar|w}} in {{mvar|T}} is the shared ancestor of {{mvar|v}} and {{mvar|w}} that is located farthest from the root. Computation of lowest common ancestors may be useful, for instance, as part of a procedure for determining the distance between pairs of nodes in a tree: the distance from {{mvar|v}} to {{mvar|w}} can be computed as the distance from the root to {{mvar|v}}, plus the distance from the root to {{mvar|w}}, minus twice the distance from the root to their lowest common ancestor {{harv|Djidjev|Pantziou|Zaroliagis|1991}}. In ontologies, the lowest common ancestor is also known as the least common ancestor.

In a tree data structure where each node points to its parent, the lowest common ancestor can be easily determined by finding the first intersection of the paths from {{mvar|v}} and {{mvar|w}} to the root. In general, the computational time required for this algorithm is {{mvar|O(h)}} where {{mvar|h}} is the height of the tree (length of longest path from a leaf to the root). However, there exist several algorithms for processing trees so that lowest common ancestors may be found more quickly. Tarjan's off-line lowest common ancestors algorithm, for example, preprocesses a tree in linear time to provide constant-time LCA queries. In general DAGs, similar algorithms exist, but with super-linear complexity.

History

The lowest common ancestor problem was defined by {{harvs|last1=Aho|first1=Alfred|author1-link=Alfred Aho|last2=Hopcroft|first2=John|author2-link=John Hopcroft|last3=Ullman|first3=Jeffrey|author3-link=Jeffrey Ullman|year=1973|txt}}, but {{harvs|first1=Dov|last1=Harel|first2=Robert|last2=Tarjan|author2-link=Robert Tarjan|txt|year=1984}} were the first to develop an optimally efficient lowest common ancestor data structure. Their algorithm processes any tree in linear time, using a heavy path decomposition, so that subsequent lowest common ancestor queries may be answered in constant time per query. However, their data structure is complex and difficult to implement. Tarjan also found a simpler but less efficient algorithm, based on the union-find data structure, for computing lowest common ancestors of an offline batch of pairs of nodes.

{{harvs|first1=Baruch|last1=Schieber|first2=Uzi|last2=Vishkin|author2-link=Uzi Vishkin|txt|year=1988}} simplified the data structure of Harel and Tarjan, leading to an implementable structure with the same asymptotic preprocessing and query time bounds. Their simplification is based on the principle that, in two special kinds of trees, lowest common ancestors are easy to determine: if the tree is a path, then the lowest common ancestor can be computed simply from the minimum of the levels of the two queried nodes, while if the tree is a complete binary tree, the nodes may be indexed in such a way that lowest common ancestors reduce to simple binary operations on the indices. The structure of Schieber and Vishkin decomposes any tree into a collection of paths, such that the connections between the paths have the structure of a binary tree, and combines both of these two simpler indexing techniques.{{harvs|first1=Omer|last1=Berkman|first2=Uzi|last2=Vishkin|txt|year=1993}} discovered a completely new way to answer lowest common ancestor queries, again achieving linear preprocessing time with constant query time. Their method involves forming an Euler tour of a graph formed from the input tree by doubling every edge, and using this tour to write a sequence of level numbers of the nodes in the order the tour visits them; a lowest common ancestor query can then be transformed into a query that seeks the minimum value occurring within some subinterval of this sequence of numbers. They then handle this range minimum query problem by combining two techniques, one technique based on precomputing the answers to large intervals that have sizes that are powers of two, and the other based on table lookup for small-interval queries. This method was later presented in a simplified form by {{harvs|first1=Michael|last1=Bender|first2=Martin|last2=Farach-Colton|author2-link=Martin Farach-Colton|txt|year=2000}}. As had been previously observed by {{harvtxt|Gabow|Bentley|Tarjan|1984}}, the range minimum problem can in turn be transformed back into a lowest common ancestor problem using the technique of Cartesian trees.

Further simplifications were made by {{harvtxt|Alstrup|Gavoille|Kaplan|Rauhe|2004}} and {{harvtxt|Fischer|Heun|2006}}.

A variant of the problem is the dynamic LCA problem in which the data structure should be prepared to handle LCA queries intermixed with operations that change the tree (that is, rearrange the tree by adding and removing edges) This variant can be solved using O(logN) time for all modifications and queries. This is done by maintaining the forest using the dynamic trees data structure with partitioning by size; this then maintains a heavy-light decomposition of each tree, and allows LCA queries to be carried out in logarithmic time in the size of the tree.

Without preprocessing you can also improve the naïve online algorithm's computation time to O(log h) by storing the paths through the tree using skew-binary random access lists, while still permitting the tree to be extended in constant time ([https://www.fpcomplete.com/user/edwardk/online-lca Edward Kmett (2012)]). This allows LCA queries to be carried out in logarithmic time in the height of the tree.

Extension to directed acyclic graphs

While originally studied in the context of trees, the notion of lowest common ancestors can be defined for directed acyclic graphs (DAGs), using either of two possible definitions. In both, the edges of the DAG are assumed to point from parents to children.

Given {{math|G {{=}} (V, E)}}, {{harvtxt|Aït-Kaci|Boyer|Lincoln|Nasr|1989}} define a poset {{math|(V, ≤)}} such that {{mvar|x ≤ y}} iff {{mvar|x}} is in the reflexive transitive closure of {{mvar|y}}. The lowest common ancestors of {{mvar|x}} and {{mvar|y}} are then the minimum elements under ≤ of the common ancestor set {{math|{z ∈ V {{!}} x ≤ z and y ≤ z}}}.
{{harvtxt|Bender|Farach-Colton|Pemmasani|Skiena|2005}} gave an equivalent definition, where the lowest common ancestors of {{mvar|x}} and {{mvar|y}} are the nodes of out-degree zero in the subgraph of {{mvar|G}} induced by the set of common ancestors of {{mvar|x}} and {{mvar|y}}.

In a tree, the lowest common ancestor is unique; in a DAG of {{mvar|n}} nodes, each pair of nodes may have as much as {{math|n-2}} LCAs {{harv|Bender|Farach-Colton|Pemmasani|Skiena|2005}}, while the existence of an LCA for a pair of nodes is not even guaranteed in arbitrary connected DAGs.

A brute-force algorithm for finding lowest common ancestors is given by {{harvtxt|Aït-Kaci|Boyer|Lincoln|Nasr|1989}}: find all ancestors of {{mvar|x}} and {{mvar|y}}, then return the maximum element of the intersection of the two sets. Better algorithms exist that, analogous to the LCA algorithms on trees, preprocess a graph to enable constant-time LCA queries. The problem of LCA existence can be solved optimally for sparse DAGs by means of an {{math|O({{!}}V{{!}}{{!}}E{{!}})}} algorithm due to {{harvtxt|Kowaluk|Lingas|2005}}.

{{harvtxt|Dash|Scholz|Herhut|Christianson|2013}} present a unified framework for pre-processing Directed Acyclic Graphs (DAGs) to compute the least upper bound of two vertices in constant time. Their framework uses adaptive pre-processing where the asymptotic costs depend on the number of non-tree (cross) edges in the DAG. Consequently, it is possible to achieve near-linear pre-processing times for sparse DAGs with constant time LCA look-ups. A library of their work is available for public use at.^[1]{{harvtxt|Couto|Francisco|Silva|Mário|2011}} present DiShIn a method that extends the computation of the LCA to all disjunctive ancestors in semantic similarity measures. He defines an ancestor as disjunctive if the difference between the number of distinct paths from the two nodes to that ancestor is different from using any lower common ancestor.

Applications

The problem of computing lowest common ancestors of classes in an inheritance hierarchy arises in the implementation of object-oriented programming systems {{harv|Aït-Kaci|Boyer|Lincoln|Nasr|1989}}. The LCA problem also finds applications in models of complex systems found in distributed computing {{harv|Bender|Farach-Colton|Pemmasani|Skiena|2005}}.

References

1. ^{{Cite web | url=http://www.herts.ac.uk/dag-pre-processing-downloadable-code/source-code | title=Try our source code for free}}

{{citation|last1=Aho|first1=Alfred|author1-link=Alfred Aho|last2=Hopcroft|first2=John|author2-link=John Hopcroft|last3=Ullman|first3=Jeffrey|author3-link=Jeffrey Ullman|year=1973|contribution=On finding lowest common ancestors in trees|title=Proc. 5th ACM Symp. Theory of Computing (STOC)|pages=253–265|doi=10.1145/800125.804056|title-link=Symposium on Theory of Computing}}.
{{citation |first=H. |last=Aït-Kaci |first2=R. |last2=Boyer |first3=P. |last3=Lincoln |first4=R. |last4=Nasr |title=Efficient implementation of lattice operations |journal=ACM Transactions on Programming Languages and Systems |volume=11 |issue=1 |pages=115–146 |year=1989 |url=http://hassan-ait-kaci.net/pdf/encoding-toplas-89.pdf |doi=10.1145/59287.59293|citeseerx=10.1.1.106.4911 }}.
{{citation

{{citation

{{citation |last1=Bender |first1=Michael A. |first2=Martín |last2=Farach-Colton|authorlink2 = Martin Farach-Colton |first3=Giridhar |last3=Pemmasani |first4=Steven |last4=Skiena |authorlink4=Steven Skiena |first5=Pavel |last5=Sumazin |title=Lowest common ancestors in trees and directed acyclic graphs |journal=Journal of Algorithms |volume=57 |issue=2 |year=2005 |pages=75–94 |url=http://www.cs.sunysb.edu/~bender/pub/JALG05-daglca.pdf |doi=10.1016/j.jalgor.2005.08.001}}.
{{citation

| last1 = Berkman | first1 = Omer
| last2 = Vishkin | first2 = Uzi | author2-link = Uzi Vishkin
| title = Recursive Star-Tree Parallel Data Structure
| journal = SIAM Journal on Computing
| volume = 22
| issue = 2
| pages = 221–242
| year = 1993
| doi = 10.1137/0222017| url = http://www.dtic.mil/get-tr-doc/pdf?AD=ADA227803}}.

{{citation

{{citation

{{citation

{{citation

| last1 = Harel | first1 = Dov
| last2 = Tarjan | first2 = Robert E. | author2-link = Robert Tarjan
| title = Fast algorithms for finding nearest common ancestors
| journal = SIAM Journal on Computing
| volume = 13
| issue = 2
| pages = 338–355
| year = 1984
| doi = 10.1137/0213024}}.

{{citation

{{citation

{{citation

| last1 = Schieber | first1 = Baruch
| last2 = Vishkin | first2 = Uzi | author2-link = Uzi Vishkin
| title = On finding lowest common ancestors: simplification and parallelization
| journal = SIAM Journal on Computing
| volume = 17
| pages = 1253–1262
| year = 1988
| doi = 10.1137/0217079
| issue = 6}}.

{{citation

External links

Lowest Common Ancestor of a Binary Search Tree, by Kamal Rawat
Python implementation of the algorithm of Bender and Farach-Colton for trees, by David Eppstein
[https://github.com/networkx/networkx/blob/master/networkx/algorithms/lowest_common_ancestors.py Python implementation for arbitrary directed acyclic graphs]
Lecture notes on LCAs from a 2003 MIT Data Structures course. Course by Erik Demaine, notes written by Loizos Michael and Christos Kapoutsis. Notes from 2007 offering of same course, written by Alison Cichowlas.
Lowest Common Ancestor in Binary Trees in C. A simplified version of the Schieber–Vishkin technique that works only for balanced binary trees.
[https://web.archive.org/web/20110720050818/http://stanford-online.stanford.edu/knuth/071203-knuth-300.asx Video] of Donald Knuth explaining the Schieber–Vishkin technique
Range Minimum Query and Lowest Common Ancestor article in Topcoder
Documentation for the lca package for Haskell by Edward Kmett, which includes the skew-binary random access list algorithm. Purely functional data structures for on-line LCA slides for the same package.

2 : Theoretical computer science|Trees (graph theory)

随便看

开放百科全书收录14589846条英语、德语、日语等多语种百科知识，基本涵盖了大多数领域的百科知识，是一部内容自由、开放的电子版国际百科全书。

History

Extension to directed acyclic graphs

Applications

See also

References

External links