请输入您要查询的百科知识:

 

词条 Single system image
释义

  1. Features of SSI clustering systems

     Process migration  Process checkpointing  Single process space  Single root  Single I/O space  Single IPC space  Cluster IP address 

  2. Examples

  3. See also

  4. Notes

  5. References

In distributed computing, a single system image (SSI) cluster is a cluster of machines that appears to be one single system.[1][2][3] The concept is often considered synonymous with that of a distributed operating system,[4][5] but a single image may be presented for more limited purposes, just job scheduling for instance, which may be achieved by means of an additional layer of software over conventional operating system images running on each node.[6] The interest in SSI clusters is based on the perception that they may be simpler to use and administer than more specialized clusters.

Different SSI systems may provide a more or less complete illusion of a single system.

Features of SSI clustering systems

Different SSI systems may, depending on their intended usage, provide some subset of these features.

Process migration

{{main|Process migration}}

Many SSI systems provide process migration.[7]

Processes may start on one node and be moved to another node, possibly for resource balancing or administrative reasons.[8] As processes are moved from one node to another, other associated resources (for example IPC resources) may be moved with them.

Process checkpointing

Some SSI systems allow checkpointing of running processes, allowing their current state to be saved and reloaded at a later date.[9]

Checkpointing can be seen as related to migration, as migrating a process from one node to another can be implemented by first checkpointing the process, then restarting it on another node. Alternatively checkpointing can be considered as migration to disk.

Single process space

Some SSI systems provide the illusion that all processes are running on the same machine - the process management tools (e.g. "ps", "kill" on Unix like systems) operate on all processes in the cluster.

Single root

Most SSI systems provide a single view of the file system. This may be achieved by a simple NFS server, shared disk devices or even file replication.

The advantage of a single root view is that processes may be run on any available node and access needed files with no special precautions. If the cluster implements process migration a single root view enables direct accesses to the files from the node where the process is currently running.

Some SSI systems provide a way of "breaking the illusion", having some node-specific files even in a single root. HP TruCluster provides a "context dependent symbolic link" (CDSL) which points to different files depending on the node that accesses it. HP VMScluster provides a search list logical name with node specific files occluding cluster shared files where necessary. This capability may be necessary to deal with heterogeneous clusters, where not all nodes have the same configuration. In more complex configurations such as multiple nodes of multiple architectures over multiple sites, several local disks may combine to form the logical single root.

Single I/O space

Some SSI systems allow all nodes to access the I/O devices (e.g. tapes, disks, serial lines and so on) of other nodes. There may be some restrictions on the kinds of accesses allowed (For example, OpenSSI can't mount disk devices from one node on another node).

Single IPC space

Some SSI systems allow processes on different nodes to communicate using inter-process communications mechanisms as if they were running on the same machine. On some SSI systems this can even include shared memory (can be emulated with Software Distributed shared memory).

In most cases inter-node IPC will be slower than IPC on the same machine, possibly drastically slower for shared memory. Some SSI clusters include special hardware to reduce this slowdown.

Cluster IP address

Some SSI systems provide a "cluster address", a single address visible from outside the cluster that can be used to contact the cluster as if it were one machine. This can be used for load balancing inbound calls to the cluster, directing them to lightly loaded nodes, or for redundancy, moving the cluster address from one machine to another as nodes join or leave the cluster.[10]

Examples

Examples here vary from commercial platforms with scaling capabilities, to packages/frameworks for creating distributed systems, as well as those that actually implement a single system image.

 !Name !Process migration !Process checkpoint !Single process space !Single root !Single I/O space !Single IPC space !Cluster IP address[11]  !Source Model !Latest release date[12] !Supported OS 
SSI Properties of different clustering systems
Amoeba[13] {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Unk}} {{Yes}} {{Unk}} Open}} {{dts|1996|07|30}}}} Native
AIX TCF {{Unk}} {{Unk}} {{Unk}} {{Yes}} {{Unk}} {{Unk}} {{Unk}} Closed}} {{dts|1990|03|30}}}}[14] AIX PS/2 1.2
HP Guardian {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Closed}} {{dts|2018}}} Native, OSS
Inferno {{No}} {{No}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Unk}} Open}} {{dts|2015|03|04}}}} Native, Windows, Irix, Linux, OS X, FreeBSD, Solaris, Plan 9
Kerrighed {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Unk}} {{Yes}} {{Unk}} Open}} {{dts|2010|06|14}}}} Linux 2.6.30
LinuxPMI[15] {{Yes}} {{Yes}} {{No}} {{Yes}} {{No}} {{No}} {{Unk}} Open}} {{dts|2006|06|18}}}} Linux 2.6.17
LOCUS[16] {{Yes}} {{Unk}} {{Yes}} {{Yes}} {{Yes}} Yes[17]}} {{Unk}} Closed}} {{dts|1988}}}} Native
MOSIX {{Yes}} {{Yes}} {{No}} {{Yes}} {{No}} {{No}} {{Unk}} Closed}} {{dts|2016|05|16}}}} Linux
openMosix[18] {{Yes}} {{Yes}} {{No}} {{Yes}} {{No}} {{No}} {{Unk}} Open}} {{dts|2004|12|10}}}} Linux 2.4.26
Open-Sharedroot[19] {{No}} {{No}} {{No}} {{Yes}} {{No}} {{No}} {{Yes}} Open}} {{dts|2011|09|01}}}}[20] Linux
OpenSSI {{Yes}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Open}} {{dts|2010|02|18}}}} Linux 2.6.10 (Debian, Fedora)
Plan 9 {{No}}[21] {{No}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Open}} {{dts|2015|01|09}}}} Native
Sprite {{Yes}} {{Unk}} {{No}} {{Yes}} {{Yes}} {{No}} {{Unk}} Open}} {{dts|1992}}}} Native
TidalScale {{Yes}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Closed}} {{dts|2018|04|12}}}} As a Guest: Linux, FreeBSD
TruCluster {{No}} {{Unk}} {{No}} {{Yes}} {{No}} {{No}} {{Yes}} Closed}} {{dts|2010|10|1}}}} Tru64
VMScluster {{No}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Closed}} {{dts|2017|7|10}}}} OpenVMS
z/VM {{Yes}} {{No}} {{Yes}} {{No}} {{No}} {{Yes}} {{Unk}} Closed}} {{dts|2016|11|11}}}} Native
UnixWare NonStop Clusters[22] {{Yes}} {{No}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} {{Yes}} Closed}} {{dts|2000|6|}}}} UnixWare
1. ^{{Citation | last = Pfister | first = Gregory F. | year = 1998 | title = In search of clusters | isbn = 978-0-13-899709-0 | publisher = Prentice Hall PTR | location = Upper Saddle River, NJ | oclc = 38300954}}
2. ^{{citation |author1=Buyya, Rajkumar |author2=Cortes, Toni |author3=Jin, Hai | year = 2001 | title = Single System Image | journal = International Journal of High Performance Computing Applications | volume = 15 | issue = 2 | pages = 124 | doi = 10.1177/109434200101500205 | url = http://www.buyya.com/papers/SSI-CCWhitePaper.pdf}}
3. ^{{citation |author1=Healy, Philip |author2=Lynn, Theo |author3=Barrett, Enda |author4=Morrison, John P. | year = 2016 | title = Single system image: A survey | journal = Journal of Parallel and Distributed Computing | volume = 90-91 | pages = 35–51 | doi = 10.1016/j.jpdc.2016.01.004 | url = https://www.researchgate.net/publication/295253720}}
4. ^{{Citation | title = Distributed systems: concepts and design | year = 2005 |author1=Coulouris, George F |author2=Dollimore, Jean |author3=Kindberg, Tim | isbn = 978-0-321-26354-4 | publisher = Addison Wesley | page = 223 | url = https://books.google.com/books?id=d63sQPvBezgC&pg=PA223&dq=%22distributed+operating+system%22&cd=3#v=onepage&q=%22distributed%20operating%20system%22&f=false}}
5. ^{{Citation |author1=Bolosky, William J. |author2=Draves, Richard P. |author3=Fitzgerald, Robert P. |author4=Fraser, Christopher W. |author5=Jones, Michael B. |author6=Knoblock, Todd B. |author7=Rashid, Rick | contribution = Operating System Directions for the Next Millennium | title = 6th Workshop on Hot Topics in Operating Systems (HotOS-VI) | place = Cape Cod, MA | pages = 106–110 | date = 1997-05-05| doi = 10.1109/HOTOS.1997.595191 | citeseerx = 10.1.1.50.9538 |isbn=978-0-8186-7834-9 }}
6. ^{{Citation | title=Grid And Cluster Computing | author= Prabhu, C.S.R. | isbn=978-81-203-3428-1 | publisher=Phi Learning | year=2009 | pages=256 | url=https://books.google.com/books?id=EIVdVtGHv-0C&pg=PA177&dq=%22distributed+operating+system%22+%22single+system+image%22&cd=14#v=onepage&q=%22distributed%20operating%20system%22%20%22single%20system%20image%22&f=false}}
7. ^{{citation | last = Smith | first = Jonathan M. | year = 1988 | title = A survey of process migration mechanisms | journal = ACM SIGOPS Operating Systems Review | volume = 22 | issue = 3 | pages = 28–40 | doi = 10.1145/47671.47673 | url = http://www.cis.upenn.edu/~jms/svy-pm.pdf| citeseerx = 10.1.1.127.8095 }}
8. ^for example it may be necessary to move long running processes off a node that is to be closed down for maintenance
9. ^Checkpointing is particularly useful in clusters used for high-performance computing, avoiding lost work in case of a cluster or node restart.
10. ^"leaving a cluster" is often a euphemism for crashing
11. ^Many of the Linux based SSI clusters can use the Linux Virtual Server to implement a single cluster IP address
12. ^Green means software is actively developed
13. ^Amoeba development is carried forward by Dr. Stefan Bosse at BSS Lab {{webarchive|url=https://web.archive.org/web/20090203124419/http://bsslab.de/english/index.html |date=2009-02-03 }}
14. ^{{cite web|url=http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?infotype=OC&subtype=NA&htmlfid=897/ENUS5706-297&appname=totalstorage |title=AIX PS/2 OS}}
15. ^LinuxPMI is a successor to openMosix
16. ^LOCUS was used to create IBM AIX TCF
17. ^LOCUS used named pipes for IPC
18. ^openMosix was a fork of MOSIX
19. ^Open-Sharedroot is a shared root Cluster from ATIX
20. ^{{cite web|url=https://github.com/Open-Sharedroot?tab=repositories |title=Open-Sharedroot GitHub repository}}
21. ^{{Citation | last1 = Pike | first1 = Rob | last2 = Presotto | first2 = Dave | last3 = Thompson | first3 = Ken | last4 = Trickey | first4 = Howard | contribution = Plan 9 from Bell Labs | series = In Proceedings of the Summer 1990 UKUUG Conference | pages = 8 | year = 1990 | quote = Process migration is also deliberately absent from Plan 9. }}
22. ^UnixWare NonStop Clusters was a base for OpenSSI

See also

  • Computer clusters
  • Diskless Shared Root Cluster
  • Distributed lock manager
  • Distributed cache
  • Parallel Virtual Machine - multiple system image alternative
  • Message Passing Interface - multiple system image alternative

Notes

References

{{DEFAULTSORT:Single System Image}}

1 : Cluster computing

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/13 21:39:47