Difference between revisions of "UMBEL - Annex G"

From UMBEL Wiki
Jump to navigation Jump to search
Line 147: Line 147:
 
| valign="top" |Animals, AreaRegion, Chemistry, Facilities
 
| valign="top" |Animals, AreaRegion, Chemistry, Facilities
 
|-
 
|-
| valign="top" |'''Time-related'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Time-related'''
| valign="top" |'''Activities'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Activities'''
| valign="top" |These are ongoing activities that result (mostly) from human effort, often conducted by organizations to assist other organizations or individuals (in which case they are known as services, such as medicine, law, printing, consulting or teaching) or individual or group efforts for leisure, fun, sports, games or personal interests (activities).<br /><br />Generic, broad grouping of actions that apply to generic objects are also included in this SuperType.
+
| valign="top" style="border-top:1px solid #BBB;" |These are ongoing activities that result (mostly) from human effort, often conducted by organizations to assist other organizations or individuals (in which case they are known as services, such as medicine, law, printing, consulting or teaching) or individual or group efforts for leisure, fun, sports, games or personal interests (activities).<br /><br />Generic, broad grouping of actions that apply to generic objects are also included in this SuperType.
| valign="top" |AtomsElements, AudioInfo, BiologicalProcesses, Chemistry, Diseases, Events, Products, StructuredInfo
+
| valign="top" style="border-top:1px solid #BBB;" |AtomsElements, AudioInfo, BiologicalProcesses, Chemistry, Diseases, Events, Products, StructuredInfo
 
|-
 
|-
 
|
 
|
Line 162: Line 162:
 
| valign="top" |Activities, Events
 
| valign="top" |Activities, Events
 
|-
 
|-
| bgcolor="silver" valign="top" | Natural Matter
+
| bgcolor="silver" valign="top" style="border-top:1px solid #BBB;" | '''Natural Matter'''
| valign="top" |'''Atoms and Elements'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Atoms and Elements'''
| valign="top" |The Atoms and Elements SuperType contains all known chemical elements and the constituents of atoms.
+
| valign="top" style="border-top:1px solid #BBB;" |The Atoms and Elements SuperType contains all known chemical elements and the constituents of atoms.
| valign="top" |Activities, Chemistry, NaturalSubstances, Products
+
| valign="top" style="border-top:1px solid #BBB;" |Activities, Chemistry, NaturalSubstances, Products
 
|-
 
|-
 
| bgcolor="silver" valign="top" |
 
| bgcolor="silver" valign="top" |
Line 177: Line 177:
 
| valign="top" | Activities, Animals, AtomsElements, Drugs, FinanceEconomy, FoodDrink, Forms, NaturalSubstances, OrganicChemistry, Products
 
| valign="top" | Activities, Animals, AtomsElements, Drugs, FinanceEconomy, FoodDrink, Forms, NaturalSubstances, OrganicChemistry, Products
 
|-
 
|-
| valign="top" |'''Organic Matter'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Organic Matter'''
| valign="top" |'''Organic Chemistry'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Organic Chemistry'''
| valign="top" |The Organic Chemistry SuperType is for all chemistry involving carbon, including the biochemistry of living organisms and the materials chemistry (including polymers) of organic compounds such as fossil fuels.
+
| valign="top" style="border-top:1px solid #BBB;" |The Organic Chemistry SuperType is for all chemistry involving carbon, including the biochemistry of living organisms and the materials chemistry (including polymers) of organic compounds such as fossil fuels.
|  valign="top" |Activities, Chemistry, Drugs, FoodDrink, Products, Prokaryotes
+
|  valign="top" style="border-top:1px solid #BBB;" |Activities, Chemistry, Drugs, FoodDrink, Products, Prokaryotes
 
|-
 
|-
 
| <br />
 
| <br />
Line 187: Line 187:
 
| valign="top" |Activities
 
| valign="top" |Activities
 
|-
 
|-
| bgcolor="silver" valign="top" | '''Living Things'''
+
| bgcolor="silver" valign="top" style="border-top:1px solid #BBB;" | '''Living Things'''
| valign="top" |'''Prokaryotes'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Prokaryotes'''
| valign="top" |The Prokaryotes include all prokaryotic organisms, including the Monera, Archaebacteria, Bacteria, and Blue-green algas. Also included in this SuperType are viruses and prions.
+
| valign="top" style="border-top:1px solid #BBB;" |The Prokaryotes include all prokaryotic organisms, including the Monera, Archaebacteria, Bacteria, and Blue-green algas. Also included in this SuperType are viruses and prions.
| valign="top" |OrganicChemistry
+
| valign="top" style="border-top:1px solid #BBB;" |OrganicChemistry
 
|-
 
|-
 
| bgcolor="silver" valign="top" |
 
| bgcolor="silver" valign="top" |
Line 212: Line 212:
 
| valign="top" | Activities, Animals, Events
 
| valign="top" | Activities, Animals, Events
 
|-
 
|-
| valign="top" | '''Agents'''
+
| valign="top" style="border-top:1px solid #BBB;" | '''Agents'''
| valign="top" |'''Persons'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Persons'''
| valign="top" |The appropriate SuperType for all named, individual human beings. This SuperType also includes the assignment of formal, honorific or cultural titles given to specific human individuals. It further includes names given to humans who conduct specific jobs or activities (the latter case is known as an avocation). Examples include steelworker, waitress, lawyer, plumber, artisan. Ethnic groups are specifically included.<br /><br />Persons as living animals are included under the Animals SuperType.
+
| valign="top" style="border-top:1px solid #BBB;" |The appropriate SuperType for all named, individual human beings. This SuperType also includes the assignment of formal, honorific or cultural titles given to specific human individuals. It further includes names given to humans who conduct specific jobs or activities (the latter case is known as an avocation). Examples include steelworker, waitress, lawyer, plumber, artisan. Ethnic groups are specifically included.<br /><br />Persons as living animals are included under the Animals SuperType.
| valign="top" |'''Animals'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Animals'''
 
|-
 
|-
 
|
 
|
Line 227: Line 227:
 
| valign="top" |'''AreaRegion'''
 
| valign="top" |'''AreaRegion'''
 
|-
 
|-
| bgcolor="silver" valign="top" | Artifacts
+
| bgcolor="silver" valign="top" style="border-top:1px solid #BBB;" | '''Artifacts'''
| valign="top" |'''Products'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Products'''
| valign="top" | This is SuperType includes any instance offered for sale or performed as a commercial service. A Product is often a physical object made by humans that is not a conceptual work or a facility, such as vehicles, cars, trains, aircraft, spaceships, ships, foods, beverages, clothes, drugs, weapons.
+
| valign="top" style="border-top:1px solid #BBB;" | This is SuperType includes any instance offered for sale or performed as a commercial service. A Product is often a physical object made by humans that is not a conceptual work or a facility, such as vehicles, cars, trains, aircraft, spaceships, ships, foods, beverages, clothes, drugs, weapons.
| valign="top" |Activities, Animals, AreaRegion, AtomsElements, AudioInfo, Chemistry, Drugs, Facilities, FinanceEconomy, FoodDrink, LocationPlace, NaturalSubstances, OrganicChemistry, Plants, StructuredInfo, VisualInfo, WrittenInfo'''<br />'''
+
| valign="top" style="border-top:1px solid #BBB;" |Activities, Animals, AreaRegion, AtomsElements, AudioInfo, Chemistry, Drugs, Facilities, FinanceEconomy, FoodDrink, LocationPlace, NaturalSubstances, OrganicChemistry, Plants, StructuredInfo, VisualInfo, WrittenInfo'''<br />'''
 
|-
 
|-
 
| bgcolor="silver" valign="top" |
 
| bgcolor="silver" valign="top" |
Line 257: Line 257:
 
| valign="top" |AreaRegion, FinanceEconomy, Forms, LocationPlace, NaturalSubstances, Organizations, Products, VisualInfo
 
| valign="top" |AreaRegion, FinanceEconomy, Forms, LocationPlace, NaturalSubstances, Organizations, Products, VisualInfo
 
|-
 
|-
| valign="top" |'''Information'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Information'''
| valign="top" |'''Audio Info'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Audio Info'''
| valign="top" |This SuperType is for any audio-only human work. Examples include live music performances, record albums, or radio shows or individual radio broadcasts
+
| valign="top" style="border-top:1px solid #BBB;" |This SuperType is for any audio-only human work. Examples include live music performances, record albums, or radio shows or individual radio broadcasts
| valign="top" |Activities, Products
+
| valign="top" style="border-top:1px solid #BBB;" |Activities, Products
 
|-
 
|-
 
|
 
|
Line 277: Line 277:
 
| valign="top" |Activities, Events, FinanceEconomy, Products, VisualInfo, WrittenInfo'''<br />'''
 
| valign="top" |Activities, Events, FinanceEconomy, Products, VisualInfo, WrittenInfo'''<br />'''
 
|-
 
|-
| bgcolor="silver" valign="top" | Social
+
| bgcolor="silver" valign="top" style="border-top:1px solid #BBB;" | Social
| valign="top" |'''Finance & Economy'''
+
| valign="top" style="border-top:1px solid #BBB;" |'''Finance & Economy'''
| valign="top" |This SuperType pertains to all things financial and with respect to the economy, including chartable company performance, stock index entities, money, local currencies, taxes, incomes, accounts and accounting, mortgages and property.
+
| valign="top" style="border-top:1px solid #BBB;" |This SuperType pertains to all things financial and with respect to the economy, including chartable company performance, stock index entities, money, local currencies, taxes, incomes, accounts and accounting, mortgages and property.
| valign="top" |Activities, AreaRegion, Chemistry, Facilities, Organizations, Products, StructuredInfo, VisualInfo, '''WrittenInfo'''
+
| valign="top" style="border-top:1px solid #BBB;" |Activities, AreaRegion, Chemistry, Facilities, Organizations, Products, StructuredInfo, VisualInfo, '''WrittenInfo'''
 
|-
 
|-
 
| bgcolor="silver" valign="top" |
 
| bgcolor="silver" valign="top" |

Revision as of 03:45, 16 April 2016

__NOEDITSECTION__

UMBEL Annex G: UMBEL SuperTypes Documentation

UMBEL Annex Document - 10 May 2016

Latest version
http://techwiki.umbel.org/index.php/UMBEL_-_Annex_G
UMBEL Logo
Last update
$Date: 2016/5/10 9:22:47 $
Version
Version No.: 1.50
Volume
TR 16-5-10-G
Authors
Michael Bergman - Structured Dynamics
Frédérick Giasson - Structured Dynamics

Structured Dynamics Logo|link=

UMBEL: Upper Mapping and Binding Exchange Layer by Structured Dynamics LLC is provided under the
Creative Commons Attribution 3.0 license. See the attribution section for how to cite the effort.

[1]

Copyright © 2009-2016 by Structured Dynamics LLC.

Beginning with UMBEL version 1.20, statistics regarding numbers of reference concepts (RCs) in the ontology and splits between SuperTypes (STs) and modules have been moved to the statistics Annex Z document. As a result, earlier statistics in this and other annexes are no longer being updated, which means any statistics cited below may be out of date. Please consult Annex Z for the current UMBEL statistics.

UPDATES

Any changes to SuperTypes made since their initial introduction in version 0.80, with a subsequent update for version 1.00, are summarized in this section, and may update the general narrative throughout the remainder of this Annex.
  • In version 1.20, these SuperType (ST) changes were made:
    • The Attributes ST was expanded in its role, with the creation of an Attributes Ontology
    • A new Entities ST was added; it is non-disjoint with all STs and is largely orthogonal to the standard ST usage. The Entities ST designation, however, is useful in conjunction with other STs for filtering purposes
    • The Workplace ST was decremented and moved into the Facilities ST
    • The Markets & Industries ST was decremented and moved into the Attributes ST
    • All existing ST assignments were reviewed with mostly minor changes. One major change was a better split between the Activities and Events STs.

INTRODUCTION

This report describes the rationale for the class of SuperTypes within UMBEL and how its 35 K reference concepts (RCs) are assigned to one of a few categories of SuperTypes. This report is an update of the SuperTypes design, first introduced and updated in version 1.50. We discuss five categories of SuperTypes below, with one category, the main disjoint category, being the most important.

The first category of SuperTypes is for non-disjoint types, mostly of a shared or buiding block nature. By design, these SuperTypes participate in little or no reasoning. Most have shared aspects across all SuperTypes. SuperTypes in this category are designed to be fully non-disjoint, and do not participate in any disjoint assertions. There are seven (7) SuperTypes in this first category, specifically Abstractions, Concepts, Conventions, Primitives, Structures, Symbols and TopicsCategories. Little further is discussed about this category below.

A second category is for the Attributes SuperType. Attributes may be assigned to any of the reference concepts (RCs) associated with any of the other SuperTypes. Attributes are thus inherently non-disjoint. Little further is discussed about this category below.

A third category is for SuperTypes that are parental types for other SuperTypes. These are largely organizational in nature for helping to keep the upper portions of UMBEL manageable. Since their children are specific SuperTypes, this parental category may be used for some minor reasoning, but is not the central focus of the overall SuperTypes design. There are nineteen (19) SuperTypes in this category, and specifically include Agents, Artifacts, AVInfo, Constituents, Eukaryotes, Information, LivingThings, Manifestations, MentalProcesses, NaturalMatter, OrganicMatter, Places, Relations, SignElements, SocialProcesses, Space, Symbolic, Systems and Time. Though used for organizational purposes below, none of these are discussed further below individually.

A fourth, somewhat special SuperType is Shapes. About half of the RCs in UMBEL have a Shapes aspect; about half do not. Thus, Shapes can be used for some disjoint analysis, but is shared widely enough to not be that useful in most circumstances. Shapes is thus kept separate from the main SuperTypes category.

The fifth and last SuperTypes category is for those that are largely disjoint with one another. This main SuperTypes category contains 31 SuperTypes, specifically including:

Activities
Animals
AreaRegion
AtomsElements
AudioInfo
BiologicalProcesses
Chemistry
Diseases
Drugs
Events
Facilities
FinanceEconomy
FoodDrink
Forms
Geopolitical
LocationPlace
NaturalPhenomena
NaturalSubstances
OrganicChemistry
Organizations
Persons
Plants
Products
Prokaryotes
ProtistsFungus
Situations
Society
StructuredInfo
Times
VisualInfo
WrittenInfo

In addition, all of these SuperTypes are clustered into 9 "dimensions", which are useful for aggregation and organizational purposes, but which have no direct bearing on logic assertions or disjoint testing.

SUPPORTING FILES

Two files accompany this report and provide the actual assignments and details. They are, with brief explanations as to content and interpretation:

SuperTypes_20110119.xls

This spreadsheet contains all of the figures and summary statistics shown in the figures and tables below. It is a summation of the specifics in the next file.

As for the spreadsheet itself:

  1. Read all notes under the latter column in the Overview tab
  2. The Matrix tabs show where some areas are not disjoint with other areas. The defined non-disjoint categories -- Attributes, Abstract-level, Topics/Categories and Markets & Industries -- have much interaction with the other categories and no statistics for these are shown. In other categories the interactions are sometimes minimal and sometimes not. These overlaps are generally explained in the SuperType Intersection and Potential Overlaps columns on the Overview tab
  3. Four of the 33 SuperTypes are by definition non-disjoint. These are Abstract_level, Attributes, MarketsIndustries and TopicsCategories
  4. The remaining 29 SuperTypes are mostly disjoint.

supertypes_v100.zip

This zip file contains 876 individual files. These individual files have listings of RefConcepts by SuperType, and files for each of the intersections where RefConcepts are shared between two SuperTypes. The latter intersections define where disjoint assertions between SuperTypes are excluded. Most of the individual files list all of the RefConcepts in the applicable population as designated by the file name.

Two files in this archive deserve special mention:

  • superTypesStats.csv - this is the summary count RefConcepts by SuperType, including those shared with other SuperTypes
  • superTypesStatsIntersections.csv - this is the summary of RefConcept intersections between SuperTypes, as indicated by the headings. These summary counts recount the full RefConcept listings in the other constituent files in the zip package.

BASIS AND RATIONALE FOR THE SUPERTYPE CLASS

The assignment of UMBEL reference concepts to SuperTypes was an outgrowth of the observation that many of the concepts within UMBEL may be clustered into disjoint groupings. Most things and concepts about them are based on real, observable, physical things in the real world. Because most of these things can not occupy both the same moment in time and the same location in physical space, a useful criterion for looking at these things and concepts is disjointedness.

In a broad sense, then, we can split our concepts of the world between those ideas that are disjoint because they pertain to separable objects or ideas and those that are cross-cutting or organizational or classificatory. Attributes, such as color (pink, for example), are often cross-cutting in that they can be used to describe quite disparate things. Inherent classification schemes such as academic fields of study or library catalog systems — while useful ways to organize the world — are not themselves in-and-of the world or discrete from other ideas. Thus, classificatory or organizational concepts are inherently not disjoint.

The potential advantage of clustering into logical, disjoint groups can include:

  • A better basis for organizing a large concept space
  • Possible amenability to the use of templates for displaying similar attributes and information for similar concepts
  • Possible computational efficiency due to being able to segregate concepts into logically coherent groupings
  • Improved disambiguation by assessing concept matches in addition to entity matches via triangulation between the two assessments
  • Structure and integrity testing.

Any classificatory scheme has a degree of arbitrariness. To be useful, it must be perceived as logical and coherent and it should achieve most if not all of the potential advantages above.

Both "bottom up" (coherent clustering of related concepts) and "top down" (selecting top-level concepts and evaluating and clustering all child concepts using union, intersection or complement operators) were used to create the assignments herein. Each approach was iterated multiple times, with logic and coherence testing after every run. For example, analysis of shared parent concepts in the lineage and other structure-wide tests were employed.

Classification schemes always are subject to the tension between "lumping" and "splitting": are three groupings too few, 100 too many? This tension is also compounded by the possible sense of arbitrary boundaries, such as why "Drugs" and not "Toys"?

Classical taxonomists and other classifiers have always strove for "natural" classification systems. Based on the best information available, is the assignment of one item to Group A more defensible than it is to Group B? New knowledge or perceptions, such as the immense impact of genetics on classical systematics, can thoroughly change perceptions of what is logical and natural.

In the case of these UMBEL reference concepts, the tests employed were to find the highest degree of disjointedness while also maintaining a sense of logical coherence with the observable world. And, where non-disjointness was found, could that degree of overlap be seen as both natural and limited? For example, the SuperType of PersonTypes is non-disjoint with Animals because persons are humans; otherwise the groups are disjoint. Similarly, PersonTypes are non-disjoint with Organizations because some types of agents, such as MusicPerformingAgent, may be either an individual or a group.

These overlaps can be understood and can also be sought to be as minimal as possible.

DESCRIPTION OF THE SUPERTYPES

Here is a description of the SuperTypes, their clustering into "dimensions" and the intersections with other SuperTypes. Note that SuperType intersections with strong overlap (more than 10 assigned reference concepts involved) are noted in Bold, with very strong overlap (more than 100 assigned reference concepts involved) noted in Bold Underline:


Dimension SuperType Description SuperType Intersections
Constituents Natural Phenomena This SuperType includes natural phenomena and natural processes such as weather, weathering, erosion, fires, lightning, earthquakes, tectonics, etc. Clouds and weather processes are specifically included. Also includes climate cycles, general natural events (such as hurricanes) that are not specifically named, and biochemical processes and pathways. Activities
Area or Region The AreaRegion SuperType includes all nameable or definable areas or regions that may be found within "space". Though the distinction is not sharp, this SuperType is meant to be distinct from specific points of interest (POIs) that may be mapped (often displayed as a thumbtack). Areas or regions are best displayed on a map as a polygon (area) or path (polyline). Facilities, FinanceEconomy, Forms, Geopolitical, LocationPlace, NaturalSubstances, Organizations, Products
Location or Place The LocationPlace SuperType is for bounded and defined points in "space", which can be positiioned via some form of coordinate system and can often be shown as points of interest (POIs) on a map. This SuperType is distinguished by areas or locations, which are often best displayed as polygons or polylines on a map. AreaRegion, Facilities, Products, Situations
Shapes The Shapes SuperType captures all 1D, 2D and 3D shapes, regular or irregular. Most shapes are geometrically describable things.

Shapes has only a minor disjointedness role, with more than half of UMBEL reference concepts having some aspect of a Shapes specification.
Not cross-referenced; see text.
Forms This SuperType category includes all aspects of the shapes that objects take in space; Forms is thus closely related to Shapes. The Forms SuperType is also the collection of natural cartographic features that occur on the surface of the Earth or other planetary bodies, as well as the form shapes that naturally occurring matter may assume. Positive examples include Mountain, Ocean, and Mesa. Artificial features such as canals are excluded. Most instances of these natural features have a fixed location in space. Animals, AreaRegion, Chemistry, Facilities
Time-related Activities These are ongoing activities that result (mostly) from human effort, often conducted by organizations to assist other organizations or individuals (in which case they are known as services, such as medicine, law, printing, consulting or teaching) or individual or group efforts for leisure, fun, sports, games or personal interests (activities).

Generic, broad grouping of actions that apply to generic objects are also included in this SuperType.
AtomsElements, AudioInfo, BiologicalProcesses, Chemistry, Diseases, Events, Products, StructuredInfo
Events These are nameable occasions, games, sports events, conferences, natural phenomena, natural disasters, wars, incidents, anniversaries, holidays, or notable moments or periods in time Events have a finite duration, with a beginning and end. Individual events (such as wars, disasters, newsworthy occasions) may also have their own names. Activities, Diseases, Situations, StructuredInfo, Times
Times This SuperType is for specific time or date or period (such as eras, or days, weeks, months type intervals) references in various formats. Activities, Events
Natural Matter Atoms and Elements The Atoms and Elements SuperType contains all known chemical elements and the constituents of atoms. Activities, Chemistry, NaturalSubstances, Products
Natural Substances The Natural Substances SuperType are minerals, compounds, chemicals, or physical objects that are not the outcome of purposeful human effort, but are found naturally occurring. Other natural objects (such as rock, fossil, etc.) are also found under this SuperType. Chemicals can be Natural Substances, but only if they are naturally occurring, such as limestone or salt. AreaRegion, AtomsElements, Chemistry, Facilities, FoodDrink, Products
Chemistry This SuperType is a residual category for chemical bonds, chemical composition groupings, and the like. It is formed by what is not a natural substance or living thing (organic) substance. Organic Chemistry and Biological Processes are, by definition, separate SuperTypes. This Chemistry SuperType thus includes inorganic chemistry, physical chemistry, analytical chemistry, materials chemistry, nuclear chemistry, and theoretical chemistry. Activities, Animals, AtomsElements, Drugs, FinanceEconomy, FoodDrink, Forms, NaturalSubstances, OrganicChemistry, Products
Organic Matter Organic Chemistry The Organic Chemistry SuperType is for all chemistry involving carbon, including the biochemistry of living organisms and the materials chemistry (including polymers) of organic compounds such as fossil fuels. Activities, Chemistry, Drugs, FoodDrink, Products, Prokaryotes

Biochemical Processes The Biochemical Processes SuperType is for all sequences of reactions and chemical pathways associated with living things. Activities
Living Things Prokaryotes The Prokaryotes include all prokaryotic organisms, including the Monera, Archaebacteria, Bacteria, and Blue-green algas. Also included in this SuperType are viruses and prions. OrganicChemistry
Protists & Fungus This is the remaining cluster of eukaryotic organisms, specifically including the fungus and the protista (protozoans and slime molds). Drugs, FoodDrink, Plants
Plants This SuperType includes all plant types and flora, including flowering plants, algae, non-flowering plants, gymnosperms, cycads, and plant parts and body types. Note that all Plant Parts are also included. Chemistry, Drugs, FoodDrink, Products, ProtistsFungus
Animals This large SuperType includes all animal types, including specific animal types and vertebrates, invertebrates, insects, crustaceans, fish, reptiles, amphibia, birds, mammals, and animal body parts. Animal parts are specifically included. Also, groupings of such animals are included. Humans, as an animal, are included (versus as an individual Person). Diseases are specifically excluded. Animals have many of the similar overlaps to Plants. However, in addition, there are more terms for animal groups, animal parts, animal secretions, etc. Also Animals can include some human traits (posture, dead animal, etc) Chemistry, Diseases, FoodDrink, Forms, Persons, Products
Diseases Diseases are atypical or unusual or unhealthy conditions for (mostly human) living things, generally known as conditions, disorders, infections, diseases or syndromes. Diseases only affect living things and sometimes are caused by living things. This SuperType also includes impairments, disease vectors, wounds and injuries, and poisoning Activities, Animals, Events
Agents Persons The appropriate SuperType for all named, individual human beings. This SuperType also includes the assignment of formal, honorific or cultural titles given to specific human individuals. It further includes names given to humans who conduct specific jobs or activities (the latter case is known as an avocation). Examples include steelworker, waitress, lawyer, plumber, artisan. Ethnic groups are specifically included.

Persons as living animals are included under the Animals SuperType.
Animals
Organizations Organization is a broad SuperType and includes formal collections of humans, sometimes by legal means, charter, agreement or some mode of formal understanding. Examples include geopolitical entities such as nations, municipalities or countries; or companies, institutes, governments, universities, militaries, political parties, game groups, international organizations, trade associations, etc. All institutions, for example, are organizations.

Also included are informal collections of humans. Informal or less defined groupings of humans may result from ethnicity or tribes or nationality or from shared interests (such as social networks or mailing lists) or expertise ("communities of practice"). This dimension also includes the notion of identifiable human groups with set members at any given point in time. Examples include music groups, cast members of a play, directors on a corporate Board, TV show members, gangs, mobs, juries, generations, minorities, etc.

Finally, Organizations contain the concepts of Industries and Programs and Communities.
AreaRegion, FinanceEconomy, Situations, Society
Geopolitical Named places that have some informal or formal political (authorized) component. Important subcollections include Country, IndependentCountry, State_Geopolitical, City, and Province. AreaRegion
Artifacts Products This is SuperType includes any instance offered for sale or performed as a commercial service. A Product is often a physical object made by humans that is not a conceptual work or a facility, such as vehicles, cars, trains, aircraft, spaceships, ships, foods, beverages, clothes, drugs, weapons. Activities, Animals, AreaRegion, AtomsElements, AudioInfo, Chemistry, Drugs, Facilities, FinanceEconomy, FoodDrink, LocationPlace, NaturalSubstances, OrganicChemistry, Plants, StructuredInfo, VisualInfo, WrittenInfo
Food or Drink This SuperType is any edible substance grown, made or harvested by humans. The category also specifically includes the concept of cuisines. Activities, Animals, Chemistry, Drugs, NaturalSubstances, OrganicChemistry, Plants, Products, ProtistsFungus
Drugs This SuperType is a drug, medication or addictive substance, or a toxin or a poison. Chemistry, FoodDrink, OrganicChemistry, Plants, Products
Facilities Facilities are physical places or buildings constructed by humans, such as schools, public institutions, markets, museums, amusement parks, worship places, stations, airports, ports, carstops, lines, railroads, roads, waterways, tunnels, bridges, parks, sport facilities, monuments. All can be geospatially located.

Facilities also include animal pens and enclosures and general human "activity" areas (golf course, archeology sites, etc.). Importantly Facilities include infrastructure systems such as roadways and physical networks.


Facilities also include the component parts that go into making them (such as foundations, doors, windows, roofs, etc.).


Facilities can also include natural structures that have been converted or used for human activities, such as occupied caves or agricultural facilities.

Finally, facilities also include workplaces. Workplaces are areas of human activities, ranging from single person workstations to large aggregations of people (but which are not formal political entities).

AreaRegion, FinanceEconomy, Forms, LocationPlace, NaturalSubstances, Organizations, Products, VisualInfo
Information Audio Info This SuperType is for any audio-only human work. Examples include live music performances, record albums, or radio shows or individual radio broadcasts Activities, Products
Visual Info The Visual Info SuperType is for any still image or picture or streaming video human work, with or without audio. Examples include graphics, pictures, movies, TV shows, individual shows from a TV show, etc. Activities, AudioInfo, Facilities, FinanceEconomy, Products, StructuredInfo, WrittenInfo
Written Info This SuperType includes any general material written by humans including books, blogs, articles, manuscripts, but any written information conveyed via text. Activities, AudioInfo, FinanceEconomy, Products, StructuredInfo, VisualInfo
Structured Info This information SuperType is for all kinds of structured information and datasets, including computer programs, databases, files, Web pages and structured data that can be presented in tabular form. Activities, Events, FinanceEconomy, Products, VisualInfo, WrittenInfo
Social Finance & Economy This SuperType pertains to all things financial and with respect to the economy, including chartable company performance, stock index entities, money, local currencies, taxes, incomes, accounts and accounting, mortgages and property. Activities, AreaRegion, Chemistry, Facilities, Organizations, Products, StructuredInfo, VisualInfo, WrittenInfo
Society This category includes concepts related to political systems, laws, rules or cultural mores governing societal or community behavior, or doctrinal, faith or religious bases or entities (such as gods, angels, totems) governing spiritual human matters. Culture, Issues, beliefs and various activisms (most -isms) are included. Activities, Organizations, Situations


Table 1. Description of SuperTypes
Dimension SuperType (label) Description/Sub-types SuperType Intersections
Natural World Natural Phenomena
(NaturalPhenomena)
This SuperType includes natural phenomena and natural processes such as weather, weathering, erosion, fires, lightning, earthquakes, tectonics, etc. Clouds and weather processes are specifically included. Also includes climate cycles, general natural events (such as hurricanes) that are not specifically named, and biochemical processes and pathways. Activities, Events

Natural Substances
(NaturalSubstance)
Notable inclusions are minerals, compounds, chemicals, or physical objects that are not the outcome of purposeful human effort, but are found naturally occurring. Other natural objects (such as rock, fossil, etc.) are also found under this SuperType. Natural Substances include subatomic particles. The contrast is with Earthscape, which covers natural "features" or living substances, which are covered under the appropriate SuperTypes. Chemicals can be Natural Substances, but only if they are naturally occurring, such as limestone or salt Animals, Chemistry, Drugs, FoodDrinks, Products

Earthscape
(Earthscape)
The Natural Feature SuperType is the collection of cartographic features that occur on the surface of the Earth. Positive examples include Mountain, Ocean, and Mesa. Artificial features such as canals are excluded. Most instances of these features have a fixed location in space.

Underground and underwater are also explicitly contained.

This SuperType is explicitly disjoint with Extraterrestrial (see below).
Geopolitical, NaturalSubstances, Organizations

Extraterrestrial
(Extraterrestrial)
This SuperType includes all natural things not specifically terrestrial, including celestial bodies (planets, asteroids, stars, galaxies, etc., that can be located within a sky map) Events, NaturalPhenomena, NaturalSubstances, VisualInfo
Living Things Prokaryotes
(Prokaryotes)
The Prokaryotes include all prokaryotic organisms, including the Monera, Archaebacteria, Bacteria, and Blue-green algas. Also included in this SuperType are viruses and prions.

Protists & Fungus
(Protists_Fungus)
This is the remaining cluster of eukaryotic organisms, specifically including the fungus and the protista (protozoans and slime molds). FoodDrinks, Prokaryotes

Plants
(Plants)
This SuperType includes all plant types and flora, including flowering plants, algae, non-flowering plants, gymnosperms, cycads, and plant parts and body types. Note that all Plant Parts are also included. Drugs, FoodDrinks, Products

Animals
(Animals)
This large SuperType includes all animal types, including specific animal types and vertebrates, invertebrates, insects, crustaceans, fish, reptiles, amphibia, birds, mammals, and animal body parts. Animal parts are specifically included. Also, groupings of such animals are included. Humans, as an animal, are included (versus as an individual Person). Diseases are specifically excluded. Animals have many of the similar overlaps to Plants. However, in addition, there are more terms for animal groups, animal parts, animal secretions, etc. Also Animals can include some human traits (posture, dead animal, etc) Chemistry, FoodDrinks, NaturalSubstances, PersonTypes, Products, Society

Diseases
(Diseases)
Diseases are atypical or unusual or unhealthy conditions for (mostly human) living things, generally known as conditions, disorders, infections, diseases or syndromes. Diseases only affect living things and sometimes are caused by living things. This SuperType also includes impairments, disease vectors, wounds and injuries, and poisoning Animals, Events, NaturalPhenomena

Person Types
(PersonTypes)
The appropriate SuperType for all named, individual human beings. This SuperType also includes the assignment of formal, honorific or cultural titles given to specific human individuals. It further includes names given to humans who conduct specific jobs or activities (the latter case is known as an avocation). Examples include steelworker, waitress, lawyer, plumber, artisan. Ethnic groups are specifically included. Animals, Society, Organizations
Human Activities Organizations
(Organizations)
Organization is a broad SuperType and includes formal collections of humans, sometimes by legal means, charter, agreement or some mode of formal understanding. Examples include geopolitical entities such as nations, municipalities or countries; or companies, institutes, governments, universities, militaries, political parties, game groups, international organizations, trade associations, etc. All institutions, for example, are organizations.

Also included are informal collections of humans. Informal or less defined groupings of humans may result from ethnicity or tribes or nationality or from shared interests (such as social networks or mailing lists) or expertise ("communities of practice"). This dimension also includes the notion of identifiable human groups with set members at any given point in time. Examples include music groups, cast members of a play, directors on a corporate Board, TV show members, gangs, mobs, juries, generations, minorities, etc.

Finally, Organizations contain the concepts of Industries and Programs and Communities.
PersonTypes

Finance & Economy
(FinanceEconomy)
This SuperType pertains to all things financial and with respect to the economy, including chartable company performance, stock index entities, money, local currencies, taxes, incomes, accounts and accounting, mortgages and property. Activities, Earthscape, Events, Facilities, NaturalSubstances, Products, StructuredInfo, WrittenInfo

Society
(Society)
This category includes concepts related to political systems, laws, rules or cultural mores governing societal or community behavior, or doctrinal, faith or religious bases or entities (such as gods, angels, totems) governing spiritual human matters. Culture, Issues, beliefs and various activisms (most -isms) are included PersonTypes, WrittenInfo

Activities
(Activities)
These are ongoing activities that result (mostly) from human effort, often conducted by organizations to assist other organizations or individuals (in which case they are known as services, such as medicine, law, printing, consulting or teaching) or individual or group efforts for leisure, fun, sports, games or personal interests (activities) Events, FinanceEconomy, NaturalPhenomena, Products, StructuredInfo
Time-related Events
(Events)
These are nameable occasions, games, sports events, conferences, natural phenomena, natural disasters, wars, incidents, anniversaries, holidays, or notable moments or periods in time Activities, Chemistry, FinanceEconomy, NaturalPhenomena

Time
(Time)
This SuperType is for specific time or date or period (such as eras, or days, weeks, months type intervals) references in various formats
Human Works Products
(Products)
This is the largest SuperType and includes any instance offered for sale or performed as a commercial service. Often these are physical objects made by humans that are not a conceptual work or a facility, such as vehicles, cars, trains, aircraft, spaceships, ships, foods, beverages, clothes, drugs, weapons. Products also include the concept of 'state' (e.g., on/off) Activities, Animals, AudioInfo, Chemistry, Drugs, Facilities, FinanceEconomy, FoodDrinks, NaturalSubstances, Notations, Plants, StructuredInfo, VisualInfo, WrittenInfo

Food or Drink
(FoodDrink)
This SuperType is any edible substance grown, made or harvested by humans. The category also specifically includes the concept of cuisines Activities, Animals, Chemistry, Drugs, Events, NaturalSubstances, Plants, Products, ProtistsFungus

Drugs
(Drugs)
This SuperType is a drug, medication or addictive substance Chemistry, FoodDrinks, NaturalSubstances, Products

Facilities
(Facilities)
Facilities are physical places or buildings constructed by humans, such as schools, public institutions, markets, museums, amusement parks, worship places, stations, airports, ports, carstops, lines, railroads, roads, waterways, tunnels, bridges, parks, sport facilities, monuments. All can be geospatially located.

Facilities also include animal pens and enclosures and general human "activity" areas (golf course, archeology sites, etc.). Importantly Facilities include infrastructure systems such as roadways and physical networks.

Facilities also include the component parts that go into making them (such as foundations, doors, windows, roofs, etc.)

Earthscape, FinanceEconomy, Products, Workplaces
Human Places Geopolitical
(Geopolitical)
Named places that have some informal or formal political (authorized) component. Important subcollections include Country, IndependentCountry, State_Geopolitical, City, and Province. FinanceEconomy, Organizations

Workplaces, etc.
(Workplaces)
These are various workplaces and areas of human activities, ranging from single person workstations to large aggregations of people (but which are not formal political entities) Earthscape, Facilities, FinanceEconomy
Information Chemistry (n.o.c)
(Chemistry)
This SuperType is a residual category (n.o.c., not otherwise categorized) for chemical bonds, chemical composition groupings, and the like. It is formed by what is not a natural substance or living thing (organic) substance. Drugs, Events, FoodDrinks, NaturalSubstances, Products

Audio Info
(AudioInfo)
This SuperType is for any audio-only human work. Examples include live music performances, record albums, or radion shows or individual radio broadcasts Events, Notations, Products

Visual Info
(VisualInfo)
any still image or picture or streaming video human work, with or without audio. Examples include graphics, pictures, movies, TV shows, individual shows from a TV show, etc. AudioInfo, Events, Facilities, NaturalPhenomena, Notations, Products, StructuredInfo, WrittenInfo

Written Info
(WrittenInfo)
This SuperType includes any general material written by humans including books, blogs, articles, manuscripts, but any written information conveyed via text. FinanceEconomy, Notations, Products, StructuredInfo, VisualInfo

Structured Info
(StructuredInfo)
This information SuperType is for all kinds of structured information and datasets, including computer programs, databases, files, Web pages and structured data that can be presented in tabular form Activities, Events, NaturalPhenomena, Notations, Products, VisualInfo, WrittenInfo

Notations & References
(Notations)
Akin to conceptual works, these are codified means of human expression. Examples range from human languages themselves, to more domain-specific cases such as chemical symbols, genetic code (A-G-C-T), protocols, and computer languages, mathematical and set notations, etc.

Identifiers (numeric or alphanumeric identifiers for objects, often in a highly patterned way, such as phone numbers, URLs, zip and postal codes, SKUs, product codes, etc.), Units (any of the various ways in which measurement, space, volume, weight, speed, intensity, temperature, calories, siesmic intensity or other quantitative descriptions of phenomena can be made) and key reference types are also included in this SuperType
AudioInfo, Numbers, StructuredInfo, VisualInfo, WrittenInfo

Numbers
(Numbers)
This unique SuperType is for any abstract representation of numbers and numerics Notations
Descriptive Attributes
(Attributes)
This general SuperType category is for descriptive attributes of all kinds. Think of the specific attributes in Wikipedia "infoboxes" to understand the purpose and coverage of this SuperType. It includes colors, shapes, sizes, emotions, states or other descriptive characteristics about an object, particularly those than can be listed or enumerated by attribute type
Classificatory Abstract-level
(Abstract_level)
This general SuperType category is largely composed of former AbstractConcepts, and represent some of the more abstract upper-level nodes for connecting the UMBEL structure together. This SuperType also includes theories or processes or methods for humans to do stuff or any human technology

Topics/Categories
(TopicsCategories)
This largely subject-oriented SuperType is a means for using controlled vocabularies and classification schemes for characterizing what content "is about". The key constituents of this category are Types, Classifications, Concepts, CCC, and controlled vocabularies

Markets & Industries
(MarketsIndustries)
This SuperType is a specialized classificatory system for markets and industries. It could be combined with the SuperType above, but is kept separate in order to provide a separate, economy-oriented system.
Table 1. Description of SuperTypes

ANALYSIS OF THE SUPERTYPES

This section provides an analysis of the reference concept assignments and their possible disjointedness or overlap with other SuperTypes.

Distribution of SuperTypes

The following diagram shows the distribution of these 28,000 UMBEL concepts across SuperType. By far the largest SuperType is Products (itself split into two columns to keep the other items in scale), even with further splits into Food & Drinks and Drugs (pharmaceuticals). The next largest categories are Animals, Persons and Places and Events and Activities SuperTypes:

# of SuperTypes by Category
Figure 1. Distribution of Reference Concepts by SuperType

Even in its generic state, UMBEL provides a very rich vocabulary for describing things or for tying in more detailed external ontologies.

Possible Overlaps (non-disjoint) between SuperTypes

Twenty-nine of the SuperTypes are “mostly disjoint.” This is because there are some concepts — say, MusicPerformingAgent — that can apply to either a person or a group (band or orchestra, for example). Thus, for this concept alone, we have a bit of overlap between the normally disjoint Person and Organization SuperTypes.

The following shows the resulting interaction matrix where there may be some overlap between SuperTypes, including the RC count of the overlap and shading from light red to dark red showing an increasing degree of overlap:

Instance SuperTypes Overlap
Figure 2. Cross-interaction Matrix of Reference Concepts by SuperType

This kind of interaction diagram is also useful for further analyzing the concept graph structure. Note this figure is in the accompanying summary file as well.

Disjoint and Non-disjoint Analysis

First, the 29 SuperTypes in our mostly disjoint categories contain 90% of the UMBEL reference concepts. The remaining 10%, by definition classificatory or non-disjoint (overlapping), occurs in the other four SuperTypes. Here are the summary percentages of these high-level splits:

Disjoint Concepts (29 SuperTypes) 90%
Attributes (1 SuperType) 1%
Classifications (3 SuperTypes) 9%
TOTAL 100%
Table 2. Distribution of Non-disjoint SuperTypes

The actual statistics by SuperType are shown in the table below.

Within the 90% of reference concepts that are putatively disjoint, about 65% are fully disjoint.

But, as this table indicates, there is a wide diversity of overlap or not between SuperTypes:

SuperType Count Percentage
Abstract-level 475
1.3%
Attributes 741
2.1%
Markets & Industries 142
0.4%
Topics & Types 1,371
3.8%


2,729 7.6%
Natural Phenomena 260
0.7%
Natural Substances 1,250
3.5%
Earthscape 884
2.5%
Extraterrestrial 184
0.5%
Prokaryotes 238
0.7%
Protists & Fungus 59
0.2%
Plants 716
2.0%
Animals 4,006
11.2%
Diseases 514
1.4%
Persons 2,433
6.8%
Organizations 1,824
5.1%
Finance and Economy 773
2.2%
Society 279
0.8%
Activities 2,748
7.7%
Events 2,708
7.6%
Time 192
0.5%
Products 5,731
16.1%
Food or Drink 878
2.5%
Drugs 439
1.2%
Facilities 1,036
2.9%
Geopolitical 1,186
3.3%
Workplaces 267
0.7%
Chemistry (n.o.c.) 682
1.9%
Audio Info 202
0.6%
Visual Info 295
0.8%
Written Info 1,210
3.4%
Structured Info 686
1.9%
Notations & References 1,221
3.4%
Numbers 45
0.1%






32,946 92.4%

35,675
100.0%
Table 3. Quantification of Reference Concepts by SuperType

Also telling is that nearly half of the overlaps (42%) between SuperTypes occur in only three areas: PersonTypes v. Animals (for humans), PersonTypes v. Organizations (for certain agents) and Events v. Activities (for genuine ambiguities between the categories). The remaining 75 interactions account for the remaining half of the observed overlaps.

Even Where Overlaps Occur, They are Minor

Of the 29 mostly disjoint SuperTypes, only a relatively few show potential interactions, and then mostly in minor ways (excepting the three interactions noted earlier). We can illustrate this (drawn to scale) for the interaction between the Product, Food & Drink and Drug (Pharmaceuticals) SuperTypes, with the fully disjoint Organization SuperType thrown in for comparison:

Example SuperTypes Overlap

Figure 3. Sample Venn Diagram of Minor SuperTypes Overlap

Across all 28,000 concepts, then, about 60% are disjoint from one another. These reference concepts can gain the advantages noted at the beginning of this report.

FUTURE WORK

The next steps with the SuperTypes will be to refine the interaction matrix such that only true overlaps are excluded from disjoint assertions, as opposed to all of the RefConcepts in the participating SuperTypes.

Copyright © 2009-2016 by Structured Dynamics LLC.

[[Category:ZTechWiki]][[Category:Specification]][[Category:UMBEL]]