Sharing Data (Export RDF)
Profiles Research Networking Software is a Semantic Web application, which means its content
can be read and understood by other computer programs. This enables the data in
profiles, such as addresses and publications, to be shared with other institutions
and appear on other websites. If you click the "Export RDF" link on the left sidebar
of a profile page, you can see what computer programs see when visiting a profile.
The section below describes the technical details for building a computer program
that can export data from Profiles Research Networking Software.
Technical Details
As a Semantic Web application, Profiles Research Networking Software uses the Resource Description
Framework (RDF) data model. In RDF, every entity (e.g., person, publication, concept)
is given a unique URI. (A URI is similar to a URL that you would enter into a web
browser.) Entities are linked together using "triples" that contain three URIs--a
subject, predicate, and object. For example, the URI of a Person can be connected
to the URI of a Concept through a predicate URI of hasResearchArea. Profiles Research
Networking Software contains millions of URIs and triples. Semantic Web applications use an
ontology, which describes the classes and properties used to define entities and
link them together. Profiles Research Networking Software uses the VIVO Ontology, which was
developed as part of an NIH-funded grant to be a standard for academic and research
institutions. A growing number of sites around the world are adopting research networking
platforms that use the VIVO Ontology. Because RDF can link different triple-stores
that use the same ontology, software developers are able to create tools that span
multiple institutions and data sources. When RDF data is shared with the public,
as it is in Profiles Research Networking Software, it is called Linked Open Data (LOD).
There are four types of application programming interfaces (APIs) in Profiles Research
Networking Software.
- RDF crawl. Because Profiles Research Networking Software is a Semantic Web application, every
profile has both an HTML page and a corresponding RDF document, which contains the
data for that page in RDF/XML format. Web crawlers can follow the links embedded
within the RDF/XML to access additional content.
- SPARQL endpoint. SPARQL is a programming language that enables arbitrary queries
against RDF data. This provides the most flexibility in accessing data; however,
the downsides are the complexity in coding SPARQL queries and performance. In general,
the XML Search API (see below) is better to use than SPARQL.
- XML Search API. This is a web service that provides support for the most common
types of queries. It is designed to be easier to use and to offer better performance
than SPARQL, but at the expense of fewer options. It enables full-text search across
all entity types, faceting, pagination, and sorting options. The request message
to the web service is in XML format, but the output is in RDF/XML format.
- Old XML based web services. This provides backwards compatibility for institutions
that built applications using the older version of Profiles Research Networking Software. These
web services do not take advantage of many of the new features of Profiles Research
Networking Software. Users are encouraged to switch to one of the new APIs.
For more information about the APIs, please see the
documentation and
example files.