Powered by Zoomin Software. For more details please contactZoomin

Glossary

S

  • Last Updated: May 18, 2026
  • 8 minute read
    • MarkLogic Server
    • Version 12.0
    • Documentation

S3 (Amazon Simple Storage Service)

Cloud. Amazon S3 is an online file storage web service offered by Amazon Web Services that provides storage through web services interfaces. See Amazon S3 for more information.

schema

Database. A representation of a SQL schema. A schema is implemented as an XML document in the schemas database and consists of a unique id, a name (which must also be unique), and a collection of views. During SQL execution, the schema provides the naming context for its views, which enables you to have multiple views of the same name in different schemas. The default schema is called “main.”It is default is the sense that it is always implicitly available and first on the default schema search path for name resolution in SQL. Even though the “main” schema is a default, you must create this schema.

search APIs

Database. MarkLogic provides search features through a set of layered APIs. The core text search foundations in MarkLogic are the XQuery cts:* and JavaScript cts.* APIs, which are built-in functions that perform full-text search. The XQuery search:*, JavaScript jsearch.*, and REST APIs above this foundation provide a higher level of abstraction that enable rapid development of search applications. The higher-level APIs offer benefits such as the following:

Abstraction of queries from the constraints and indexes that support them.

Built in support for search result snippeting, highlighting, and performance analysis.

An extensible simple string query grammar.

Easy-to-use syntax for query composition.

Built in best practices that optimize performance.

You can use more than one of these APIs in an application. For example, a Java application can include an XQuery extension to perform custom search result transformations on the server. Similarly, an XQuery application can call both search:* and cts:* functions.

search features

Database. MarkLogic Server search supports a wide range of full-text features. These features include phrase search, boolean search, proximity, stemming, tokenization, decompounding, wildcarded searches, punctuation-sensitive search, diacritic-sensitive/insensitive searches, case-sensitive/insensitive searches, spelling correction functions, thesaurus functions, geospatial searches, advanced language and collation support, document quality settings, numerous relevance algorithms, individual term weighting, topic clustering, faceted navigation, custom-indexed fields, and much more. These features are all designed to build off of each other and work together in an extensible and flexible way.

searchable expression

Search. An XPath expression that is (a) Rooted in an fn:doc, fn:collection, or xdmp:document-properties call in XQuery, or in an fn.doc, cts.doc, fn.collection, or xdmp.document-properties call in Server-Side JavaScript, and (b) uses only forward axes such as “/” and “//”. For details, see Fully Searchable Paths and cts:search Operations in Query Performance and Tuning.

secure credentials

Security. Secure credentials enable a security administrator to manage credentials, making them available to less privileged users for authentication to other systems without giving them access to the credentials themselves.

Secure credentials consist of a PEM encoded x509 certificate and private key and/or a username and password. Secure credentials are stored as secure documents in the Security database on MarkLogic Server, with passwords and private keys encrypted.

semantic web technologies

Semantics. A set of standards and best practices for sharing linked data for use by applications over the Web. This includes the “semantics” (context and meaning) of that data. Semantic web technologies include the Resource Description Framework Schema (RDFS) standard, SPARQL, Web Ontology Language (OWL) and other ontology vocabularies. See also linked data.

sequence file

Hadoop. A flat file of binary key-value pairs in one of the Apache Hadoop SequenceFile formats. The MarkLogic content pump (mlcp) tool only supports importing Text and BytesWritable values from a sequence file.

server

Client-server model software. The software or hardware that provides resources upon client request.

server-side JavaScript

JavaScript. MarkLogic provides a JavaScript environment that runs within an e-node.

server-side XQuery and XSLT APIs

XQuery and XSLT. MarkLogic includes support for XQuery 1.0 and XSLT 2.0. These are W3C-standard XML-centric languages designed for processing, querying, and transforming XML.

service

MarkLogic. Describes what to monitor and how to monitor one or more objects in a MarkLogic cluster. Services can define warning and critical thresholds for alerting and can monitor one or more objects in MarkLogic Server.

service group

MarkLogic. A group of one or more services.

shallow time

General. The time spent evaluating a specific expression, not including time spent evaluating any expressions contained within the specific expression. See elapsed time and deep time.

shuffle

Hadoop. The process of sorting all map output values with the same key into a single (key, value-list) reduce input key-value pair. The shuffle happens between map and reduce. Portions of the shuffle can be performed by map tasks and portions by reduce tasks.

single-statement transaction

Database. Any transaction created in auto transaction mode. Single-statement transactions always contain only one statement and are automatically committed on successful completion or rolled back on error.

SKOS (Simple Knowledge Organization System)

Semantics. A W3C recommendation designed for representation of thesauri, classification schemes, taxonomies, subject-heading systems, or any other type of structured controlled vocabulary. See also thesaural relationship.

small binary document

MarkLogic. A binary document stored in a MarkLogic database whose size does not exceed the large size threshold. For more information, see Choosing a Binary Format in Load Content into MarkLogic Server.

smart mastering

Data Hub. Leverage fuzzy logic and AI to match and merge data so you can master data quickly and automatically without having to buy a separate MDM tool.

snippet

MarkLogic. The result of a search function that returned a portion of the found documents.

snippeting

MarkLogic. A search function in which portions (snippets) of the found documents are returned in the search results.

SOAP (Simple Object Access Protocol)

Web. A Web Services protocol for exchanging structured information in computer networks. It relies on Application Layer protocols, most notably Hypertext Transfer Protocol (HTTP) or Simple Mail Transfer Protocol (SMTP), for message negotiation and transmission.

SPARQL

Semantics. A recursive acronym for SPARQL Protocol and RDF Query Language (SPARQL), a query language designed for querying data in Resource Description Framework Schema (RDFS) format. MarkLogic supports SPARQL1.1.

SPARQL endpoint

Semantics. A resource that a SPARQL process can contact and use as a service. The endpoint accepts SPARQL queries and updates, and returns the results using SPARQL protocol.

SPARQL protocol

Semantics. A means of conveying SPARQL queries from query clients to query processors, consisting of an abstract interface with bindings to Hypertext Transfer Protocol (HTTP) and Simple Object Access Protocol (SOAP).

SPARQL Update

Semantics. A language for making updates to triples in Resource Description Framework Schema (RDFS) format. MarkLogic supports SPARQL Update 1.1.

split

Hadoop. The unit of work for one thread in local mode or one MapReduce task in distributed mode.

SQL support

SQL. The SQL supported by the core SQL engine is SQL92, with the addition of SET, SHOW, and DESCRIBE statements. MarkLogic SQL enables you to connect Business Intelligence (BI) tools, such as Tableau and Qlik.

SSL (Secure Sockets Layer)

Security. A transaction security standard that provides encrypted protection between browsers and App Servers. When SSL is enabled for an App Server, browsers communicate with the App Server by means of an HTTPS connection, which is HTTP over an encrypted Secure Sockets Layer. HTTPS connections are widely used by banks and web vendors for secure transactions over the web.

stand

MarkLogic. Databases are made up of one or more forests, and forests are made up of one or more stands. See forest.

state parameter

OAuth. A parameter used to keep track of requests and provides a method for maintaining states within the OAuth 2.0 protocol. Used to prevent Cross-Site Request Forgery (CSRF) attacks. [New in v11.2.0]

statement

XQuery. An XQuery main module, as defined by the W3C XQuery standard, to be evaluated by MarkLogic Server. A main module consists of an optional prolog and a complete XQuery expression. Statements are either query statements or update statements, determined statically through static analysis prior to beginning the statement evaluation.

static content

MarkLogic. Content stored in the modules database of the App Server. MarkLogic Server responds directly to HTTP range requests (partial GETs) of static content. See Downloading Binary Content With HTTP Range Requests in Develop Server-Side Applications.

static host

A host configured to be a permanent part of a cluster. D-nodes must be on static hosts.

status change

MarkLogic. As part of content processing (create, update, and delete), Content Processing Framework (CPF) automatically handles the document status change events and sets a state (or cleans up in the case of delete) for the document. When you want the result of these changes to move a document in or out of document processing, it is known as a status change. The Status Change Handling pipeline performs these tasks automatically.

stem

MarkLogic. A word stem is the part of a word that is common to all of its inflected variants. For example, in English, "run" is the stem of "run", "runs", "ran", and "running". Words derived from the same meaning and part of speech have the same stem (for example, 'mouse' and 'mice'). Some words can have multiple stems (if the same word can be used as a different part of speech, or if there are two words with the same spelling), and if you use advanced stemming (which can find multiple stems for a word), then stemmed search will find all of the words having the same stem as any of the stems. See Understanding and Using Stemmed Searches in Develop Search Applications.

MarkLogic. A stemmed search for a term matches all the terms that have the same stem as the search term. MarkLogic Server can perform stemmed searches in a number of different languages. See Understanding and Using Stemmed Searches in Develop Search Applications.

stemming

Search. When stemming is enabled, MarkLogic Server automatically searches for words that come from the same stem of the word specified in the query, not just the exact string specified in the query. The purpose of stemming is to increase the recall for a search. A stemmed search for a word finds the exact same terms as well as terms that derive from the same meaning and part of speech as the search term. See Stemming in MarkLogic Server in Develop Search Applications.

string query

Search. A simple search string constructed using either the default MarkLogic Server search grammar, or a user-defined grammar. For example, 'cat' and 'cat OR dog' are string queries. For details, see Creating a Query From Search Text With cts:parse or Searching Using String Queries in Develop Search Applications.

structured query

Search. The pre-parsed representation of a query, expressed as XML or JSON. Structured queries allow you to express complex queries very efficiently. For details, see Querying Documents and Metadata in Develop Using the REST API and Searching Using Structured Query in Develop Search Applications.

sub-database

Database. A database contained in a super-database

Subject

Semantics. A representation of a resource such as a person or an entity. See also Object and Predicate.

super-database

Database. A database containing other databases (sub-database) so that they can be queried as if they were a single logical database.

synchronous

General. Synchronous processing happens “at the same time”. This form of input/output processing blocks I/O until the operation has finished. Synchronous processing does not permit other processing to continue before/until the transmission has finished. See asynchronous.

system time

Bitemporal. When the information (captured in a document) was stored in the database. System time may also be called transaction time. See bitemporal and valid time.

system timestamp

MarkLogic. A number maintained by MarkLogic Server that increases every time a change or a set of changes occurs in any of the databases in a system (including configuration changes from any host in a cluster). Each fragment stored in a database has system timestamps associated with it to determine the range of timestamps during which the fragment is valid.

TitleResults for “How to create a CRG?”Also Available inAlert