Search

LABDRIVE provides an easy-to-use web interface for searching, alongside a full API that accepts JSON-based Query DSL (Domain Specific Language) queries and supports the Elasticsearch syntax for searching items.

This guide is an introduction to the LABDRIVE search capabilities, intended to help you get familiar with them.

Here you can find:

  • How to search using the Management Interface

  • How to do basic search using the API

Search using the Management Interface

1. Sign in to the LABDRIVE Management Interface and go to Content Search:

2. Enter your search query:

(1) Select whether to search the items inside the containers or the containers' metadata.

(2) Enter your query here. It can be a single word or, if you start typing a metadata field name, a metadata-based query:

(3) Alternatively, build an advanced query by selecting the {;} symbol:

(4) Use this option to restrict the types of data that LABDRIVE searches:

  • Metadata: For searching in the item metadata

  • File content: For searching in the content of indexed documents (the full text of a PDF, for instance)

  • Embedded metadata: For searching in the item's embedded metadata (EXIF metadata for images, headers for PDF files, etc.)

  • File name and path: For searching based on the item name

(5) For restricting your search to a particular archive node or group of containers.

(6) For filtering based on content tags.

(7) For filtering by file format (if the content has been characterized).

Use (8), (9) and (10) to browse your results.

You can point to files and folders with their direct link, using the identifier that your platform assigns to them when they are uploaded.

For instance, if a file is in container 9 and has the file ID 10981, as in this example:

You can use the following link to open the container details view with the file preselected:

https://acme.libnova.com/download/file/10981

This is useful, for instance, for telling a colleague "this is the file you are looking for".

To send someone a direct download link, so that the file downloads as soon as they open it, use:

https://acme.libnova.com/download/file/10981

But what if you want to provide a link to a file, folder or container that matches a certain query? You can do so using the Search Links functionality.

This functionality is particularly useful for integrations. For instance, you can ingest an object into your platform, assign a unique identifier to it, and then generate HTTP links from other platforms that open the ingested element, without needing to know the file ID that the platform assigns to it, and even if the file changes its location or name over time.

For files and folders, open the search console in the Management Interface by selecting Content Search:

When you build your query and click Search, the URL is updated to include the query. For instance, if you search for objects that contain a particular value in a field, as shown here:

This way, you can send your colleagues a link to a search query that displays the results to them (note that the results depend on their permissions).

If one of your fields holds a unique identifier, the link can take them not only to the search results page but directly to the particular file that matches the query (only one item should match).

For instance, let's say that for a given object you have the following metadata:

If you craft and open the following URL:

https://acme.libnova.com/search/files?q=uuid:6a937735-78a9-474b-bfd6-6486bdd2351c&direct=view_file

And there is only one file or folder with the selected UUID, the platform will redirect you to the folder/file, leaving it selected:

This method works the same way for files and folders. For files only (not folders), you can also request a direct download:

https://acme.libnova.com/search/files?q=uuid:6a937735-78a9-474b-bfd6-6486bdd2351c&direct=download_file

This immediately starts downloading the file that matches your query.

The same is valid for containers. If you have a container with the following metadata:

You can create a link like:

https://acme.libnova.com/search/containers?q=uuid:7e1b6910-6ba1-40f3-9ce9-8e56853f56fa

This shows the search results, as in the previous example. For containers, there are additional options. If only one container matches, you can redirect the user to its container details view by extending the search link as follows:

https://acme.libnova.com/search/containers?q=uuid:7e1b6910-6ba1-40f3-9ce9-8e56853f56fa&direct=view_container_details

You can also make the link open the Explore Content tab by using the following instead:

https://acme.libnova.com/search/containers?q=uuid:7e1b6910-6ba1-40f3-9ce9-8e56853f56fa&direct=view_container_content
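The search links above all follow the same pattern: a base URL, a /search/files or /search/containers path, a q parameter and an optional direct parameter. A minimal shell sketch that assembles such links is shown below. The function name build_search_link and its argument order are illustrative assumptions, not part of LABDRIVE, and the query is inserted as-is (a value containing spaces or reserved URL characters would need percent-encoding first).

```shell
# Hypothetical helper: assemble a search link from its parts.
# Usage: build_search_link <base_url> <files|containers> <query> [direct_action]
build_search_link() {
  local base_url="$1" target="$2" query="$3" direct="${4:-}"
  # Strip a trailing slash from the base URL so the path is not doubled.
  local link="${base_url%/}/search/${target}?q=${query}"
  if [ -n "$direct" ]; then
    link="${link}&direct=${direct}"
  fi
  printf '%s\n' "$link"
}

build_search_link "https://acme.libnova.com" files \
  "uuid:6a937735-78a9-474b-bfd6-6486bdd2351c" view_file
# prints: https://acme.libnova.com/search/files?q=uuid:6a937735-78a9-474b-bfd6-6486bdd2351c&direct=view_file
```

Omitting the fourth argument yields a plain search results link, which is the safer choice when more than one item may match the query.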

Search using the API

API examples here are just illustrative. Check the LABDRIVE API documentation for additional information and all available methods.

1. Sign in to the LABDRIVE Management Interface

2. Obtain your LABDRIVE API key by selecting your name and then Access Methods:

Then follow one of the methods below:

Search by container

To get a list of every item in a container, with all its properties, use this method:

curl --request POST \
  --url "$your_platform_url/api/file/elastic" \
  --header "Content-Type: application/json" \
  --header "authorization: Bearer $your_platform_api_key" \
  --data '{
	"must": [
		{
			"term": {
				"container_id": 3
			}
		}
	]
}'

Use:

  • url: Your LABDRIVE address

  • header: Your LABDRIVE API key, prefixed with "Bearer"

  • container_id: The id of the container to list.

You can also request the file list of more than one data container with a single query:

curl --request POST \
  --url "$your_platform_url/api/file/elastic" \
  --header "Content-Type: application/json" \
  --header "authorization: Bearer $your_platform_api_key" \
  --data '{
	"must": [
		{
			"terms": {
				"container_id": [
					1,
					2
				]
			}
		}
	]
}'

After a file is uploaded, search results may not include it for up to 3 seconds. Take this into account if your code searches for a file immediately after uploading it.
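Because a freshly uploaded file can take up to 3 seconds to appear in search results, code that uploads and then searches may need to retry. The following is a minimal sketch under assumptions: retry_search is a hypothetical helper, and the command you pass to it is expected to print nothing while the file is not yet found (how to detect an empty result depends on the response format, which is not shown here).

```shell
# Hypothetical helper: run a search command until it prints something,
# waiting between attempts to give the index time to catch up.
# Usage: retry_search <attempts> <delay_seconds> <command> [args...]
retry_search() {
  local attempts="$1" delay="$2" result
  shift 2
  while [ "$attempts" -gt 0 ]; do
    # Succeed as soon as the command exits cleanly with non-empty output.
    if result="$("$@")" && [ -n "$result" ]; then
      printf '%s\n' "$result"
      return 0
    fi
    attempts=$((attempts - 1))
    # Only sleep if another attempt is coming.
    if [ "$attempts" -gt 0 ]; then
      sleep "$delay"
    fi
  done
  return 1
}

# Example (find_uploaded_file is a placeholder for your own search command):
# retry_search 4 1 find_uploaded_file
```

Four attempts with a one-second delay cover the 3-second window mentioned above; the helper returns a non-zero status if nothing was found, so the caller can decide how to handle it.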

This section is just an introduction. Many operators and properties are available for search. Make sure you read Advanced API file search.

Alternatively, you can list every item in a container, with all its properties, using the /api/file method:

curl --request GET  --url "$your_platform_url/api/file" \
       --header "Content-Type: application/json" \
       --header "authorization: Bearer $your_platform_api_key" \
       --data '{
          "conditions": [
              {
                  "container_id": 185
              }
          ],
          "limit": 100,
          "offset": 0
      }'

Or if you are looking for all files in container 185, with a size larger than 702 bytes and characterized as a PDF 1.5 (PRONOM fmt/19), you can use:

curl --request GET  --url "$your_platform_url/api/file" \
       --header "Content-Type: application/json" \
       --header "authorization: Bearer $your_platform_api_key" \
       --data '{
          "conditions": [
              {
                  "container_id": 185
              },
              {
                  "size": {
                      "operator": "gt",
                      "value": 702
                  }
              },
              {
                  "type": "FILE"
              },
              {
                  "format": "fmt\/19"
              }
          ],
          "limit": 100,
          "offset": 0
      }'
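The limit and offset properties in the requests above suggest that large result sets can be retrieved page by page. As a hedged sketch, the helper below prints the request body for a given page. The name page_payload is hypothetical, and it assumes that offset is the number of items to skip rather than a page index, which the examples above do not state; check the LABDRIVE API documentation before relying on it.

```shell
# Hypothetical helper: print the request body for one page of results.
# Assumption: "offset" is the number of items to skip (not a page index).
# Usage: page_payload <container_id> <limit> <zero_based_page>
page_payload() {
  local container_id="$1" limit="$2" page="$3"
  local offset=$((page * limit))
  printf '{"conditions":[{"container_id":%d}],"limit":%d,"offset":%d}\n' \
    "$container_id" "$limit" "$offset"
}

page_payload 185 100 0   # prints: {"conditions":[{"container_id":185}],"limit":100,"offset":0}
page_payload 185 100 2   # prints: {"conditions":[{"container_id":185}],"limit":100,"offset":200}
```

The output can be passed to the --data option of the curl commands shown above.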

After an object's metadata is updated, search results may not reflect the change for up to 3 seconds. Take this into account if your code searches for an object immediately after editing its metadata.

You can search on any of the metadata associated with your items (item metadata, embedded metadata if available, file hash, etc.). You can also build complex queries that combine multiple query types and use wildcards.

Metadata field names in your queries are NOT the same as in your metadata schema. Adjust them with two changes:

  • Add "metadata." as a prefix.

  • Replace any "." or any other special character with an "_".

E.g.: If your IECODE is "dc.title", the search field will be "metadata.dc_title".

The following example searches the "metadata.dc_author" field for terms starting with "albert":

curl --request POST \
  --url "$your_platform_url/api/file/elastic" \
  --header "Content-Type: application/json" \
  --header "authorization: Bearer $your_platform_api_key" \
  --data '{
	"must": [
		{
			"nested": {
				"path": "metadata",
				"query": {
					"simple_query_string": {
						"analyze_wildcard": true,
						"fields": [
							"metadata.dc_author"
						],
						"query": "albert*",
						"default_operator": "and"
					}
				}
			}
		}
	]
}'
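The field-name adjustment described above (add the "metadata." prefix, replace special characters with "_") can be sketched as a small shell function. The name to_search_field is hypothetical, and the sketch assumes that a "special character" is anything other than an ASCII letter, digit or underscore, which may be broader or narrower than what LABDRIVE actually applies.

```shell
# Hypothetical helper: convert a schema field name (IECODE) into the name
# used in search queries. Assumption: a "special character" is anything
# other than an ASCII letter, digit or underscore.
to_search_field() {
  printf 'metadata.%s\n' "$(printf '%s' "$1" | sed 's/[^A-Za-z0-9_]/_/g')"
}

to_search_field "dc.title"    # prints: metadata.dc_title
to_search_field "dc.author"   # prints: metadata.dc_author
```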
