Vault Key Concepts

Object storage

Object storage is a data storage approach used to manage unstructured data in cloud infrastructure. In this storage method, data blocks of a file are grouped together as an object, along with relevant metadata and a custom identifier. These objects are then placed in a flat data environment known as a storage pool.

When accessing data from object storage, unique identifiers and metadata are used to locate the required objects, such as images or audio files. Users can customize metadata, which allows them to add additional context useful for various purposes, including data analytics and retrieval.

Object storage systems offer RESTful APIs, HTTP, and HTTPS interfaces to query object metadata, making it easy to locate and access specific data. The global storage pool architecture ensures fast access to the required data, and the flat environment enables seamless scaling, even for massive loads like petabytes or exabytes. Storage pools can be distributed across multiple object storage devices and geographical locations, facilitating unlimited scalability. As data grows, users can simply add more storage devices to the pool.

Object storage's elasticity and scalability make it well-suited for managing unstructured data in the cloud. As a cloud service, it provides on-demand access to object-based storage. Major cloud service providers primarily use cloud object storage as their primary storage format, thanks to its numerous benefits.

In conclusion, object storage in the cloud refers to the use of object-based storage as a scalable and elastic on-demand cloud service, which is the preferred storage format for most major cloud providers due to its flexibility and efficiency in managing unstructured data.

File Storage

File storage is a data storage method that organizes information into folders, resembling the paper filing system used in physical offices. Similar to how you'd locate a document in a filing cabinet, file storage requires you to know the specific room, cabinet, drawer, and folder containing the desired file. In this hierarchical structure, files are assigned names, tagged with metadata, and then placed within appropriate folders.

To access data, you need to know the exact path leading to the desired file. However, as the number of files grows, searching and retrieving data can become time-consuming. File storage has limited scalability compared to other storage methods. Nonetheless, it remains a straightforward solution for storing small amounts of diverse data and facilitating multiple users' simultaneous access.

Block storage

Block storage improves on the performance of file storage, breaking files into separate blocks and storing them separately. A block-storage system will assign a unique identifier to each chunk of raw data, which can then be used to reassemble them into the complete file when you need to access it. Block storage doesn’t require a single path to data, so you can store it wherever is most convenient and still retrieve it quickly when needed.

Block storage works well for organizations that work with large amounts of transactional data or mission-critical applications that need minimal delay and consistent performance. However, it can be expensive, offers no metadata capabilities, and requires an operating system to access blocks.

Replay

The replay capability in the context of vault refers to the ability to re-execute data events or transactions at a later time. When data events occur within a vault environment, they are recorded in a log or event stream. The replay capability enables the system to use this log to reprocess the same events in the exact order they originally occurred.

Last updated