One of the main features of SAS® Visual Investigator is the ability to search and visualize customer data by indexing data into openSearch. The inability to index the data prevents users from accessing new data. Eventually, users might be unable to access data at all.
The indexing job might fail because one or more of the data nodes in openSearch reaches 100% disk capacity. The data notes might reach 100% disk capacity when an indexing failure leaves behind duplicate indices in openSearch, which consumes addition disk space.
When you run a full index on a data object, the first step in openSearch is to create a new index and then write data from the source database table to the index until all data rows are exhausted. At the end of the indexing process, the new index is registered to SAS Visual Investigator, and the old index is deleted, releasing disk space. If a failure occurs during the indexing process, it is possible that the old index might not be deleted and remains intact on openSearch, which consumes disk space.
As mentioned above, the old copy of the data exists while the new index is being populated. This process doubles the disk space that is needed in openSearch during indexing, even if for a brief time. As a result, disk usage is a very important metric to monitor closely to prevent the disk usage from reaching full capacity and affecting data accessibility in SAS Visual Investigator.
1) Determine the current disk usage for each opendistro pods.
Complete the following steps to view the disk space for each opendistro pod in openSearch:
2) Determine whether there are duplicate indices in openSearch that cause high disk usage
Compete the following steps to view the indices on openSearch:
Step 3: Delete all data indices in openSearch.
Note: The steps in this section require you to fully re-index all of the data.
Complete the following steps to delete the indices on openSearch. These instructions take care to avoid deleting anything besides the data indices.
At this point, the indices should be deleted, and you should have reclaimed the disk space. If one or more of the openSearch data pods still contains data, then you need to remove the orphaned data (dangling indices). Complete the additional steps below to reclaim the disk space.
4) Delete the orphaned data to reclaim disk space.
Execute the following instructions from outside of the opendistro pod.
If you are unable to delete the orphaned data (dangling indices), you can also delete the Persistent Volume Claims (PCVs) to reclaim the disk space.
5) Delete PVCs from openSearch if disk space is not reclaimed when you delete all of the data pods.
Note: The steps in this section require you to fully re-index all of the data. In addition, your pod names might differ. Lastly, these instructions are for stateful pods.
All of the following commands below use a variable for namespace: NS=<namespace>.