Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Q6. What is Splunk Indexer? What are the stages of Splunk Indexing?
Q7. What is a Splunk Forwarder? What are the types of Splunk Forwarders?
Q8. Can you name a few most important configuration files in Splunk?
The Splunk Interview Questions blog is largely divided into the following parts:
Check this video on Splunk Software Engineer Interview Questions and Answers:
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 2/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
2. What is Splunk?
Splunk is ‘Google’ for our machine-generated data. It’s a software/engine that can be used for searching, visualizing,
monitoring, reporting, etc. of our enterprise data. Splunk takes valuable machine data and turns it into powerful
operational intelligence by providing real-time insights into our data through charts, alerts, reports, etc.
Splunk Network port 514 (Used to get data from the Network port, i.e., UDP data)
KV Store 8191
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 3/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Submit
Universal Forwarder (UF): The Splunk agent installed on a non-Splunk system to gather data locally; it can’t parse or
index data.
Heavyweight Forwarder (HWF): A full instance of Splunk with advanced functionalities.
It generally works as a remote collector, intermediate forwarder, and possible data filter, and since it parses data, it is not
recommended for production systems.
$splunkhome/etc/system/default
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 4/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
If we plan to run a variety of summary index reports, we may need to create additional summary indexes.
Learn more about Splunk from this Splunk Training in New York to get ahead in your career!
Career Transition
16. Can you write down a general regular expression for extracting the IP address from
logs?
There are multiple ways in which we can extract the IP address from logs. Below are a few examples:
OR
17 E l i S T
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ i d 5/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
When the unique ID (from one or more fields) alone is not sufficient to discriminate between two transactions. This is
the case when the identifier is reused, for example, web sessions identified by a cookie/client IP. In this case, the time
span or pauses are also used to segment the data into transactions.
When an identifier is reused, say in DHCP logs, a particular message identifies the beginning or end of a transaction.
When it is desirable to see the raw text of events combined rather than an analysis of the constituent fields of the
events.
As the performance of the stats command is higher, it can be used especially in a distributed search environment
the time spent in each. This will give us a lot of information quickly such as which requests are hanging Splunk, which
requests are blameless, etc.
A bucket moves through several stages as it ages. Below are the various stages it goes through:
Hot: A hot bucket contains newly indexed data. It is open for writing. There can be one or more hot buckets for each
index.
Warm: A warm bucket consists of data rolled out from a hot bucket. There are many warm buckets.
Cold: A cold bucket has data that is rolled out from a warm bucket. There are many cold buckets.
Frozen: A frozen bucket is comprised of data rolled out from a cold bucket. The indexer deletes frozen data by default,
but we can archive it. Archived data can later be thawed (data in a frozen bucket is not searchable).
$SPLUNK_HOME/var/lib/splunk/defaultdb/db
We should see the hot-db there, and any warm buckets we have. By default, Splunk sets the bucket size to 10 GB for 64-bit
systems and 750 MB for 32-bit systems.
if the aggregation is pertinent to that event. The eventstats command computes requested statistics, like stats does,
but aggregates them to the original raw data.
27. What is the command used to check the running Splunk processes on Unix/Linux?
If we want to check the running Splunk Enterprise processes on Unix/Linux, we can make use of the following command:
28. What is the command used for enabling Splunk to boot start?
To boot start Splunk, we have to use the following command:
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 7/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Learn the complete concepts of Splunk from Intellipaat’s Splunk Training at Hyderabad in just 26 hours!
$SPLUNK_HOME/etc/system/local/
In the file, we will have to use the following command (here, in the place of ‘NEW_PASSWORD’, we will add our own new
password):
[user_info]
PASSWORD = NEW_PASSWORD
After that, we can just restart the Splunk Enterprise and use the new password to log in
Now, if we are using the versions prior to 7.1, we will follow the below steps:
Note: In case we have created other users earlier and know their login details, copy and paste their credentials from the
passwd.bk file into the passwd file and restart Splunk.
$splunk_home/var/log/splunk/searches.log
34. What is Btool? How will you troubleshoot Splunk configuration files?
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 8/16
9/26/22, 9:56 AM y p g
Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Splunk Btool is a command-line tool that helps us troubleshoot configuration file issues or just see what values are being
used by our Splunk Enterprise installation in the existing environment.
35. What is the difference between Splunk App and Splunk Add-on?
In fact, both contain preconfigured configuration, reports, etc., but the Splunk add-on does not have a visual app. On the
other hand, a Splunk app has a preconfigured visual app.
/opt/splunk/var/lib/splunk
It contains seek pointers and CRCs for the files we are indexing, so ‘splunkd’ can tell us if it has read them already. We can
access it through the GUI by searching for:
index=_thefishbucket
Are you interested in learning Splunk from experts? Intellipaat’s Splunk Course in Bangalore is the right choice!
In props.conf:
<code>[source::/var/log/foo]
# index processor
TRANSFORMS-set= setnull,setparsing
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 9/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
</code>
In transforms.conf:
[setparsing]
REGEX = login
DEST_KEY = queue
FORMAT = indexQueue
39. How can I understand when Splunk has finished indexing a log file?
We can figure this out:
If we are having trouble with data input and we want a way to troubleshoot it, particularly if our whitelist/blacklist rules are
not working the way we expected, we will go to the following URL:
https://yoursplunkhost:8089/services/admin/inputstatus
$SPLUNK_HOME/etc/system/local
$SPLUNK_HOME/etc/system/local/ui-prefs.conf file
includes:
[search]
dispatch.earliest_time = @d
dispatch.latest_time = now
The default time range that all users will see in the search app will be today.
http://docs.splunk.com/Documentation/Splunk/latest/Admin/Ui-prefsconf
$SPLUNK_HOME/var/run/splunk/dispatch
contains a directory for each search that is running or has completed. For example, a directory named 1434308943.358 will
contain a CSV file of its search results, a search.log with details about the search execution, and other stuff. Using the
defaults (which we can override in limits.conf), these directories will be deleted 10 minutes after the search completes—
unless the user saves the search results, in which case the results will be deleted after 7 days.
42. What is the difference between Search Head Pooling and Search Head Clustering?
Both are features provided by Splunk for the high availability of Splunk search head in case any search head goes down.
However, the search head cluster is newly introduced and search head pooling will be removed in the next upcoming
versions.
The search head cluster is managed by a captain, and the captain controls its slaves. The search head cluster is more
reliable and efficient than the search head pooling.
43. If I want to add folder access logs from a windows machine to Splunk, how do I do
it?
Below are the steps to add folder access logs to Splunk:
1. Enable Object Access Audit through group policy on the Windows machine on which the folder is located
/opt/splunk/var/lib/splunk
It contains seek pointers and CRCs for the files we are indexing, so splunkd can tell us if it has read them already.
http://www.learnsplunk.com/splunk-indexer-configuration.html#sthash.t1ixi19P.dpuf.
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 11/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
47. What is the difference between Splunk SDK and Splunk Framework?
Splunk SDKs are designed to allow us to develop applications from scratch and they do not require Splunk Web or any
components from the Splunk App Framework. These are separately licensed from Splunk and do not alter the Splunk
Software.
Splunk App Framework resides within the Splunk web server and permits us to customize the Splunk Web UI that comes
with the product and develop Splunk apps using the Splunk web server. It is an important part of the features and
functionalities of Splunk, which does not license users to modify anything in Splunk.
48. For what purpose inputlookup and outputlookup are used in Splunk Search?
The inputlookup command is used to search the contents of a Splunk lookup table. The lookup table can be a CSV lookup
or a KV store lookup. The inputlookup command is considered to be an event-generating command. An event-generating
command generates events or reports from one or more indexes without transforming them. There are many commands
that come under the event-generating commands such as metadata, loadjob, inputcsv, etc. The inputlookup command is
one of them.
Syntax:
Now coming to the outputlookup command, it writes the search results to a static lookup table, or KV store collection, that
we specify. The outputlookup command is not being used with external lookups.
Syntax:
Forwarder: You can see it as a dumb agent whose main task is to collect the data from various sources like remote
machines and transfer it to the indexer.
Indexer: The indexer will then process the data in real-time and store & index it on the localhost or cloud server.
Search Head: It allows the end-user to interact with the data and perform various operations like searching, analyzing,
and visualizing the information.
50. How to add the colors in Splunk UI based on the field names?
Splunk UI has a number of features that allow the administrator to make the reports more presentable. One such feature
that proves to be very useful for presenting distinguished results is the custom colors. For example, if the sales of a product
drop below a threshold value, then as an administrator you can set the chart to display the values in red color.
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 12/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
The administrator can also change chart colors in the Splunk Web UI by editing the panels from the panel settings
mentioned above the dashboard. Moreover, you can write the codes and use hexadecimal values to choose a color from
the palette.
This is what happens to the data at each stage of the indexing pipeline:
As soon as the data center the pipeline, it goes to the hot bucket. There can be multiple hot buckets at any point in
time, which you can both search and write to.
If any problem like the Splunk getting restarted or the hot bucket has reached a certain threshold value/size, then a
new bucket will be created in its place and the existing ones roll to become a warm bucket. These warm buckets are
searchable, but you cannot write anything in them.
Further, if the indexer reaches its maximum capacity, the warm bucket will be rolled to become a cold one. Splunk will
automatically execute the process by selecting the oldest warm bucket from the pipeline. However, it doesn’t rename
the bucket. All the above buckets will be stored in the default location
‘$SPLUNK_HOME/var/lib/splunk/defaultdb/db/*’.
After a certain period of time, the cold bucket rolls to become the frozen bucket. These buckets don’t have the same
location as the previous buckets and are non-searchable. These buckets can either be archived or deleted based on the
priorities.
You can’t do anything if the bucket is deleted, but you can retrieve the frozen bucket if it’s being archived. The process
of retrieving an archived bucket is known as thawing. Once a bucket is thawed it becomes searchable and stores into a
new location
‘$SPLUNK_HOME/var/lib/splunk/defaultdb/thaweddb/’
Pivots, on the other hand, give you the flexibility to create multiple views and see the results as per the requirements. With
pivots, even the managers of stakeholders from non-technical backgrounds can create views and get more details about
their departments.
Real-time dashboards
Dynamic form-based dashboards
Dashboards for scheduled reports
Real-time alerts: we can divide the real-time alerts into two parts, pre-result, and rolling-window alerts. The pre-result
alert gets triggered with every search, while rolling-window alerts are triggered when a specific criterion is met by the
search.
Scheduled Alerts: As the name suggests, scheduled alerts can be initialized to trigger multiple alerts based on the set
criteria.
Replication factor: The replication factor (RF) determines the number of users that can receive copies of your data/buckets.
However, the search factor should not be greater than the replication factor.
./splunk start
./splunk stop
Erex
Abstract
Typer
Rename
Anomalies
Fill down
https://intellipaat.com/blog/interview-question/splunk-interview-questions/ 14/16
9/26/22, 9:56 AM Top 60 Splunk Interview Questions & Answers 2022 - Intellipaat
Accum
Add totals
Fast mode: speeds up your search result by limiting the types of data.
Verbose mode: Slower as compared to the fast mode, but returns the information for as many events as possible.
Smart mode: It toggles between different modes and search behaviors to provide maximum results in the shortest
period of time.
Next
Course Schedule
Name Date
2022-09-24 2022-09-18
2022-10-01 2022-09-25
2022-10-08 2022-10-02
Helpful to us
Reply
Babulal says:
Thanks for sharing very effective Interview questions and answers. MAY 3, 2016 AT 6:01 PM
Reply
Nidhi says: