Power Bi

Uploaded by

rinkuaws9696.2

Q1 - Difference between multicast and conditional split: Multicast and conditional
split are components used in data integration or ETL (Extract, Transform, Load)
processes.

• Multicast: It is used to split a data flow into multiple identical copies. Each
copy is an independent branch of the data flow and can be processed
separately.
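• Conditional Split: It is used to divide a data flow into different branches based
on specified conditions. Typically (for example, in SSIS) each row is sent to
the output of the first condition it satisfies, and rows that match no
condition go to a default output. Each branch can then apply its own
processing logic.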
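
The difference can be sketched in a few lines of plain Python. This is only an illustration of the routing behavior, not how any ETL tool implements it; the function names multicast and conditional_split, the sample orders, and the first-match rule with a default output are assumptions made for the example.

```python
from copy import deepcopy


def multicast(rows, n_outputs):
    """Return n_outputs independent copies of the same rows."""
    return [deepcopy(rows) for _ in range(n_outputs)]


def conditional_split(rows, conditions, default="default"):
    """Route each row to the first named condition it satisfies."""
    outputs = {name: [] for name, _ in conditions}
    outputs[default] = []
    for row in rows:
        for name, predicate in conditions:
            if predicate(row):
                outputs[name].append(row)
                break
        else:
            # No condition matched: send the row to the default output.
            outputs[default].append(row)
    return outputs


# Hypothetical sample data.
orders = [
    {"id": 1, "amount": 50},
    {"id": 2, "amount": 500},
    {"id": 3, "amount": 5000},
]

# Multicast: every branch receives all three rows.
copies = multicast(orders, 2)

# Conditional split: each row lands in exactly one branch.
branches = conditional_split(
    orders,
    [
        ("large", lambda r: r["amount"] >= 1000),
        ("medium", lambda r: r["amount"] >= 100),
    ],
)
```

Here every copy produced by multicast holds the full data set, while conditional_split places each order in exactly one of the large, medium, or default branches.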

Q2 - What is normalization and denormalization: Normalization and denormalization
are techniques used in database design to organize data efficiently.

• Normalization: It is the process of organizing data into logical tables to
eliminate data redundancy and ensure data integrity. It involves breaking
down data into smaller, related tables and establishing relationships between
them using primary and foreign keys.
• Denormalization: It is the opposite of normalization and involves combining
tables or adding redundant data to improve performance by reducing the
number of joins required in queries. Denormalization sacrifices some level of
data integrity for improved query performance.
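
As a rough illustration, the sketch below builds both designs in an in-memory SQLite database using Python's sqlite3 module. The table names, columns, and sample rows are hypothetical and chosen only for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Normalized: customer details are stored once and referenced by key.
    CREATE TABLE customers (
        customer_id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        city TEXT NOT NULL
    );
    CREATE TABLE orders (
        order_id INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(customer_id),
        amount REAL NOT NULL
    );
    INSERT INTO customers VALUES (1, 'Asha', 'Pune'), (2, 'Ravi', 'Delhi');
    INSERT INTO orders VALUES (10, 1, 250.0), (11, 1, 75.0), (12, 2, 120.0);

    -- Denormalized: customer columns are copied onto every order row,
    -- so reads need no join but name and city are stored redundantly.
    CREATE TABLE orders_denormalized AS
    SELECT o.order_id, o.amount, c.name, c.city
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id;
""")

# Reading from the normalized design requires a join.
normalized_rows = conn.execute("""
    SELECT o.order_id, c.name, c.city, o.amount
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    ORDER BY o.order_id
""").fetchall()

# Reading from the denormalized design is a single-table query.
denormalized_rows = conn.execute("""
    SELECT order_id, name, city, amount
    FROM orders_denormalized
    ORDER BY order_id
""").fetchall()
```

Both queries return the same rows. The trade-off is that a change to a customer's city touches one row in the normalized design but every matching order row in the denormalized one.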

Q3 - Difference between CTE (Common Table Expression) and Temp Table: CTE and
Temp Table are used to store temporary results within a query or a session.

• CTE: It is a named temporary result set that can be referenced within a SELECT,
INSERT, UPDATE, or DELETE statement. It exists only for the duration of the
query and is typically used for complex queries to improve readability and
maintainability.
• Temp Table: It is a temporary table stored in the tempdb database (in SQL
Server). It can be created and used like a regular table within a session.
Temp tables persist until they are explicitly dropped or until the session
ends.
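
The scoping difference can be seen in a small sketch. It uses Python's sqlite3 module with an in-memory SQLite database as a stand-in, so there is no tempdb here and the CREATE TEMP TABLE syntax differs from SQL Server's #table naming; the sales table and its rows are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (region TEXT, amount INTEGER);
    INSERT INTO sales VALUES ('North', 100), ('North', 300), ('South', 50);
""")

# CTE: region_totals exists only inside this single statement.
cte_rows = conn.execute("""
    WITH region_totals AS (
        SELECT region, SUM(amount) AS total
        FROM sales
        GROUP BY region
    )
    SELECT region, total FROM region_totals WHERE total > 100
""").fetchall()

# Temp table: temp_region_totals stays available to later statements
# on the same connection until it is dropped or the connection closes.
conn.execute("""
    CREATE TEMP TABLE temp_region_totals AS
    SELECT region, SUM(amount) AS total
    FROM sales
    GROUP BY region
""")
temp_rows = conn.execute(
    "SELECT region, total FROM temp_region_totals WHERE total > 100"
).fetchall()
temp_count = conn.execute(
    "SELECT COUNT(*) FROM temp_region_totals"
).fetchone()[0]
```

The CTE and the temp table give the same filtered result, but only the temp table can be queried again by a second statement.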

Q4 - Difference between UNION and MERGE: UNION and MERGE are used in SQL to
combine or merge data from multiple tables.

• UNION: It combines the result sets of two or more SELECT statements into a
single result set, eliminating duplicates by default.
• MERGE: It is used to perform insert, update, or delete operations on a target
table based on the data from a source table. It allows you to synchronize data
between two tables based on specified conditions.

Q5 - What is an index: An index is a database structure that improves the speed of
data retrieval operations on database tables. It is created on one or more columns of
a table and allows the database engine to quickly locate and access the data based
on the indexed columns. Indexes can significantly improve query performance, at
the cost of extra storage and slower writes, because each index must be updated
whenever the underlying data changes.

Q6 - Difference between UNION and UNION ALL: UNION and UNION ALL are used
to combine data from multiple SELECT statements in SQL.

• UNION: It combines the result sets of two or more SELECT statements into a
single result set, eliminating duplicates from the final result.
• UNION ALL: It also combines the result sets of multiple SELECT statements
into a single result set, but it does not remove duplicates. It includes all rows
from all SELECT statements, even if they are duplicates.
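
A minimal sketch of the difference, run against an in-memory SQLite database through Python's sqlite3 module. The two customer tables and the email addresses are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE online_customers (email TEXT);
    CREATE TABLE store_customers (email TEXT);
    INSERT INTO online_customers VALUES ('a@example.com'), ('b@example.com');
    INSERT INTO store_customers VALUES ('b@example.com'), ('c@example.com');
""")

# UNION removes the duplicate 'b@example.com'.
union_rows = conn.execute("""
    SELECT email FROM online_customers
    UNION
    SELECT email FROM store_customers
    ORDER BY email
""").fetchall()

# UNION ALL keeps every row from both SELECT statements.
union_all_rows = conn.execute("""
    SELECT email FROM online_customers
    UNION ALL
    SELECT email FROM store_customers
    ORDER BY email
""").fetchall()
```

With this data, UNION returns three distinct addresses while UNION ALL returns four rows. UNION ALL is usually the cheaper choice when duplicates are impossible or acceptable, since it skips the de-duplication step.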

Q7 - Where is an index used: Indexes are used in databases to speed up the retrieval
of data. They are primarily used in database tables where there are frequent search
operations, such as SELECT statements with WHERE clauses. By creating indexes on
columns used in search conditions, the database engine can quickly locate the
required data, reducing the need for full table scans and improving query
performance.
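
The effect on a WHERE clause can be observed with SQLite's EXPLAIN QUERY PLAN, shown here through Python's sqlite3 module as a small sketch. The orders table, the index name idx_orders_customer, and the generated rows are assumptions for the example, and other database engines expose query plans differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(i % 100, float(i)) for i in range(1000)],
)

query = "EXPLAIN QUERY PLAN SELECT amount FROM orders WHERE customer_id = 42"


def plan_detail(connection):
    # The last column of each EXPLAIN QUERY PLAN row is a readable description.
    return " ".join(row[-1] for row in connection.execute(query))


# Without an index the engine has to scan the whole table.
plan_before = plan_detail(conn)

# With an index on the searched column it can look up matching rows directly.
conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
plan_after = plan_detail(conn)
```

Before the index is created the plan describes a scan of orders; afterwards it describes a search that uses idx_orders_customer. The exact wording of the plan text varies between SQLite versions.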

Q8 - How can duplicate records be identified: To identify duplicate records in a
database table, you can use the following SQL query:

SELECT column1, column2, ...
FROM table_name
GROUP BY column1, column2, ...
HAVING COUNT(*) > 1;

This query groups the rows based on the specified columns and then keeps only the
groups that contain more than one row. Each group returned represents a
combination of values that appears more than once, indicating duplicate records.
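
As a runnable sketch of the same GROUP BY ... HAVING pattern, the example below uses Python's sqlite3 module with an in-memory database. The contacts table and its rows are hypothetical, and a COUNT(*) column is added to show how many times each duplicate occurs.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE contacts (name TEXT, email TEXT);
    INSERT INTO contacts VALUES
        ('Asha', 'asha@example.com'),
        ('Ravi', 'ravi@example.com'),
        ('Asha', 'asha@example.com'),
        ('Asha', 'asha@example.com');
""")

# Group on the columns that define a duplicate and keep groups of size > 1.
duplicates = conn.execute("""
    SELECT name, email, COUNT(*) AS occurrences
    FROM contacts
    GROUP BY name, email
    HAVING COUNT(*) > 1
""").fetchall()
```

Only the Asha row is returned, with an occurrence count of 3. Ravi appears once and is filtered out by the HAVING clause.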

Q9 - What is a stored procedure and how can we reduce running cost:

• A stored procedure is a named block of SQL code that is stored in a database
and can be executed repeatedly. It allows you to encapsulate a series of SQL
statements and logic into a single unit, which can be called and executed as
needed. Stored procedures provide advantages such as code reusability,
improved performance, and enhanced security.

To reduce running costs, you can optimize stored procedures by:

1. Minimizing network traffic by executing complex tasks on the database server.
2. Properly indexing tables to improve query performance.
3. Using parameterized queries to avoid SQL injection and allow execution plans
to be reused.
4. Avoiding unnecessary looping or recursion within the stored procedure.
5. Regularly monitoring and optimizing the stored procedure's performance.
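
Item 3 can be illustrated with a short sketch of parameter binding. SQLite has no stored procedures, so this uses a plain parameterized query issued from Python's sqlite3 module; the users table and the find_users helper are hypothetical, and the example shows only the injection-safety aspect, not plan reuse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users (user_id INTEGER PRIMARY KEY, name TEXT);
    INSERT INTO users (name) VALUES ('asha'), ('ravi');
""")


def find_users(connection, name):
    # The ? placeholder binds name as a value, never as SQL text.
    return connection.execute(
        "SELECT user_id, name FROM users WHERE name = ?", (name,)
    ).fetchall()


# A normal lookup finds the matching user.
normal_result = find_users(conn, "asha")

# An injection attempt is compared as a literal string and matches nothing.
injection_result = find_users(conn, "asha' OR '1'='1")
```

Because the input is bound as data, the injection string cannot change the structure of the query, and the statement text stays identical across calls.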

Q10 - What are functions:

• In the context of databases or programming, functions are named routines
that accept input values, perform specific operations, and return a result.
Functions can be used to encapsulate commonly used calculations,
transformations, or data manipulations. They promote code reuse, improve
readability, and simplify complex operations.

Q11 - What is RLS (Row-Level Security):

• Row-Level Security (RLS) is a feature that allows you to restrict access to rows
within database tables based on user permissions or conditions defined by a
security policy. RLS ensures that users can only access the data that they are
authorized to see, based on the attributes of the rows. It provides an
additional layer of security and helps enforce data privacy and confidentiality.

Q12 - What is time intelligence in Power BI:

• Time intelligence refers to the ability to analyze and visualize data over
different time periods or time-related dimensions in Power BI. It involves
performing calculations and comparisons based on dates, such as
year-to-date, quarter-to-date, or period-over-period comparisons. DAX time
intelligence functions such as TOTALYTD, SAMEPERIODLASTYEAR, and DATEADD,
which generally expect a dedicated date table in the model, enable users to
gain insights into trends, patterns, and performance metrics over time.
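
To make the idea of a year-to-date calculation concrete, here is a sketch of the underlying arithmetic as a SQL running total, executed through Python's sqlite3 module. It is an analog of the concept, not DAX and not how Power BI computes it; the monthly_sales table and its figures are invented, and window functions assume SQLite 3.25 or later.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE monthly_sales (
        sales_year INTEGER,
        sales_month INTEGER,
        amount INTEGER
    );
    INSERT INTO monthly_sales VALUES
        (2023, 1, 100), (2023, 2, 150), (2023, 3, 200),
        (2024, 1, 120), (2024, 2, 180);
""")

# Running total that restarts at the beginning of each year.
ytd_rows = conn.execute("""
    SELECT sales_year, sales_month, amount,
           SUM(amount) OVER (
               PARTITION BY sales_year
               ORDER BY sales_month
           ) AS year_to_date
    FROM monthly_sales
    ORDER BY sales_year, sales_month
""").fetchall()
```

For 2023 the year-to-date column accumulates to 100, 250, and 450, then resets for 2024 to 120 and 300.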

Q13 - Many-to-many relationship in Power BI:

• A many-to-many relationship in Power BI refers to a relationship between two
tables where multiple records in one table can be associated with multiple
records in another table. A common way to model this is a bridge or junction
table that connects the two tables through two one-to-many relationships.
Power BI also lets you define a relationship with many-to-many cardinality
directly, enabling you to analyze and visualize data from multiple tables
accurately.
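
The bridge-table pattern can be sketched at the database level with Python's sqlite3 module. The students, courses, and enrollments tables are hypothetical, and the example shows only the table design, not the Power BI relationship settings.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE students (student_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE courses (course_id INTEGER PRIMARY KEY, title TEXT);

    -- Bridge table: one row per (student, course) pairing.
    CREATE TABLE enrollments (
        student_id INTEGER REFERENCES students(student_id),
        course_id INTEGER REFERENCES courses(course_id),
        PRIMARY KEY (student_id, course_id)
    );

    INSERT INTO students VALUES (1, 'Asha'), (2, 'Ravi');
    INSERT INTO courses VALUES (10, 'SQL'), (20, 'DAX');
    INSERT INTO enrollments VALUES (1, 10), (1, 20), (2, 10);
""")

# Join through the bridge table to list each student's courses.
pairs = conn.execute("""
    SELECT s.name, c.title
    FROM enrollments AS e
    JOIN students AS s ON s.student_id = e.student_id
    JOIN courses AS c ON c.course_id = e.course_id
    ORDER BY s.name, c.title
""").fetchall()
```

Asha is linked to two courses and the SQL course is linked to two students, yet each underlying relationship (students to enrollments, courses to enrollments) is one-to-many.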

Q14 - What is the benefit of using a variable index:

• The term "variable index" is not commonly used in the context of databases or
indexing. However, if you meant a clustered index or a non-clustered index,
their benefits include:
    • Improved query performance: Indexes allow the database engine to
    quickly locate and retrieve data, reducing the need for full table scans.
    • Efficient data retrieval: Indexes enable data to be accessed in a sorted
    or filtered manner, optimizing search operations.
    • Reduced disk I/O: Indexes store a subset of data, allowing the database
    engine to read fewer disk blocks when fetching data.
    • Enhanced data integrity: Indexes can enforce unique constraints and
    ensure data consistency.

Q15 - What is the difference between RANK and ROW_NUMBER: In SQL or Power BI,
RANK and ROW_NUMBER are window functions used to number the rows within a
result set or a partition.

• RANK: The RANK function assigns a rank to each row based on a specified
column or expression. The ranks are not necessarily unique: rows with the same
values receive the same rank, and the ranks that would have followed are
skipped. For example, if two rows have the highest sales, they both receive
rank 1, and the next row receives rank 3.
• ROW_NUMBER: The ROW_NUMBER function assigns a unique number to each
row in the result set, regardless of any ties or duplicate values. Each row is
assigned a distinct sequential number based on the order specified in the
query. For example, the first row receives number 1, the second row receives
number 2, and so on.
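
The tie-handling difference can be checked with a small sketch run through Python's sqlite3 module. The sales table and the names in it are hypothetical, the extra ORDER BY column in ROW_NUMBER is added only to make the result deterministic, and window functions assume SQLite 3.25 or later.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (rep TEXT, amount INTEGER);
    INSERT INTO sales VALUES ('Asha', 500), ('Ravi', 500), ('Meena', 300);
""")

ranked = conn.execute("""
    SELECT rep, amount,
           RANK() OVER (ORDER BY amount DESC) AS sales_rank,
           ROW_NUMBER() OVER (ORDER BY amount DESC, rep) AS row_num
    FROM sales
    ORDER BY row_num
""").fetchall()
```

Asha and Ravi tie on amount, so both get rank 1 and Meena gets rank 3, while the row numbers run 1, 2, 3 without gaps.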

Q16 - Difference between FILTER and FILTER ALL:

• FILTER and ALL are DAX functions used in Power BI. DAX has no single function
named FILTER ALL; the phrase usually refers to using the two together, as in
FILTER(ALL(Table), condition).
• FILTER: The FILTER function iterates over a table or a table expression and
returns a new table that includes only the rows that meet the specified
condition. It respects any filters already applied to that table.
• FILTER with ALL: The ALL function returns all the rows of a table, or all the
values of a column, ignoring any filters that have been applied. Wrapping it
in FILTER therefore evaluates the condition against every row of the original
table, regardless of existing filters such as slicers.

Q17 - COUNT and COUNTX in Power BI:

• COUNT and COUNTX are aggregation functions used in Power BI to count
non-blank values.
• COUNT: The COUNT function takes a single column and returns the number of
rows that contain non-blank values in it. Rows with blank values are not
counted.
• COUNTX: The COUNTX function takes a table and an expression. It evaluates
the expression for each row of the table and counts the rows where the result
is non-blank. To count all rows regardless of blanks, use COUNTROWS instead.
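
The following plain-Python sketch emulates the counting semantics only; it is not DAX and ignores details such as data types and filter context. It assumes that COUNT counts non-blank values in a column and that COUNTX counts non-blank results of a per-row expression. The helper names count_column, countx, and line_total, and the sample rows, are hypothetical, with None standing in for a blank.

```python
def count_column(rows, column):
    """Emulate COUNT: count rows whose value in column is not blank."""
    return sum(1 for row in rows if row.get(column) is not None)


def countx(rows, expression):
    """Emulate COUNTX: evaluate expression per row, count non-blank results."""
    return sum(1 for row in rows if expression(row) is not None)


# Hypothetical sample table; None plays the role of a blank value.
sales = [
    {"product": "A", "quantity": 2, "unit_price": 10.0},
    {"product": "B", "quantity": None, "unit_price": 5.0},
    {"product": "C", "quantity": 4, "unit_price": None},
]


def line_total(row):
    # Assumption: the product is blank when either input is blank.
    if row["quantity"] is None or row["unit_price"] is None:
        return None
    return row["quantity"] * row["unit_price"]


quantity_count = count_column(sales, "quantity")  # non-blank quantities
line_total_count = countx(sales, line_total)      # non-blank line totals
row_count = len(sales)                            # all rows, like COUNTROWS
```

With this data two rows have a quantity, only one row yields a non-blank line total, and the table has three rows in all, which shows why the three counts can differ.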

Q18 - What data sources can Power BI connect to:

• Power BI can connect to a wide range of data sources, including:

1. Databases: SQL Server, Oracle, MySQL, PostgreSQL, etc.
2. Cloud Services: Azure SQL Database, Azure Data Lake, Amazon Redshift,
Google BigQuery, etc.
3. Excel Files: Local Excel files or Excel workbooks stored in cloud platforms.
4. SharePoint Lists and Libraries.
5. Web Services and APIs: RESTful APIs, OData feeds, JSON, XML, etc.
6. Files: CSV, XML, JSON, text files, etc.
7. Online Services: Dynamics 365, Salesforce, Google Analytics, Azure DevOps,
etc.
8. Other data sources: Hadoop, SharePoint Online, Exchange, Power Platform,
etc.

Power BI provides various connectors and integration options to connect to these
data sources and import data for analysis and visualization.

Please note that the availability of specific data sources may depend on the version
and licensing of Power BI you are using.
