huangbri
diff --git a/‎Tools/Advisor/matrix_multiply_advisor/README.md
Lines changed: 40 additions & 13 deletions b/‎Tools/Advisor/matrix_multiply_advisor/README.md
Lines changed: 40 additions & 13 deletions
diff --git a/‎Tools/ApplicationDebugger/array-transform/README.md
Lines changed: 35 additions & 19 deletions b/‎Tools/ApplicationDebugger/array-transform/README.md
Lines changed: 35 additions & 19 deletions
diff --git a/‎Tools/Benchmarks/STREAM/README.md
Lines changed: 47 additions & 17 deletions b/‎Tools/Benchmarks/STREAM/README.md
Lines changed: 47 additions & 17 deletions
diff --git a/‎Tools/GPU-Occupancy-Calculator/README.md
Lines changed: 19 additions & 9 deletions b/‎Tools/GPU-Occupancy-Calculator/README.md
Lines changed: 19 additions & 9 deletions
@@ -1,5 +1,6 @@
 # `Matrix Multiply` Sample
-A sample containing multiple implementations of matrix multiplication code sample and  is implemented using the DPC++ language for CPU and GPU.
+A sample containing multiple implementations of matrix multiplication code
+sample and is implemented using the DPC++ language for CPU and GPU.
 
 | Optimized for                       | Description
 |:---                               |:---
@@ -11,42 +12,60 @@ A sample containing multiple implementations of matrix multiplication code sampl
 
 ## Purpose
 
-The Matrix Multiplication sample performs basic matrix multiplication. Three versions are provided that use different features of DPC++.
+The Matrix Multiplication sample performs basic matrix multiplication. Three
+versions are provided that use different features of DPC++.
 
 ## Key Implementation details
 
-The basic DPC++ implementation explained in the code includes device selector, buffer, accessor, kernel, and command groups.
-The include folder is located at %ONEAPI_ROOT%\dev-utilities\latest\include on your development system.
+The basic DPC++ implementation explained in the code includes device selector,
+buffer, accessor, kernel, and command groups. The include folder is located at
+%ONEAPI_ROOT%\dev-utilities\latest\include on your development system.
 
 ## License
 Code samples are licensed under the MIT license. See
 [License.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/License.txt) for details.
 
-Third party program Licenses can be found here: [third-party-programs.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/third-party-programs.txt)
+Third party program Licenses can be found here:
+[third-party-programs.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/third-party-programs.txt)
 
 
 ### Running Samples In DevCloud
-Running samples in the Intel DevCloud requires you to specify a compute node. For specific instructions, jump to [Run the Matrix Multiply Advisor sample on the DevCloud](#run-matmul-advisor-on-devcloud)
+Running samples in the Intel DevCloud requires you to specify a compute node.
+For specific instructions, jump to [Run the Matrix Multiply Advisor sample on the DevCloud](#run-matmul-advisor-on-devcloud)
 
-### Using Visual Studio Code*  (Optional)
+## Using Visual Studio Code* (Optional)
 
-You can use Visual Studio Code (VS Code) extensions to set your environment, create launch configurations,
-and browse and download samples.
+You can use Visual Studio Code (VS Code) extensions to set your environment,
+create launch configurations, and browse and download samples.
 
 The basic steps to build and run a sample using VS Code include:
  - Download a sample using the extension **Code Sample Browser for Intel oneAPI Toolkits**.
  - Configure the oneAPI environment with the extension **Environment Configurator for Intel oneAPI Toolkits**.
  - Open a Terminal in VS Code (**Terminal>New Terminal**).
  - Run the sample in the VS Code terminal using the instructions below.
+ - (Linux only) Debug your GPU application with GDB for Intel® oneAPI toolkits using the **Generate Launch Configurations** extension.
 
-To learn more about the extensions and how to configure the oneAPI environment, see
-[Using Visual Studio Code with Intel® oneAPI Toolkits](https://software.intel.com/content/www/us/en/develop/documentation/using-vs-code-with-intel-oneapi/top.html).
+To learn more about the extensions, see
+[Using Visual Studio Code with Intel® oneAPI Toolkits](https://www.intel.com/content/www/us/en/develop/documentation/using-vs-code-with-intel-oneapi/top.html).
 
-After learning how to use the extensions for Intel oneAPI Toolkits, return to this readme for instructions on how to build and run a sample.
+After learning how to use the extensions for Intel oneAPI Toolkits, return to
+this readme for instructions on how to build and run a sample.
 
 ## How to Build
 
-This sample contains 3 version of matrix multiplication using DPC++:
+> **Note**: If you have not already done so, set up your CLI
+> environment by sourcing  the `setvars` script located in
+> the root of your oneAPI installation.
+>
+> Linux Sudo: . /opt/intel/oneapi/setvars.sh
+>
+> Linux User: . ~/intel/oneapi/setvars.sh
+>
+> Windows: C:\Program Files(x86)\Intel\oneAPI\setvars.bat
+>
+> For more information on environment variables, see Use the setvars Script for [Linux or macOS](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-linux-or-macos.html), or [Windows](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-windows.html).
+
+This sample contains three versions of matrix multiplication using DPC++:
 
     multiply1 – basic implementation of matrix multiply using DPC++
     multiply1_1 – basic implementation that replaces the buffer store with a local accessor “acc” to reduce memory traffic
@@ -65,6 +84,14 @@ Edit the line in src/multiply.hpp to select the version of the multiply function
     Clean the program
     make clean
 
+If an error occurs, you can get more details by running `make` with
+the `VERBOSE=1` argument:
+``make VERBOSE=1``
+For more comprehensive troubleshooting, use the Diagnostics Utility for
+Intel® oneAPI Toolkits, which provides system checks to find missing
+dependencies and permissions errors.
+[Learn more](https://www.intel.com/content/www/us/en/develop/documentation/diagnostic-utility-user-guide/top.html).
+
 ### On a Windows* System Using Visual Studio 2017 or newer
    * Open Visual Studio 2017
    * Select Menu "File > Open > Project/Solution", find "matrix_multiply" folder and select "matrix_multiply.sln"
 
@@ -50,23 +50,45 @@ Code samples are licensed under the MIT license. See
 
 Third party program Licenses can be found here: [third-party-programs.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/third-party-programs.txt)
 
+## Using Visual Studio Code* (Optional)
+
+You can use Visual Studio Code (VS Code) extensions to set your environment,
+create launch configurations, and browse and download samples.
+
+The basic steps to build and run a sample using VS Code include:
+ - Download a sample using the extension **Code Sample Browser for Intel oneAPI Toolkits**.
+ - Configure the oneAPI environment with the extension **Environment Configurator for Intel oneAPI Toolkits**.
+ - Open a Terminal in VS Code (**Terminal>New Terminal**).
+ - Run the sample in the VS Code terminal using the instructions below.
+ - (Linux only) Debug your GPU application with GDB for Intel® oneAPI toolkits using the **Generate Launch Configurations** extension.
+
+To learn more about the extensions, see
+[Using Visual Studio Code with Intel® oneAPI Toolkits](https://www.intel.com/content/www/us/en/develop/documentation/using-vs-code-with-intel-oneapi/top.html).
+
+After learning how to use the extensions for Intel oneAPI Toolkits, return to
+this readme for instructions on how to build and run a sample.
+
 ## Building and Running the `array-transform` Program
 
-> Note: if you have not already done so, set up your CLI
-> environment by sourcing  the setvars script located in
+> **Note**: If you have not already done so, set up your CLI
+> environment by sourcing  the `setvars` script located in
 > the root of your oneAPI installation.
 >
 > Linux Sudo: . /opt/intel/oneapi/setvars.sh
+>
 > Linux User: . ~/intel/oneapi/setvars.sh
+>
 > Windows: C:\Program Files(x86)\Intel\oneAPI\setvars.bat
+>
+>For more information on environment variables, see Use the setvars Script for [Linux or macOS](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-linux-or-macos.html), or [Windows](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-windows.html).
 
 ### Setup
 
 Preliminary setup steps are needed for the debugger to function.
 Please see the setup instructions in the Get Started Guide based on
 your OS:
-[Linux](https://software.intel.com/en-us/get-started-with-debugging-dpcpp-linux),
-[Windows](https://software.intel.com/en-us/get-started-with-debugging-dpcpp-windows).
+[Linux](https://www.intel.com/en-us/get-started-with-debugging-dpcpp-linux),
+[Windows](https://www.intel.com/en-us/get-started-with-debugging-dpcpp-windows).
 
 
 ### Include Files
@@ -91,21 +113,6 @@ For more information, see the Intel® oneAPI
 Base Toolkit Get Started Guide
 (https://devcloud.intel.com/oneapi/get-started/base-toolkit/).
 
-### Using Visual Studio Code*  (Optional)
-
-You can use Visual Studio Code (VS Code) extensions to set your environment, create launch configurations,
-and browse and download samples.
-
-The basic steps to build and run a sample using VS Code include:
- - Download a sample using the extension **Code Sample Browser for Intel® oneAPI Toolkits**.
- - Configure the oneAPI environment with the extension **Environment Configurator for Intel® oneAPI Toolkits**.
- - Open a Terminal in VS Code (**Terminal>New Terminal**).
- - Run the sample in the VS Code terminal using the instructions below.
-
-To learn more about the extensions and how to configure the oneAPI environment, see
-[Using Visual Studio Code with Intel® oneAPI Toolkits](https://software.intel.com/content/www/us/en/develop/documentation/using-vs-code-with-intel-oneapi/top.html).
-
-After learning how to use the Extension Pack for Intel® oneAPI Toolkits, return to this readme for instructions on how to build and run a sample.
 
 
 ### Auto-Attach
@@ -191,6 +198,15 @@ For instructions about starting and using the debugger, please
 see the
 [Get Started Guide (Linux)](https://software.intel.com/en-us/get-started-with-debugging-dpcpp-linux).
 
+
+If an error occurs, you can get more details by running `make` with
+the `VERBOSE=1` argument:
+``make VERBOSE=1``
+For more comprehensive troubleshooting, use the Diagnostics Utility for
+Intel® oneAPI Toolkits, which provides system checks to find missing
+dependencies and permissions errors.
+[Learn more](https://www.intel.com/content/www/us/en/develop/documentation/diagnostic-utility-user-guide/top.html).
+
 ### On a Windows* System Using Visual Studio* Version 2017 or Newer
 
 #### Command line using MSBuild
 
@@ -1,12 +1,12 @@
 # STREAM Sample
 
-This package contains a modified version of the [Stream Benchmark](http://www.cs.virginia.edu/stream/) implementation using DPC++ for CPU and GPU. 
+This package contains a modified version of the [Stream Benchmark](http://www.cs.virginia.edu/stream/) implementation using DPC++ for CPU and GPU.
 
 
 | Optimized for                       | Description
 |:---                               |:---
 | OS                                | Linux* Ubuntu* 20.04
-| Hardware                          | GEN9, Iris-Xe Max 
+| Hardware                          | GEN9, Iris-Xe Max
 | Software                          | Intel&reg; oneAPI DPC++ Compiler
 | What you will learn               | How to benchmark the memory bandwidth using STREAM.
 | Time to complete                  | 5 minutes
@@ -15,7 +15,7 @@ This package contains a modified version of the [Stream Benchmark](http://www.cs
 ## Purpose
 The STREAM sample performs the memory bandwidth benchmark.
 
-## Key Implementation Details 
+## Key Implementation Details
 This sample contains a STREAM implementation using DPC++ for CPU and GPU and is a variant of the [STREAM](http://www.cs.virginia.edu/stream/) benchmark code. Please review the license terms regarding publishing benchmarks.”
 
 ## License
@@ -35,45 +35,66 @@ For the original [Stream License]( http://www.cs.virginia.edu/stream/FTP/Code/LI
   3. You are free to publish results obtained from running this
      program, or from works that you derive from this program,
      with the following limitations:
-     
+
      3a. In order to be referred to as "STREAM benchmark results",
          published results must be in conformance to the STREAM
          Run Rules, (briefly reviewed below) published at
          http://www.cs.virginia.edu/stream/ref.html
          and incorporated herein by reference.
          As the copyright holder, John McCalpin retains the
          right to determine conformity with the Run Rules.
-     
+
      3b. Results based on modified source code or on runs not in
          accordance with the STREAM Run Rules must be clearly
          labelled whenever they are published.  Examples of
          proper labelling include:
-         "tuned STREAM benchmark results" 
+         "tuned STREAM benchmark results"
          "based on a variant of the STREAM benchmark code"
          Other comparable, clear and reasonable labelling is
          acceptable.
-     
+
      3c. Submission of results to the STREAM benchmark web site
          is encouraged, but not required.
   4. Use of this program or creation of derived works based on this
      program constitutes acceptance of these licensing restrictions.
   5. Absolutely no warranty is expressed or implied.
 ***
 
-## Building the `STREAM` Program for CPU and GPU
+## Using Visual Studio Code* (Optional)
+
+You can use Visual Studio Code (VS Code) extensions to set your environment,
+create launch configurations, and browse and download samples.
 
-### On a Linux* System 
+The basic steps to build and run a sample using VS Code include:
+ - Download a sample using the extension **Code Sample Browser for Intel oneAPI Toolkits**.
+ - Configure the oneAPI environment with the extension **Environment Configurator for Intel oneAPI Toolkits**.
+ - Open a Terminal in VS Code (**Terminal>New Terminal**).
+ - Run the sample in the VS Code terminal using the instructions below.
+ - (Linux only) Debug your GPU application with GDB for Intel® oneAPI toolkits using the **Generate Launch Configurations** extension.
 
-Perform the following steps:
+To learn more about the extensions, see
+[Using Visual Studio Code with Intel® oneAPI Toolkits](https://www.intel.com/content/www/us/en/develop/documentation/using-vs-code-with-intel-oneapi/top.html).
 
-> Note: If you have not already done so, set up your CLI environment by sourcing 
->    the setvars script located in the root of your oneAPI installation.  
->     
->   Linux (sudo): `source /opt/intel/oneapi/setvars.sh`  
->   Linux (user): `~/intel/oneapi/ setvars.sh`  
+After learning how to use the extensions for Intel oneAPI Toolkits, return to
+this readme for instructions on how to build and run a sample.
 
-1. Build the program using the following `cmake` commands. 
-``` 
+## Building the `STREAM` Program for CPU and GPU
+
+### On a Linux* System
+> **Note**: If you have not already done so, set up your CLI
+> environment by sourcing  the `setvars` script located in
+> the root of your oneAPI installation.
+>
+> Linux Sudo: . /opt/intel/oneapi/setvars.sh
+>
+> Linux User: . ~/intel/oneapi/setvars.sh
+>
+> Windows: C:\Program Files(x86)\Intel\oneAPI\setvars.bat
+>
+>For more information on environment variables, see Use the setvars Script for [Linux or macOS](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-linux-or-macos.html), or [Windows](https://www.intel.com/content/www/us/en/develop/documentation/oneapi-programming-guide/top/oneapi-development-environment-setup/use-the-setvars-script-with-windows.html).
+
+1. Build the program using the following `cmake` commands.
+```
 $ mkdir build
 $ cd build
 $ cmake ..
@@ -88,6 +109,15 @@ $ make
     make clean
     ```
 
+
+If an error occurs, you can get more details by running `make` with
+the `VERBOSE=1` argument:
+``make VERBOSE=1``
+For more comprehensive troubleshooting, use the Diagnostics Utility for
+Intel® oneAPI Toolkits, which provides system checks to find missing
+dependencies and permissions errors.
+[Learn more](https://www.intel.com/content/www/us/en/develop/documentation/diagnostic-utility-user-guide/top.html).
+
 ## Running the Sample
 ```
 ./stream_sycl.exe
 
@@ -2,21 +2,31 @@
 
 ## Purpose
 
-Intel GPU Occupancy Calculator is HTML Web Application to compute GPU Occupancy for GPU/HPC application development.
+Intel GPU Occupancy Calculator is HTML Web Application to compute GPU Occupancy
+for GPU/HPC application development.
 
 ## Tool Details
 
-* Allows user to pick a GPU SKU, input Global Size of HPC application, Work-Group(WG) size, Sub-Group size, Shared Local Memory(SLM) size and barrier usage.
-* Computes Sub-Slice/Dual Sub-Slice (SS/DSS) Theoretical Occupancy based on the inputs.
-* Generates charts for Impact of varying Work-Group size and Impact of varying Shared Local Memory size.
-* Generates a list of all valid Work-Group sizes that can be used and corresponding Occupancy.
-* Generates a list of optimal configuration values for WG, SG and SLM limit that will get 100% Occupancy.
+* Allows user to pick a GPU SKU, input Global Size of HPC application,
+  Work-Group(WG) size, Sub-Group size, Shared Local Memory(SLM) size and barrier
+  usage.
+* Computes Sub-Slice/Dual Sub-Slice (SS/DSS) Theoretical Occupancy based
+  on the inputs.
+* Generates charts for Impact of varying Work-Group size and
+  Impact of varying Shared Local Memory size.
+* Generates a list of all valid
+  Work-Group sizes that can be used and corresponding Occupancy.
+* Generates a list of optimal configuration values for WG, SG and SLM limit that
+  will get 100% Occupancy.
 
 ## Usage
 * Open Page: [oneapi-src.github.io/oneAPI-samples/Tools/GPU-Occupancy-Calculator/](https://oneapi-src.github.io/oneAPI-samples/Tools/GPU-Occupancy-Calculator/)
-* Select a GPU from the dropdown and change "EU Count" if necessary OR select the option to enter PCI ID for GPU.
-* The tool will load some default values for  Global Size, WG size, SG size, SLM size and will compute Occupancy and generate graphs.
-* Change the values of GPU target, Global size, WG size, SG size, SLM size or Barrier usage based on your HPC application to calculate Occupancy and tune application.
+* Select a GPU from the dropdown and change "EU Count" if necessary OR select
+  the option to enter PCI ID for GPU.
+* The tool will load some default values for Global Size, WG size, SG size,
+  SLM size and will compute Occupancy and generate graphs.
+* Change the values of GPU target, Global size, WG size, SG size, SLM size or
+  Barrier usage based on your HPC application to calculate Occupancy and tune application.
 
 ## License