Sourcery refactored main branch #1

sourcery-ai · 2023-11-02T03:28:16Z

Branch main refactored by Sourcery.

If you're happy with these changes, merge this Pull Request using the Squash and merge strategy.

Review changes via command line

To manually merge these changes, make sure you're on the main branch, then run:

git fetch origin sourcery/main
git merge --ff-only FETCH_HEAD
git reset HEAD^

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_download.py

-    filename = os.path.join(TRANSCRIPT_FOLDER, video_id + ".json")
-
-    metadata = {}
-    metadata["speaker"] = ""
-    metadata["title"] = playlist_item["snippet"]["title"]
-    metadata["videoId"] = playlist_item["snippet"]["resourceId"]["videoId"]
-    metadata["description"] = playlist_item["snippet"]["description"]
-
+    filename = os.path.join(TRANSCRIPT_FOLDER, f"{video_id}.json")
+
+    metadata = {
+        "speaker": "",
+        "title": playlist_item["snippet"]["title"],
+        "videoId": playlist_item["snippet"]["resourceId"]["videoId"],
+        "description": playlist_item["snippet"]["description"],
+    }


Function gen_metadata refactored with the following changes:

Use f-string instead of string concatenation (use-fstring-for-concatenation)

Merge dictionary assignment with declaration [×4] (merge-dict-assign)
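
For reviewers who want to confirm that both rewrites preserve behavior, here is a minimal sketch. It assumes the function only needs to produce a filename and a metadata dictionary; the `_before`/`_after` function names, the `TRANSCRIPT_FOLDER` value, and the sample playlist item are hypothetical and not taken from the script.

```python
import os

# Hypothetical folder name for illustration; the real script defines its own value.
TRANSCRIPT_FOLDER = "transcripts"


def gen_metadata_before(playlist_item):
    """Original style: string concatenation and key-by-key assignment."""
    video_id = playlist_item["snippet"]["resourceId"]["videoId"]
    filename = os.path.join(TRANSCRIPT_FOLDER, video_id + ".json")
    metadata = {}
    metadata["speaker"] = ""
    metadata["title"] = playlist_item["snippet"]["title"]
    metadata["videoId"] = playlist_item["snippet"]["resourceId"]["videoId"]
    metadata["description"] = playlist_item["snippet"]["description"]
    return filename, metadata


def gen_metadata_after(playlist_item):
    """Refactored style: f-string and a single dictionary literal."""
    video_id = playlist_item["snippet"]["resourceId"]["videoId"]
    filename = os.path.join(TRANSCRIPT_FOLDER, f"{video_id}.json")
    metadata = {
        "speaker": "",
        "title": playlist_item["snippet"]["title"],
        "videoId": playlist_item["snippet"]["resourceId"]["videoId"],
        "description": playlist_item["snippet"]["description"],
    }
    return filename, metadata


# Made-up sample item shaped like the keys used in the diff.
sample_item = {
    "snippet": {
        "title": "Sample talk",
        "description": "A sample description",
        "resourceId": {"videoId": "abc123"},
    }
}
```

One difference worth knowing: `video_id + ".json"` raises a `TypeError` if `video_id` is not a string, while the f-string formats any value. For string IDs the two forms give the same result.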

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_download.py


    video_id = playlist_item["snippet"]["resourceId"]["videoId"]
-    filename = os.path.join(TRANSCRIPT_FOLDER, video_id + ".json.vtt")
+    filename = os.path.join(TRANSCRIPT_FOLDER, f"{video_id}.json.vtt")


Function get_transcript refactored with the following changes:

Use f-string instead of string concatenation (use-fstring-for-concatenation)

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_download.py

-    # Get the next page token from the response and create a new request object
-    next_page_token = response.get("nextPageToken")
-    if next_page_token:
+    if next_page_token := response.get("nextPageToken"):


Lines 149-167 refactored with the following changes:

Use named expression to simplify assignment and conditional (use-named-expression)

Replace unused for index with underscore (for-index-underscore)

This removes the following comments (why?):

# Get the next page token from the response and create a new request object
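
A minimal sketch of the named-expression pattern in a pagination loop, with the removed comment kept above the new line. Assignment expressions require Python 3.8 or later. The `FAKE_PAGES` data and the `fetch_page` and `collect_items` helpers are hypothetical stand-ins for the real API request and are not part of the script.

```python
# Hypothetical paged responses keyed by page token (None is the first page).
FAKE_PAGES = {
    None: {"items": ["a", "b"], "nextPageToken": "p2"},
    "p2": {"items": ["c"], "nextPageToken": "p3"},
    "p3": {"items": ["d"]},  # last page: no nextPageToken key
}


def fetch_page(token=None):
    """Stand-in for an API call that returns one page of results."""
    return FAKE_PAGES[token]


def collect_items():
    """Follow nextPageToken until the response no longer carries one."""
    items = []
    response = fetch_page()
    while True:
        items.extend(response["items"])
        # Get the next page token from the response and request the next page
        if next_page_token := response.get("nextPageToken"):
            response = fetch_page(next_page_token)
        else:
            break
    return items
```

As in the original two-line form, a missing or empty token is falsy, so the loop ends on the last page.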

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_enrich_bucket.py

        word_count = len(words)
        if word_count > 0:
-            append_text = " ".join(words[0 : int(word_count * PERCENTAGE_OVERLAP)])
+            append_text = " ".join(words[:int(word_count * PERCENTAGE_OVERLAP)])


Function append_text_to_previous_segment refactored with the following changes:

Replace a[0:x] with a[:x] and a[x:len(a)] with a[x:] (remove-redundant-slice-index)
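
A quick check that the shortened slices are equivalent to the explicit ones. The `PERCENTAGE_OVERLAP` value and the word list below are made up for illustration; the script uses its own constant.

```python
# Hypothetical overlap fraction; the real script defines its own value.
PERCENTAGE_OVERLAP = 0.25

words = [f"w{i}" for i in range(20)]
cut = int(len(words) * PERCENTAGE_OVERLAP)

# Leading slice: the explicit 0 start is redundant.
head_explicit = words[0:cut]
head_short = words[:cut]

# Trailing slice: the explicit len() end is redundant.
tail_explicit = words[cut:len(words)]
tail_short = words[cut:]

append_text = " ".join(head_short)
```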

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_enrich_bucket.py

            if current_seconds < seg_finish_seconds and total_tokens < MAX_TOKENS:
                # add the text to the transcript
-                text += current_text + " "
+                text += f"{current_text} "


Function parse_json_vtt_transcript refactored with the following changes:

Use f-string instead of string concatenation [×2] (use-fstring-for-concatenation)

sourcery-ai · 2023-11-02T03:28:18Z

08-building-search-applications/scripts/transcript_enrich_embeddings.py

-    if len(time_value) == 3:
-        h, m, s = time_value
-        return int(h) * 3600 + int(m) * 60 + int(s)
-    else:
+    if len(time_value) != 3:
        return 0
+    h, m, s = time_value
+    return int(h) * 3600 + int(m) * 60 + int(s)


Function convert_time_to_seconds refactored with the following changes:

Swap if/else branches (swap-if-else-branches)

Remove unnecessary else after guard condition (remove-unnecessary-else)
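
The two forms can be compared side by side. This sketch assumes `time_value` is a list of string parts, such as the result of splitting "HH:MM:SS" on ":"; the diff does not show how the script builds it, and the `_before`/`_after` names are hypothetical.

```python
def convert_before(time_value):
    """Original form: happy path nested inside the if, fallback in the else."""
    if len(time_value) == 3:
        h, m, s = time_value
        return int(h) * 3600 + int(m) * 60 + int(s)
    else:
        return 0


def convert_after(time_value):
    """Refactored form: guard clause first, happy path unindented."""
    if len(time_value) != 3:
        return 0
    h, m, s = time_value
    return int(h) * 3600 + int(m) * 60 + int(s)


full = "01:02:03".split(":")  # three parts, converted to seconds
short = "02:03".split(":")    # two parts, rejected by the guard
```

Both versions return 0 for any input that does not have exactly three parts, so the refactor changes layout only.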

sourcery-ai · 2023-11-02T03:28:19Z

08-building-search-applications/scripts/transcript_enrich_lite.py

    """This function removes the text from each dictionary in the list."""
    return [
-        {k: v for k, v in seg.items() if k != "text" and k != "description"}
+        {k: v for k, v in seg.items() if k not in ["text", "description"]}


Function remove_text refactored with the following changes:

Replace multiple comparisons of same variable with in operator (merge-comparisons)
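
A small sketch confirming that the membership test filters the same keys as the chained comparisons. The segment data and the `_before`/`_after` names are made up for illustration.

```python
def remove_text_before(segments):
    """Original form: two != comparisons joined with and."""
    return [
        {k: v for k, v in seg.items() if k != "text" and k != "description"}
        for seg in segments
    ]


def remove_text_after(segments):
    """Refactored form: a single not-in membership test."""
    return [
        {k: v for k, v in seg.items() if k not in ["text", "description"]}
        for seg in segments
    ]


# Made-up segments shaped like transcript entries.
sample_segments = [
    {"videoId": "abc123", "start": "00:00:10", "text": "hello", "description": "intro"},
    {"videoId": "abc123", "start": "00:01:00", "text": "world"},
]
```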

sourcery-ai · 2023-11-02T03:28:19Z

08-building-search-applications/scripts/transcript_enrich_speaker.py

    # create multiple threads to process the queue
    threads = []
-    for i in range(PROCESSING_THREADS):
+    for _ in range(PROCESSING_THREADS):


Lines 228-228 refactored with the following changes:

Replace unused for index with underscore (for-index-underscore)
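
For context, a minimal sketch of a thread-spawning loop where the index is never read, which is the case this rule targets. The worker, the queue contents, and the `PROCESSING_THREADS` value are hypothetical; the script's own worker function is not shown in the diff.

```python
import queue
import threading

# Hypothetical thread count; the real script defines its own value.
PROCESSING_THREADS = 4


def worker(q, results, lock):
    """Drain the queue, doubling each item (placeholder for the real work)."""
    while True:
        try:
            item = q.get_nowait()
        except queue.Empty:
            return
        with lock:
            results.append(item * 2)
        q.task_done()


def run(items):
    q = queue.Queue()
    for item in items:
        q.put(item)

    results = []
    lock = threading.Lock()

    # create multiple threads to process the queue
    threads = []
    for _ in range(PROCESSING_THREADS):  # the loop index is never used
        t = threading.Thread(target=worker, args=(q, results, lock))
        t.start()
        threads.append(t)

    for t in threads:
        t.join()
    return sorted(results)
```

Naming the variable `_` signals to readers and linters that only the number of iterations matters.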

sourcery-ai · 2023-11-02T03:28:19Z

08-building-search-applications/scripts/transcript_enrich_summaries.py

    # create multiple threads to process the queue
    threads = []
-    for i in range(PROCESSOR_THREADS):
+    for _ in range(PROCESSOR_THREADS):


Lines 179-179 refactored with the following changes:

Replace unused for index with underscore (for-index-underscore)

sourcery-ai · 2023-11-02T03:28:19Z

08-building-search-applications/scripts/transcript_enrich_summaries.py

-    if len(time_value) == 3:
-        h, m, s = time_value
-        return int(h) * 3600 + int(m) * 60 + int(s)
-    else:
+    if len(time_value) != 3:
        return 0
+    h, m, s = time_value
+    return int(h) * 3600 + int(m) * 60 + int(s)


Function convert_time_to_seconds refactored with the following changes:

Swap if/else branches (swap-if-else-branches)

Remove unnecessary else after guard condition (remove-unnecessary-else)

'Refactored by Sourcery'

43ab042

sourcery-ai bot requested a review from hkhdair November 2, 2023 03:28
