-
Notifications
You must be signed in to change notification settings - Fork 539
Insights: pytorch/executorch
Overview
Could not load contribution data
Please try again later
1 Release published by 1 person
-
v0.6.0
published
Apr 24, 2025
550 Pull requests merged by 81 people
-
Arm backend: Update NEGATE with TOSA 1.0 support
#10845 merged
May 14, 2025 -
Arm backend: Update operator support for TOSA-1.0+INT+u55
#10849 merged
May 14, 2025 -
[Executorch][llm] Enable local global attention in export_llama script
#10836 merged
May 14, 2025 -
Change lowbit example to use 4-bit as default in example
#10865 merged
May 14, 2025 -
[LlamaDemo] Replace some tokens
#10863 merged
May 14, 2025 -
Update CI for HF Optimum models
#10820 merged
May 14, 2025 -
Update documents for Express SDK update
#10462 merged
May 14, 2025 -
Remove EXECUTORCH_SEPARATE_FLATCC_HOST_PROJECT
#10860 merged
May 13, 2025 -
Use the install method for flatc
#10859 merged
May 13, 2025 -
Allow graceful handling of cpuinfo init failure
#10826 merged
May 13, 2025 -
Arm backend: Decompose sum in pass
#10852 merged
May 13, 2025 -
Build flatcc for the host
#10855 merged
May 13, 2025 -
Arm backend: Merge decompose/convert meandim pass
#10844 merged
May 13, 2025 -
BUCK forward fix on NXP backend
#10838 merged
May 13, 2025 -
Qualcomm AI Engine Direct - Flags for CI
#9536 merged
May 13, 2025 -
[Executorch][llm] Enable leveraging ring kv cache via module swap
#10835 merged
May 13, 2025 -
Add floatValue to ExecuTorch value
#10823 merged
May 13, 2025 -
Update ownership for the build system
#10837 merged
May 13, 2025 -
[Executorch][llm] Make custom update cache op operate on indices
#10834 merged
May 13, 2025 -
Delete executorch_print_configuration_summary
#10806 merged
May 13, 2025 -
Arm backend: Fix mypy linting in pre-push
#10850 merged
May 13, 2025 -
[Executorch][llm] Add ring buffer based kv cache and mask calculation to MHA
#10833 merged
May 13, 2025 -
Move dependent options to default preset
#10805 merged
May 13, 2025 -
Arm backend: Refactor pass tests for TOSA V1.0
#10843 merged
May 13, 2025 -
Arm backend: Update partitioner de-tagging iteration order
#10813 merged
May 13, 2025 -
Use certifi certs for buck download
#10095 merged
May 13, 2025 -
Correct model name in examples/arm/run.sh
#10815 merged
May 13, 2025 -
[ET-VK] Removing un used push constants for conv2d pw.
#10841 merged
May 13, 2025 -
Default to file load mode in module
#10827 merged
May 13, 2025 -
[ET-VK] Removing un used push constants for conv2d pw.
#10814 merged
May 13, 2025 -
Move simple options to default preset
#10804 merged
May 13, 2025 -
Refactor _get_source_transforms to remove args
#10519 merged
May 13, 2025 -
[Executorch][llm] Add support for ring kv cache and ring attention
#10832 merged
May 13, 2025 -
[Executorch][llm] Enable local global attention in export_llama script
#10612 merged
May 13, 2025 -
[Executorch][llm] Enable leveraging ring kv cache via module swap
#10611 merged
May 13, 2025 -
[Executorch][llm] Make custom update cache op operate on indices
#10610 merged
May 13, 2025 -
[Executorch][llm] Add ring buffer based kv cache and mask calculation to MHA
#10609 merged
May 13, 2025 -
[Executorch][llm] Add support for ring kv cache and ring attention
#10608 merged
May 13, 2025 -
Delete EXECUTORCH_BUILD_ANDROID_JNI
#10803 merged
May 13, 2025 -
Move OPTIMIZE_SIZE to default preset
#10802 merged
May 13, 2025 -
Refactor _to_edge_and_lower_llama to remove args
#10520 merged
May 13, 2025 -
Move EXECUTORCH_ENABLE_EVENT_TRACER to default preset
#10801 merged
May 12, 2025 -
[jit] Remove @torch.jit.export
#10824 merged
May 12, 2025 -
Rename "topic: not user facing"
#10828 merged
May 12, 2025 -
Rename "topic: not user facing"
#10791 merged
May 12, 2025 -
mediatek llama runner use executorch_core
#10754 merged
May 12, 2025 -
Android Qwen thinking mode prompt support
#10668 merged
May 12, 2025 -
[jit] Remove TorchScript from doc
#10825 merged
May 12, 2025 -
Move EXECUTORCH_ENABLE_PROGRAM_VERIFICATION to default preset
#10800 merged
May 12, 2025 -
Forward-fixing G3 lt kernel
#10812 merged
May 12, 2025 -
Move EXECUTORCH_LOG_LEVEL to default preset
#10799 merged
May 12, 2025 -
Xnnpack test for program-data separation
#10817 merged
May 12, 2025 -
Update backends-coreml.md
#10816 merged
May 12, 2025 -
[ET-VK] Return fence after waiting is done.
#10808 merged
May 12, 2025 -
Xnnpack test for program-data separation
#10532 merged
May 12, 2025 -
Make a separate target for kernel utils
#10788 merged
May 12, 2025 -
Move EXECUTORCH_PAL_DEFAULT to default preset
#10798 merged
May 12, 2025 -
[llava] Remove torch.jit.save in llava example
#10794 merged
May 12, 2025 -
NXP Backend: Add eIQ Neutron Backend
#10196 merged
May 12, 2025 -
Arm backend: Rescale fixes for TOSA 1.0
#10809 merged
May 12, 2025 -
Arm backend: Fix ensures check in UnsqueezeScalarPlaceholdersPass
#10811 merged
May 12, 2025 -
[ET-VK] Return fence after waiting is done.
#10787 merged
May 12, 2025 -
Introduce assertj test lib to make the throw exception test more accu…
#10779 merged
May 10, 2025 -
Remove FLATC_EXECUTABLE and the ability to bring your own flatc
#10781 merged
May 9, 2025 -
fix bug with sequential backends
#10708 merged
May 9, 2025 -
bugfix
#10793 merged
May 9, 2025 -
Use torchtune 0.6.1
#10792 merged
May 9, 2025 -
Reapply #9841: Migrate elementwise_util callers to the variants with out_dtypes in template arguments
#10491 merged
May 9, 2025 -
Reapply #9842: Save some size in dtype_util when dtype selective build is not in use
#10490 merged
May 9, 2025 -
Save some size in pattern/{bitwise,comparison}_op.h
#10489 merged
May 9, 2025 -
Cortex-M: Use q/dq ops in Arm Ethos Runner
#10782 merged
May 9, 2025 -
Arm backend: Suppress colors in pre-push if non-interactive
#10783 merged
May 9, 2025 -
fix transpose / permutations fusion pass
#10780 merged
May 9, 2025 -
Use std::align_alloc in file_data_loader
#10660 merged
May 9, 2025 -
Arm Backend: Use tosa_ref_model only if it is avaiable
#10778 merged
May 9, 2025 -
: constant fold None
#10762 merged
May 9, 2025 -
Make constant_folding's _DEFAULT_SKIP_TARGETS public
#10760 merged
May 8, 2025 -
Extract trace from prepare_and_convert and remove export_program
#10493 merged
May 8, 2025 -
Create a macos-arm64 preset
#10768 merged
May 8, 2025 -
Convert the unit test from java to kotlin
#10702 merged
May 8, 2025 -
Allow options to be set by presets
#10767 merged
May 8, 2025 -
Minor vector sizing change.
#10753 merged
May 8, 2025 -
Arm backend: Replace asserts with exceptions in permutation code
#10774 merged
May 8, 2025 -
Arm backend: Remove redundant validation check for op_where
#10773 merged
May 8, 2025 -
Automatically announce declared options
#10766 merged
May 8, 2025 -
Arm Backend: Update unit tests for TOSA 1.0
#10776 merged
May 8, 2025 -
Arm backend: Add model name to -llama_inputs
#10775 merged
May 8, 2025 -
[ET-VK] Implement linear_qcs4w
#10772 merged
May 8, 2025 -
[ET-VK] Introduce generic export pass for fusing Q/DQ nodes
#10771 merged
May 8, 2025 -
[ET-VK] Implement linear_qcs4w
#10588 merged
May 8, 2025 -
[ET-VK] Introduce generic export pass for fusing Q/DQ nodes
#10525 merged
May 8, 2025 -
Tests use executorch_core
#10764 merged
May 8, 2025 -
Update buck2 to 2025-05-06
#10742 merged
May 8, 2025 -
Handle avg_pool2d with padding == 0 as no padding
#10697 merged
May 8, 2025 -
Vulkan tests use executorch_core
#10765 merged
May 8, 2025 -
Run apple.yml on ciflow/trunk
#10759 merged
May 7, 2025 -
Add libextension_flat_tensor.a to build_apple_frameworks.sh
#10758 merged
May 7, 2025 -
Add tools/cmake to unittests
#10752 merged
May 7, 2025 -
Arm backend: add support for operator @
#10749 merged
May 7, 2025 -
Arm backend: Adjust MaxPool2d padding when window is not divisible by stride
#10751 merged
May 7, 2025 -
Test.cmake use executorch_core
#10747 merged
May 7, 2025 -
Create a helper to define overridable configs
#10731 merged
May 7, 2025 -
bump torchao pin
#10743 merged
May 7, 2025 -
[cadence] add-delinearize-index-dep
#10739 merged
May 7, 2025 -
Arm backend: Replace asserts with ValueError for slicing constraints
#10748 merged
May 7, 2025 -
[CMake] llm_runner use executorch_core
#10698 merged
May 7, 2025 -
[CMake] Avoid extension_module have dupe flat_tensor cpp
#10735 merged
May 7, 2025 -
Arm backend: Add DecomposeCosineSimilarity
#10729 merged
May 7, 2025 -
[ET-VK] Using width packed bias in conv1d op to slightly improve speed and memory.
#10733 merged
May 7, 2025 -
[ExecuTorch][#10375] Add
extension.BundledModule
to Wrapextension.Module
with Bundled Program Logic#10744 merged
May 7, 2025 -
[ExecuTorch][#10375] Add
extension.BundledModule
to Wrapextension.Module
with Bundled Program Logic#10449 merged
May 7, 2025 -
Kernels should depend on executorch_core
#10738 merged
May 7, 2025 -
Fix BUCK
#10740 merged
May 7, 2025 -
Implement native_dropout
#10567 merged
May 7, 2025 -
Add resnet18 test case to OSS
#10705 merged
May 6, 2025 -
Prepare for recursive DCE PR
#10730 merged
May 6, 2025 -
Backend data separation test
#10734 merged
May 6, 2025 -
extension/module doesn't depend on prim_ops
#10710 merged
May 6, 2025 -
Refactor attention v2
#10707 merged
May 6, 2025 -
Backend data separation test
#10531 merged
May 6, 2025 -
Qwen3 doc and config tweaks
#10640 merged
May 6, 2025 -
Create skeleton for a build preset GitHub workflow
#10724 merged
May 6, 2025 -
Enable xnnpack in aten mode
#9049 merged
May 6, 2025 -
Arm backend: Create op utility function for num input verification
#10713 merged
May 6, 2025 -
Arm backend: Add non-interactive in git hook
#10712 merged
May 6, 2025 -
[CMake] data_loader and runner_util use executorch_core
#10703 merged
May 6, 2025 -
Match PyTorch _link_check.yml
#10709 merged
May 6, 2025 -
Use three-dot diffs in URL and xref lint workflows
#10706 merged
May 6, 2025 -
Refactor attention v2
#10623 merged
May 6, 2025 -
Add pass to remove unused parameters in to_edge
#10484 merged
May 6, 2025 -
Make quantize_pt2 private and remove external call sites
#10683 merged
May 6, 2025 -
xnnpack_backend doesn't need to be STATIC only
#10692 merged
May 6, 2025 -
Accept tokenizer.json in iOS benchmark app
#10639 merged
May 5, 2025 -
[ET-VK][ez][Refactor] Re-order
DispatchNode
arguments to match shader layout spec#10700 merged
May 5, 2025 -
[ET-VK][ez][Refactor] Re-order
DispatchNode
arguments to match shader layout spec#10693 merged
May 5, 2025 -
Update extension llm runner deps
#10685 merged
May 5, 2025 -
Arm Ethos-u: Update buck dep
#10691 merged
May 5, 2025 -
Rename ModuleLinear -> ModuleAddMul
#10695 merged
May 5, 2025 -
Rename ModuleLinear -> ModuleAddMul
#10529 merged
May 5, 2025 -
Remove unused vulkan_executor_runner_lib
#10679 merged
May 5, 2025 -
[ET-VK] Simplifying conv1d op shader by changing it to process one output texel per thread.
#10690 merged
May 5, 2025 -
[ET-VK] Using vector for storing ref_mapping_ in GraphBuilder to improve model load time and memory.
#10689 merged
May 5, 2025 -
[ET-VK] Minor build graph change to improve model load time and memory.
#10688 merged
May 5, 2025 -
[ET-VK] Simplifying conv1d op shader by changing it to process one output texel per thread.
#10665 merged
May 5, 2025 -
[ET-VK] Using vector for storing ref_mapping_ in GraphBuilder to improve model load time and memory.
#10647 merged
May 5, 2025 -
[ET-VK] Minor build graph change to improve model load time and memory.
#10646 merged
May 5, 2025 -
kernel/portable deps on executorch_core only
#10674 merged
May 5, 2025 -
extension/module should depend on executorch_core
#10678 merged
May 5, 2025 -
vulkan_backend is not necessarily static
#10680 merged
May 5, 2025 -
Make lower_ep_to<edge, cadence, executorch> private functions and remove external call sites
#10671 merged
May 5, 2025 -
[ET-VK][ez] Improvements to GLSL codegen script
#10682 merged
May 5, 2025 -
No need to expose executorch dependency executorch_core
#10673 merged
May 5, 2025 -
[ET-VK][ez] Improvements to GLSL codegen script
#10605 merged
May 5, 2025 -
Arm backend: Enable test_llama_tosa_BI and related fixes
#10681 merged
May 5, 2025 -
Arm backend: Refactor TosaArg to use TosaSpecification
#10655 merged
May 5, 2025 -
Arm backend: Add support for single input matmul
#10654 merged
May 5, 2025 -
Arm backend: Add support to neg.default
#10653 merged
May 5, 2025 -
Pass one NDM to backend init
#10669 merged
May 5, 2025 -
Handle unsupported pybind inputs non-fatally
#10670 merged
May 5, 2025 -
Pass one NDM to backend init
#10528 merged
May 3, 2025 -
Make quantize_pt2 return an ExportedProgram instead of a GraphModule
#10644 merged
May 3, 2025 -
Permute elimination pass fixes.
#10662 merged
May 3, 2025 -
[LlamaDemo] Add a button to toggle thinking mode
#10667 merged
May 2, 2025 -
support phi4 in ios demo app
#10659 merged
May 2, 2025 -
Arm backend: Add SDPA decomposition to annotation pipeline
#10657 merged
May 2, 2025 -
New Xcode media
#10666 merged
May 2, 2025 -
[LlamaDemo] Support more tokenizer suffix
#10664 merged
May 2, 2025 -
bump tokenizer pin
#10658 merged
May 2, 2025 -
Avoid directly calling the map operator
#10638 merged
May 2, 2025 -
Arm backend: Update Rescale and affected nodes to support TOSA 1.0
#10656 merged
May 2, 2025 -
Add filter function to XNNPack Quantizer
#10626 merged
May 2, 2025 -
Fix linter errors in OSS
#10650 merged
May 2, 2025 -
Helpers to create zeros tensor.
#10643 merged
May 2, 2025 -
LlamaDemo Android add Qwen 3 prompt format
#10624 merged
May 2, 2025 -
Helpers to create random integer tensor.
#10649 merged
May 2, 2025 -
Helpers to create random normal tensor.
#10648 merged
May 2, 2025 -
Helpers to create random tensor.
#10645 merged
May 2, 2025 -
Minor changes to native layer norm shader op to improve perf.
#10585 merged
May 2, 2025 -
Helpers to create ones tensor.
#10642 merged
May 2, 2025 -
Helpers to create full tensors.
#10641 merged
May 2, 2025 -
Add expand_copy to the list of trivially quantizable ops
#10606 merged
May 2, 2025 -
Update README.md
#10636 merged
May 2, 2025 -
Helpers to create empty tensors.
#10621 merged
May 1, 2025 -
Update iOS app tutorial
#10635 merged
May 1, 2025 -
Fix iOS app frontend build
#10634 merged
May 1, 2025 -
Fix tests build in Package.swift
#10631 merged
May 1, 2025 -
Replace usage of deprecated
distutils.(file|dir)_util
#10590 merged
May 1, 2025 -
Support named_data in flat_tensor
#10629 merged
May 1, 2025 -
Update doc-build.yml
#10627 merged
May 1, 2025 -
Add thinking mode toggle and UX improvements for Qwen3 on iOS app
#10614 merged
May 1, 2025 -
Support named_data in flat_tensor
#10527 merged
May 1, 2025 -
Fix tests build in Package.swift
#10628 merged
May 1, 2025 -
FIx Package.swift tests build
#10625 merged
May 1, 2025 -
Update doc-build.yml
#10620 merged
May 1, 2025 -
Add complex dtype support to mul
#10560 merged
May 1, 2025 -
Android fix uninitialized temperature
#10619 merged
May 1, 2025 -
Add prompt constants file
#10618 merged
May 1, 2025 -
Support different model types in iOS app
#10615 merged
May 1, 2025 -
[ET-VK][ez] Use standard quant naming scheme for quantized ops
#10616 merged
May 1, 2025 -
[ET-VK][ez] Use standard quant naming scheme for quantized ops
#10587 merged
May 1, 2025 -
Add complex dtype support to op_sum
#10559 merged
May 1, 2025 -
Patch to D73900459
#10604 merged
May 1, 2025 -
Move --deep flag in Benchmakr project to Tests target
#10613 merged
May 1, 2025 -
Update doc-build.yml
#10596 merged
May 1, 2025 -
Fix num nonbatch dims calculation
#10600 merged
May 1, 2025 -
Update Benchmark README.md
#10607 merged
May 1, 2025 -
[MPS] float16 naming fixed and inexistent mps_logical_not model removed
#10562 merged
May 1, 2025 -
Fix quantizer tests after dq conv is enabled
#10569 merged
May 1, 2025 -
Simplify linker flags for Package.swift tests
#10601 merged
May 1, 2025 -
Fix missing backslashes in Qwen3 example commands
#10599 merged
May 1, 2025 -
Implement a coversion pass from pow(E,x) to E-1 mul ops.
#10564 merged
May 1, 2025 -
Do not raise error when quant primitives are left after partitioner
#10573 merged
May 1, 2025 -
Qualcomm AI Engine Direct - Support Qnn IR backend in online preparation
#8876 merged
May 1, 2025 -
Use generated files in module test
#10597 merged
Apr 30, 2025 -
Fix paths in LLaMa demo project.pbxproj
#10598 merged
Apr 30, 2025 -
Allow removing permute pairs in addition to transpose pairs (#10501)
#10566 merged
Apr 30, 2025 -
Use generated files in module test
#10497 merged
Apr 30, 2025 -
Update using-executorch-ios.md
#10595 merged
Apr 30, 2025 -
Fix iOS app preprocessor flags
#10594 merged
Apr 30, 2025 -
Android build use SUPPORT_REGEX_LOOKAHEAD=ON
#10586 merged
Apr 30, 2025 -
Allow tokenizer.json to be recognized in llama ios app
#10591 merged
Apr 30, 2025 -
Support negative lookahead for tokenizer in ios llama app
#10592 merged
Apr 30, 2025 -
Revert D73804939
#10589 merged
Apr 30, 2025 -
Use external hf_tokenizer in llama runner
#9112 merged
Apr 30, 2025 -
Fix llama app build
#10583 merged
Apr 30, 2025 -
Add helpers to create errors in ObjC/Swift
#10575 merged
Apr 30, 2025 -
Arm backend: Replace asserts with error handling in upsample operators
#10577 merged
Apr 30, 2025 -
Add --deep code signing flag to Benchmark app to sign the Tests bundl…
#10574 merged
Apr 30, 2025 -
Add a link to a list of nightly builds for Apple packages
#10568 merged
Apr 30, 2025 -
Include C++ std lib into SwiftPM package manifest
#10572 merged
Apr 30, 2025 -
Bump coremltools version
#10544 merged
Apr 30, 2025 -
Fix coreml rank0
#10534 merged
Apr 30, 2025 -
Replace usage of deprecated
distutils.(file|dir)_util
#10530 merged
Apr 29, 2025 -
Add Qwen3 0.6B, 1.7B, and 4B
#10539 merged
Apr 29, 2025 -
Run link checks on modified files on push too
#10558 merged
Apr 29, 2025 -
Fix ETCoreMLModelManager tests
#10557 merged
Apr 29, 2025 -
Provide list of files to link linters if desired
#10556 merged
Apr 29, 2025 -
Move link checks to a dedicated workflow
#10540 merged
Apr 29, 2025 -
Fix ios benchmark app
#10543 merged
Apr 29, 2025 -
Add a test for PR 10465
#10537 merged
Apr 29, 2025 -
Arm backend: Add FuseEqualPlaceholdersPass
#9893 merged
Apr 29, 2025 -
Arm backend: Build executorch with -j$(nproc)
#10547 merged
Apr 29, 2025 -
Add virtual keyword to Module methods to make it Mock-able
#10521 merged
Apr 29, 2025 -
Arm backend: Add TOSA VGF encapsulated compilation target.
#10476 merged
Apr 29, 2025 -
Arm backend: Add table ops to CheckProperQuantization
#10545 merged
Apr 29, 2025 -
Arm backend: Add testing/support for Inception_v4 and w2l for Ethos-U85
#10517 merged
Apr 29, 2025 -
Run Android benchmark on S22 private devices
#10538 merged
Apr 29, 2025 -
Arm backend: Convert asserts to raise errors in op_avg_pool2d
#10516 merged
Apr 29, 2025 -
Arm backend: Make run.sh run without setup.sh
#10515 merged
Apr 29, 2025 -
Backends arm: Bump ethos-u/core_platform
#10514 merged
Apr 29, 2025 -
LLM export pass to swap in custom SDPA
#10355 merged
Apr 29, 2025 -
Remove #include <span>
#10535 merged
Apr 29, 2025 -
Remove graph prints in tests
#10506 merged
Apr 29, 2025 -
Add device type (public, private) and id
#10496 merged
Apr 29, 2025 -
Unbreak pytree Buck build
#10536 merged
Apr 29, 2025 -
Remove #include <span>
#10533 merged
Apr 28, 2025 -
[ET-VK] Using uint16 for quantized linear tiling shader to reduce register pressure and improve performance.
#10509 merged
Apr 28, 2025 -
Enable quant fusion and const propagation by default
#10394 merged
Apr 28, 2025 -
link bmm, mm, view_copy, slice_copy, split_with_sizes_copy to jarvis
#10436 merged
Apr 28, 2025 -
Fix linter
#10526 merged
Apr 28, 2025 -
Update README.md to stable docs
#10518 merged
Apr 28, 2025 -
Use python3 in .lintrunner.toml
#10523 merged
Apr 28, 2025 -
Arm backend: Remove build_quantized_ops_aot_lib.sh
#10350 merged
Apr 28, 2025 -
Arm backend: Rename build_executorch_runner script
#10511 merged
Apr 28, 2025 -
Lint xrefs and urls
#10507 merged
Apr 28, 2025 -
Add test for floor divide.
#10483 merged
Apr 27, 2025 -
Add direct copy fast path for portable copy op
#10487 merged
Apr 27, 2025 -
[ExecuTorch] Arm backend: Buckify cos test
#10505 merged
Apr 26, 2025 -
[ExecuTorch] Arm backend: Update more node visitors to support TOSA 1.0 (#10425)
#10504 merged
Apr 26, 2025 -
[serialization_lib][1.00] update consumers
#10503 merged
Apr 26, 2025 -
[serializer-lib][0.80] Refactor to accomodate v1.0
#10502 merged
Apr 26, 2025 -
[ExecuTorch] Arm backend: Buckify cos test
#10480 merged
Apr 26, 2025 -
[ExecuTorch] Arm backend: Update more node visitors to support TOSA 1.0 (#10425)
#10479 merged
Apr 26, 2025 -
[serialization_lib][1.00] update consumers
#10478 merged
Apr 26, 2025 -
[serializer-lib][0.80] Refactor to accomodate v1.0
#10477 merged
Apr 26, 2025 -
Move the transpose matmul pass to OSS and run it earlier in the flow
#10433 merged
Apr 26, 2025 -
Adding bmm, mm, view_copy, slice_copy, split_with_sizes_copy optimizations
#9877 merged
Apr 26, 2025 -
Add CI workflow to check c10 is synced with PyTorch
#10413 merged
Apr 25, 2025 -
Add quantize_and_export_to_edge and quantize_and_export_to_executorch
#10379 merged
Apr 25, 2025 -
Re-sync c10
#10402 merged
Apr 25, 2025 -
Adopt runtime::FunctionRef in thread_parallel_interface.h and thread_parallel.h
#10442 merged
Apr 25, 2025 -
Move fully-featured FunctionRef from extension/pytree to ExecuTorch core
#10441 merged
Apr 25, 2025 -
Re-sync extension/pytree/function_ref.h with LLVM
#10440 merged
Apr 25, 2025 -
Bump tokenizers dep
#10332 merged
Apr 25, 2025 -
[ET-VK][ez] Improve insert_prepack_node pass to handle multiple uses of constant tensors
#10488 merged
Apr 25, 2025 -
Use a local strong NSError for methods with nested autorelease pool
#10465 merged
Apr 25, 2025 -
[ET-VK][ez] Improve insert_prepack_node pass to handle multiple uses of constant tensors
#10426 merged
Apr 25, 2025 -
Fix, or rather "port", bug fix for sdpa
#10466 merged
Apr 25, 2025 -
Update README.md
#10486 merged
Apr 25, 2025 -
Update stable to 0.6
#10485 merged
Apr 25, 2025 -
Revert "[0.6 release] Update stable to point to 0.6"
#10482 merged
Apr 25, 2025 -
Recursive checkout in CI
#10472 merged
Apr 25, 2025 -
Add data path to runner
#10460 merged
Apr 25, 2025 -
Arm backend: Add decomposition pass for aten.ne
#10475 merged
Apr 25, 2025 -
Arm backend: Update op_view for TOSA 1.0
#10474 merged
Apr 25, 2025 -
forward fix dq conv
#10399 merged
Apr 25, 2025 -
Arm backend: Broaden exception handling for unsupported ops
#10473 merged
Apr 25, 2025 -
Typo
#10324 merged
Apr 25, 2025 -
Add data path to runner
#10445 merged
Apr 25, 2025 -
Update executorch maven in getting-started.md
#10452 merged
Apr 24, 2025 -
Revert "Arm backend: Update more node visitors to support TOSA 1.0"
#10455 merged
Apr 24, 2025 -
[0.6 release] Update stable to point to 0.6
#10437 merged
Apr 24, 2025 -
Rename convert_pt2
#10378 merged
Apr 24, 2025 -
Update using-executorch-ios.md
#10448 merged
Apr 24, 2025 -
Arm backend: update quantizer/__init__
#10408 merged
Apr 24, 2025 -
Update using-executorch-ios.md
#10446 merged
Apr 24, 2025 -
Disable executor_runner for iOS builds
#10435 merged
Apr 24, 2025 -
Arm backend: Add unit test for DeiT-Tiny model on TOSA-MI backend
#10391 merged
Apr 24, 2025 -
[Android] Add API to use new config
#10346 merged
Apr 24, 2025 -
Update pyproject.toml pins
#10428 merged
Apr 24, 2025 -
Update iOS doc to point to examples README (#10429)
#10434 merged
Apr 24, 2025 -
Reland minibench refactor
#10417 merged
Apr 24, 2025 -
refactor no longer needed EXECUTORCH_BUILD_HOST_TARGETS
#10320 merged
Apr 24, 2025 -
Fix uint16 support for quantize_per_tensor.
#10398 merged
Apr 24, 2025 -
Update iOS doc to point to examples README
#10429 merged
Apr 24, 2025 -
Arm backend: Convert asserts to raise errors in op_bmm
#10424 merged
Apr 24, 2025 -
Arm backend: Fix CPU cycle counters over backend delegate code
#10393 merged
Apr 24, 2025 -
Arm backend: Update more node visitors to support TOSA 1.0
#10425 merged
Apr 24, 2025 -
[Android] Add Tensor_unsupoprted return type instead of crash
#10414 merged
Apr 24, 2025 -
Remove leftover print
#10396 merged
Apr 24, 2025 -
[cortex-m] Add scalar c++ op for dequantize_per_tensor
#10383 merged
Apr 24, 2025 -
[cortex-m] Add scalar c++ op for quantize_per_tensor
#10382 merged
Apr 24, 2025 -
[cortex-m] initial commit
#10381 merged
Apr 24, 2025 -
[ExecuTorch][#9638] Introduce Protected Method Getter in Extension.Module
#10384 merged
Apr 24, 2025 -
Add pass to tag external constants for delegates
#10422 merged
Apr 24, 2025 -
Add pass to tag external constants for delegates
#10328 merged
Apr 23, 2025 -
Fix linter
#10404 merged
Apr 23, 2025 -
[ET-VK] Add coop shader for int8 linear
#10416 merged
Apr 23, 2025 -
[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors
#10415 merged
Apr 23, 2025 -
[ET-VK] Add coop shader for int8 linear
#10304 merged
Apr 23, 2025 -
[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors
#10302 merged
Apr 23, 2025 -
Revert "Migrate elementwise_util callers to the variants with out_dtypes in template arguments"
#10411 merged
Apr 23, 2025 -
Revert "Save some size in dtype_util when dtype selective build is not in use"
#10410 merged
Apr 23, 2025 -
Generalize view_copy fusion.
#10356 merged
Apr 23, 2025 -
Add job_arn to benchmark result
#10372 merged
Apr 23, 2025 -
Add CI workflow to check c10 is synced with PyTorch
#10403 merged
Apr 23, 2025 -
Revert "Minibench refactor (#10376)"
#10405 merged
Apr 23, 2025 -
Revert "Arm backend: Populate __init__.py for quantizer and Arm root"
#10395 merged
Apr 23, 2025 -
Arm backend: Convert assert to throw ValueError in op_log
#10392 merged
Apr 23, 2025 -
Arm backend: Add Tutorial to Example tab on the Docs page
#10386 merged
Apr 23, 2025 -
Arm backend: Update node visitors to support TOSA 1.0
#10390 merged
Apr 23, 2025 -
Arm backend: Make it easier to generate non delegated/quantized PTEs
#10387 merged
Apr 23, 2025 -
Arm backend: Allow --quantize in non delegated using aot_arm_compiler
#10385 merged
Apr 23, 2025 -
Arm backend: Set REGIONCFG registers of the Ethos-U
#10388 merged
Apr 23, 2025 -
[ExecuTorch][#10364] Add Protected Method Getter in
extension.Module
#10374 merged
Apr 23, 2025 -
Pcre2 buck target in third-party (#55)
#10367 merged
Apr 23, 2025 -
Run iPhone benchmark on private devices
#10380 merged
Apr 23, 2025 -
Minibench refactor
#10376 merged
Apr 23, 2025 -
New embedding quant fusion
#10325 merged
Apr 23, 2025 -
[cortex-m] Add scalar c++ op for dequantize_per_tensor
#10267 merged
Apr 23, 2025 -
[cortex-m] Add scalar c++ op for quantize_per_tensor
#10266 merged
Apr 23, 2025 -
[cortex-m] initial commit
#10265 merged
Apr 23, 2025 -
Add buck file for qnn jni
#10370 merged
Apr 23, 2025 -
Implement a coversion pass: pow(2,x) to mul(x,x).
#10373 merged
Apr 23, 2025 -
Clarify ownership of runner components
#10338 merged
Apr 23, 2025 -
BroadcastIndexesRange: leading 1s don't require true broadcasting
#9431 merged
Apr 23, 2025 -
Save some size in dtype_util when dtype selective build is not in use
#9842 merged
Apr 23, 2025 -
Migrate elementwise_util callers to the variants with out_dtypes in template arguments
#9841 merged
Apr 23, 2025 -
RFC: Specialize for non-mixed-dtype in elementwise_util
#9388 merged
Apr 23, 2025 -
Enable transpose-quantized_relu-transpose fusion.
#10337 merged
Apr 22, 2025 -
Refactor elementwise_util: create variants with out_dtypes in template argument list
#9387 merged
Apr 22, 2025 -
[ET-VK][ez] Enable Vulkan tests to build for Android in OSS + misc fixes
#10368 merged
Apr 22, 2025 -
Support dynamically quantized 2D convolutions
#10347 merged
Apr 22, 2025 -
Update size threshhold
#10214 merged
Apr 22, 2025 -
Serialize PTD files with named data
#10359 merged
Apr 22, 2025 -
Update module wrapper so that params are explicitly registered to the wrapper
#10357 merged
Apr 22, 2025 -
Add build_optimized_size_test.sh
#9840 merged
Apr 22, 2025 -
Replace third-party/pkg_resources:pkg_resources with third-party/pypi/setuptools:setuptools [4/11]
#10259 merged
Apr 22, 2025 -
[ET-VK] Add support for
aten::upsample_bilinear2d
ATen op#10363 merged
Apr 22, 2025 -
[ET-VK] Add support for
aten::upsample_bilinear2d
ATen op#10306 merged
Apr 22, 2025 -
[ET-VK][ez] Store physical device identity metadata
#10361 merged
Apr 22, 2025 -
[ET-VK][ez] Streamline + fix enabling device extensions
#10360 merged
Apr 22, 2025 -
[ET-VK][ez] Store physical device identity metadata
#10353 merged
Apr 22, 2025 -
[ET-VK][ez] Streamline + fix enabling device extensions
#10352 merged
Apr 22, 2025 -
[Android] New config API for Llm init and generate
#10345 merged
Apr 22, 2025 -
Android JNI llama cache temperature in class
#10287 merged
Apr 22, 2025 -
Add high_freq_factor to ModelArgs
#10348 merged
Apr 22, 2025 -
Exporting start_time in InstructionEvent to Inspector (#10295)
#10344 merged
Apr 22, 2025 -
Arm backend: Populate __init__.py for quantizer and Arm root
#10351 merged
Apr 22, 2025 -
Arm backend: Remove no-op repeat nodes in ConvertExpandCopyToRepeatPass
#10137 merged
Apr 22, 2025 -
Arm backend: Add upsample_bilinear2d op
#10349 merged
Apr 22, 2025 -
Add tests for op_quantize_per_tensor + add checks for quant_min/max
#10300 merged
Apr 22, 2025 -
Update readme for qcom example
#10331 merged
Apr 22, 2025 -
Arm backend: Convert asserts to raise errors in op_mul
#10134 merged
Apr 22, 2025 -
Replace split_with_sizes_copy with slice_copy
#10318 merged
Apr 22, 2025 -
Update check_xrefs.sh
#10343 merged
Apr 21, 2025 -
[Executorch][llama] Hookup use_attention_mask option in the source transforms inside llm mananger
#10342 merged
Apr 21, 2025 -
[Executorch][llama] Allow custom sdpa op replacement pass to leverage attention mask
#10341 merged
Apr 21, 2025 -
[Executorch][llama] bug fix for custom sdpa for attention bias
#10340 merged
Apr 21, 2025 -
[Executorch][BE] Fix error logging with better message
#10339 merged
Apr 21, 2025 -
[Executorch][llama] Hookup use_attention_mask option in the source transforms inside llm mananger
#10286 merged
Apr 21, 2025 -
[Executorch][llama] Allow custom sdpa op replacement pass to leverage attention mask
#10285 merged
Apr 21, 2025 -
[Executorch][llama] bug fix for custom sdpa for attention bias
#10284 merged
Apr 21, 2025 -
[Executorch][BE] Fix error logging with better message
#10283 merged
Apr 21, 2025 -
Fix android instrumentation
#10335 merged
Apr 21, 2025 -
Serialize PTD files with named data
#10327 merged
Apr 21, 2025 -
Update check_urls.sh
#10321 merged
Apr 21, 2025 -
Update module wrapper so that params are explicitly registered to the wrapper
#10305 merged
Apr 21, 2025 -
Refactor export_delegated_program
#10334 merged
Apr 21, 2025 -
Refactor export_delegated_program
#10303 merged
Apr 21, 2025 -
Fix Linter
#10333 merged
Apr 21, 2025 -
[Android] Remove old onStats
#10312 merged
Apr 21, 2025 -
[mps] Disable dialect verifier under mps preprocess
#10323 merged
Apr 21, 2025 -
[exir] Allow verifiers in _transform
#10322 merged
Apr 21, 2025 -
Documentation updates for OpenVINO backend
#10172 merged
Apr 21, 2025 -
[mps] Disable dialect verifier under mps preprocess
#10276 merged
Apr 21, 2025 -
[exir] Allow verifiers in _transform
#10274 merged
Apr 21, 2025 -
Fixed inaccurate PR labelling instructions
#10268 merged
Apr 21, 2025 -
Fix URLs
#10316 merged
Apr 21, 2025 -
Android Linter fix
#10317 merged
Apr 21, 2025 -
Add aten_lib to executorch_llama
#10307 merged
Apr 20, 2025 -
retrieve cadence_passes in apply_jarvis_passes
#10245 merged
Apr 20, 2025 -
Fix cross-links
#10313 merged
Apr 19, 2025 -
Print actual numel in et_view
#10299 merged
Apr 19, 2025 -
Remove args from LLMEdgeManager and misc cleanup
#10288 merged
Apr 19, 2025 -
Implement _fft_c2r core ATen op
#10208 merged
Apr 19, 2025 -
[Android] Use same stats as llm::Stats
#10247 merged
Apr 19, 2025 -
Fix link in android
#10311 merged
Apr 18, 2025 -
Lint docs before building
#10310 merged
Apr 18, 2025 -
Script to validate links
#10309 merged
Apr 18, 2025 -
Build vulkan+xnnpack AAR
#10301 merged
Apr 18, 2025 -
Runtime API to retrieve attributes
#10144 merged
Apr 18, 2025 -
Support pre-quantization via torchao quantize_
#10293 merged
Apr 18, 2025 -
Add view_as_real_copy.out
#10207 merged
Apr 18, 2025 -
Remove unused pass and test to replace
linalg.vector_norm
.#10296 merged
Apr 18, 2025 -
Script to validate URLs
#10289 merged
Apr 18, 2025 -
Automatically update version name for maven upload
#10290 merged
Apr 18, 2025 -
Introduce GenerationConfig
#10228 merged
Apr 18, 2025 -
Update CONTRIBUTING.md
#10292 merged
Apr 18, 2025 -
Update CONTRIBUTING.md
#10291 merged
Apr 18, 2025 -
Fix bugs in executorch package
#10251 merged
Apr 18, 2025 -
Fix x86_64 emulator stuck issue and enable tests
#10218 merged
Apr 17, 2025 -
Bump torchao pin, adjust llama export to support pre-quantization via quantize_ (phi4-mini load/export)
#10142 merged
Apr 17, 2025 -
Instruct users to run llama for qnn to the active repro
#10231 merged
Apr 17, 2025 -
Fix links in docs
#10278 merged
Apr 17, 2025 -
Fix links in docs
#10277 merged
Apr 17, 2025 -
fix typo
#10243 merged
Apr 17, 2025 -
Qualcomm AI Engine Direct - add more profile event
#10227 merged
Apr 17, 2025 -
Qualcomm AI Engine Direct - Fix the bug in rms_norm builder
#10250 merged
Apr 17, 2025 -
reset devtool webpage tutorial
#10264 merged
Apr 17, 2025 -
Buckify Tanh test
#10249 merged
Apr 17, 2025 -
Revert "Add new dependency library for vulkan tests"
#10273 merged
Apr 17, 2025 -
Use __XTENSA__ in et_pal.cpp
#10270 merged
Apr 17, 2025 -
[ET-VK] Enable auto-generated operator correctness tests and benchmark binaries in OSS
#10260 merged
Apr 17, 2025 -
Qualcomm AI Engine Direct - add op support list
#10253 merged
Apr 17, 2025 -
Add new dependency library for vulkan tests
#10136 merged
Apr 17, 2025 -
Update demo-apps-ios.md
#10263 merged
Apr 17, 2025 -
Update LLaMA iOS docs
#10262 merged
Apr 17, 2025 -
Update screenshot in using on iOS page
#10261 merged
Apr 17, 2025 -
Update demo-apps-ios.md
#10252 merged
Apr 17, 2025 -
Update LLaMA iOS docs
#10255 merged
Apr 17, 2025 -
Update screenshot in using on iOS page
#10256 merged
Apr 17, 2025 -
reset devtool webpage tutorial
#10254 merged
Apr 17, 2025 -
Fix timespec_get not compatiable for AOSP OS Android N14
#10240 merged
Apr 17, 2025 -
Buckify Sigmoid test
#10224 merged
Apr 16, 2025 -
Add split_with_sizes to block list
#10244 merged
Apr 16, 2025 -
forward fix preprocess multimethod
#10239 merged
Apr 16, 2025 -
Add redirects for relocated docs
#10241 merged
Apr 16, 2025 -
[ET-VK] Manual sync native layer norm
#10242 merged
Apr 16, 2025 -
Add redirects for relocated docs
#10221 merged
Apr 16, 2025 -
[ET-VK] Manual sync to fbsource
#10238 merged
Apr 16, 2025 -
clean up complex tests
#10213 merged
Apr 16, 2025 -
[ET-VK] Use performant tiled algorithm for 4 bit weight only quantized linear
#10236 merged
Apr 16, 2025 -
Qualcomm AI Engine Direct - Add block quantization to llama
#10225 merged
Apr 16, 2025 -
[ET-VK] Add co-op algorithm for 4 bit weight only quantized linear
#10235 merged
Apr 16, 2025 -
[ET-VK] Allow int4 linear to execute without 8bit buffer support
#10234 merged
Apr 16, 2025 -
Add quantized kernels to executorch_jni_full
#10223 merged
Apr 16, 2025 -
[ET-VK] Use performant tiled algorithm for 4 bit weight only quantized linear
#10205 merged
Apr 16, 2025 -
Android E2E with real input
#10230 merged
Apr 16, 2025 -
Use sccache to accelerate android build
#9587 merged
Apr 16, 2025 -
Increase Android perf test timeout to 4h
#10232 merged
Apr 16, 2025 -
[ET-VK] Add co-op algorithm for 4 bit weight only quantized linear
#10204 merged
Apr 16, 2025 -
[ET-VK] Allow int4 linear to execute without 8bit buffer support
#10030 merged
Apr 16, 2025 -
Qualcomm AI Engine Direct - OSS models breakage fix
#10191 merged
Apr 16, 2025 -
Android MV2 E2E instrumentation test
#10219 merged
Apr 16, 2025 -
[Executorch][to_backend] Introduce preprocess_multimethod
#9823 merged
Apr 16, 2025 -
[ET][Testing] Build test_backend_compiler_lib when testing is on
#9953 merged
Apr 16, 2025 -
[ExecuTorch][to_backend] Enable to_backend API to leverage preprocess_multimethod
#9824 merged
Apr 16, 2025 -
[0.6 documentation] Fix Page Developer Tools: Bundled Program
#10229 merged
Apr 16, 2025 -
add BoxWithNMSLimit_out to DSP as a custom portable op
#10157 merged
Apr 16, 2025 -
[0.6 documentation] Fix Page Developer Tools: Bundled Program
#10222 merged
Apr 16, 2025 -
[0.6 documentation] Fix Page Developer Tools: Bundled Program
#10194 merged
Apr 16, 2025 -
Complex Support: bmm
#10197 merged
Apr 16, 2025 -
Set the default list of models running on private devices
#10217 merged
Apr 16, 2025 -
Add error message for empty string in filedataloader.
#10145 merged
Apr 15, 2025 -
Arm backend: Add support alias_copy operator
#10199 merged
Apr 15, 2025 -
Run Android release job on ephemeral runners
#10190 merged
Apr 15, 2025 -
Experiment with private rooted Pixel 3 devices
#10192 merged
Apr 15, 2025 -
Update llama cmake for custom ops
#10201 merged
Apr 15, 2025 -
Switch docs to 0.6 branch
#10212 merged
Apr 15, 2025 -
Strip .html suffix from doc links
#10211 merged
Apr 15, 2025 -
Strip .html suffix from doc links
#10210 merged
Apr 15, 2025 -
Update pytorch-labs/tokenizers to 295ee78
#10161 merged
Apr 15, 2025 -
Remove layer norm from the default quantizer, add one that has it
#10182 merged
Apr 15, 2025 -
FIx links in docs
#10185 merged
Apr 15, 2025 -
port hardtanh and add hardtanh test
#9914 merged
Apr 15, 2025 -
Update llama cmake for custom ops
#10176 merged
Apr 15, 2025 -
Arm Backend: Add New DecomposeSilu pass to arm_pass_manager
#9448 merged
Apr 15, 2025 -
Arm backend: Add support to ge.Scalar
#10195 merged
Apr 15, 2025 -
Arm backend: Fixing typos
#10189 merged
Apr 15, 2025 -
Qualcomm AI Engine Direct - Mimi Enablement Stage 2
#10098 merged
Apr 15, 2025 -
Refactor internal switch cases
#9802 merged
Apr 15, 2025 -
import complex.h from c10
#10155 merged
Apr 15, 2025 -
Update README.md
#10187 merged
Apr 15, 2025 -
Update README.md
#10186 merged
Apr 15, 2025 -
Update README.md
#10170 merged
Apr 15, 2025 -
Update mps_README.md
#10169 merged
Apr 15, 2025 -
Add '--recursive' to git submodule update --init
#10180 merged
Apr 15, 2025 -
FIx links in docs
#10184 merged
Apr 15, 2025 -
Fix linter
#10183 merged
Apr 15, 2025 -
[Core ML] Improve error logging
#9801 merged
Apr 15, 2025 -
Add docs for $BUILD_AAR_DIR
#10174 merged
Apr 15, 2025 -
Update instrumentation test docs
#10173 merged
Apr 15, 2025 -
Fix android instrumentation
#10125 merged
Apr 15, 2025 -
Consolidate references in docs
#10175 merged
Apr 15, 2025 -
Add '--recursive' to git submodule update --init
#10178 merged
Apr 15, 2025 -
Clone submodules recursively in install_executorch.py
#10140 merged
Apr 15, 2025 -
Add approximate gelu replacement to opt level 2
#10129 merged
Apr 14, 2025 -
[doc] Link Hugging Face models to the ExecuTorch doc
#10171 merged
Apr 14, 2025 -
Delete obsolete docs
#10160 merged
Apr 14, 2025 -
Update using-executorch-ios.md
#10150 merged
Apr 14, 2025 -
Update demo-apps-ios.md
#10149 merged
Apr 14, 2025 -
Fix compiler warnings in a few places
#10165 merged
Apr 14, 2025 -
Update README.md
#10168 merged
Apr 14, 2025 -
[doc] Link Hugging Face models to the ExecuTorch doc
#10154 merged
Apr 14, 2025 -
Update mps_README.md
#10167 merged
Apr 14, 2025 -
LLM custom ops tutorial should direct to general custom ops
#10143 merged
Apr 14, 2025 -
Fix paths in LLaMa project.pbxproj
#10138 merged
Apr 14, 2025 -
Update android docs for nightly snapshots
#10163 merged
Apr 14, 2025 -
Update doc links to relative markdown files
#10164 merged
Apr 14, 2025 -
Delete obsolete docs
#10159 merged
Apr 14, 2025 -
Add memory requirement and clarify image format for llava example
#10153 merged
Apr 14, 2025 -
[ET-VK][ez] Support convolutions with padding > 0 and dilation > 1
#10148 merged
Apr 14, 2025 -
[ET-VK][ez] Support convolutions with padding > 0 and dilation > 1
#10115 merged
Apr 14, 2025 -
Update using-executorch-ios.md
#10128 merged
Apr 14, 2025 -
Update demo-apps-ios.md
#10146 merged
Apr 14, 2025 -
NXP backend: Add NeutronQuantizer
#9876 merged
Apr 14, 2025 -
Mimi: sqnr and test without streaming
#10004 merged
Apr 14, 2025 -
[#9971] Gracefully error out in ETDump for set_debug_buffer
#10130 merged
Apr 14, 2025 -
LLM custom ops tutorial should direct to general custom ops
#10139 merged
Apr 14, 2025 -
Clone submodules recursively in install_executorch.py
#10131 merged
Apr 14, 2025 -
Make llama model search case insensitive for benchmark app
#10133 merged
Apr 14, 2025 -
Fix paths in LLaMa project.pbxproj
#10132 merged
Apr 14, 2025 -
Arm backend: Add support for TOSA 1.0 serializer
#10135 merged
Apr 14, 2025 -
Arm backend: Remove node vistor for full
#9904 merged
Apr 14, 2025
91 Pull requests opened by 50 people
-
[#9971] Gracefully error out in ETDump part 3 for *profiling_delegate
#10147 opened
Apr 14, 2025 -
[Example] Yolo12 Detection sample with OpenVINO/XNNPACK backend
#10156 opened
Apr 14, 2025 -
Move benchmarking workflow cli from testinfra to executorch
#10162 opened
Apr 14, 2025 -
Arm backend: Add TOSA support for GroupNorm
#10198 opened
Apr 15, 2025 -
qnn runner: add memory consumption logging
#10237 opened
Apr 16, 2025 -
fix tabular output
#10246 opened
Apr 16, 2025 -
NXP backend: Enable initial unit tests workflow
#10258 opened
Apr 17, 2025 -
fix tabular output
#10271 opened
Apr 17, 2025 -
Added tensor's dim order ambiguity check
#10272 opened
Apr 17, 2025 -
[exir] Refactor EdgeProgramManager.transform
#10275 opened
Apr 17, 2025 -
Move default Vela/Regor configurations to Sram_Only
#10279 opened
Apr 17, 2025 -
Fix undefined fht_float in Apple OS
#10280 opened
Apr 17, 2025 -
Add CI for conv_former and fastvit for QNN
#10282 opened
Apr 17, 2025 -
Migrate ExecuTorch's use of pt2e from torch.ao to torchao
#10294 opened
Apr 18, 2025 -
Exporting start_time in InstructionEvent to Inspector
#10295 opened
Apr 18, 2025 -
Qualcomm AI Engine Direct - alias_copy op
#10319 opened
Apr 21, 2025 -
Use dependency injection for runner
#10326 opened
Apr 21, 2025 -
Update flat tensor ndm to account for named delegate data
#10330 opened
Apr 21, 2025 -
[XNNPACK] torchao is installed by default
#10336 opened
Apr 21, 2025 -
Rename some "jarvis" instances into "falcon" or "cadence"
#10354 opened
Apr 22, 2025 -
Bump PyTorch nightly pin past April 22 2025
#10362 opened
Apr 22, 2025 -
Add test_qnn_delegates.py to oss ci
#10377 opened
Apr 23, 2025 -
Android test use kotlin
#10401 opened
Apr 23, 2025 -
always turn on dynamo for map (#150962)
#10409 opened
Apr 23, 2025 -
IOManager Interface
#10418 opened
Apr 23, 2025 -
Fix `numel()` downcast in executorch/backends/vulkan/test/utils/test_utils.cpp +2
#10419 opened
Apr 23, 2025 -
Fix `numel()` downcast in dper_lib/silvertorch/core/legacy/tools/eval/tests/TestUtil.cpp +2
#10420 opened
Apr 23, 2025 -
Test fix
#10423 opened
Apr 24, 2025 -
[ExecuTorch][#10447] Extend `PyBundledModule` with `extension.BundledModule`
#10450 opened
Apr 24, 2025 -
Support more open-source models
#10463 opened
Apr 25, 2025 -
Introduce `platform-config` in CompileSpec for MediaTek backend
#10464 opened
Apr 25, 2025 -
Arm backend: [serializer-lib][0.80] Refactor to accomodate v1.0
#10467 opened
Apr 25, 2025 -
Arm backend: [serialization_lib][1.00] update consumers
#10468 opened
Apr 25, 2025 -
Arm backend: [serializer-lib][0.80] Refactor to accomodate v1.0
#10469 opened
Apr 25, 2025 -
Arm backend: [serialization_lib][1.00] update consumers
#10470 opened
Apr 25, 2025 -
Use a local strong NSError for methods with nested autorelease pool
#10471 opened
Apr 25, 2025 -
Experiment so
#10498 opened
Apr 26, 2025 -
Increase max try in llm benchmark
#10500 opened
Apr 26, 2025 -
Allow removing permute pairs in addition to transpose pairs
#10501 opened
Apr 26, 2025 -
Use std::string_view and std::optional
#10541 opened
Apr 29, 2025 -
Qualcomm AI Engine Direct - xr model enablement (mld_f)
#10546 opened
Apr 29, 2025 -
Arm backend: Fix sigmoid int16 and int32 flakyness
#10548 opened
Apr 29, 2025 -
Remove strictness in export calls
#10552 opened
Apr 29, 2025 -
[MPS] Add portable grid_sampler_2d implementation + tests
#10561 opened
Apr 29, 2025 -
Introducing NXP Neutron runtime
#10563 opened
Apr 29, 2025 -
Hack vulkan so
#10565 opened
Apr 29, 2025 -
Qualcomm AI Engine Direct - Streaming Mimi Enablement
#10570 opened
Apr 30, 2025 -
Qualcomm AI Engine Direct - Refactor llama runner
#10578 opened
Apr 30, 2025 -
NXP backend: Create NeutronAtenPassManager with initial BatchNorm fusing passes
#10579 opened
Apr 30, 2025 -
Qualcomm AI Engine Direct - multi-method support
#10584 opened
Apr 30, 2025 -
Update MemoryPlanning Verifier to only assume model has user input if it has at least one tensor input
#10617 opened
May 1, 2025 -
Add a pass to fuse scalar mul with quant ops
#10630 opened
May 1, 2025 -
Skip message format
#10632 opened
May 1, 2025 -
Enable do_quant_fusion_and_const_prop by default
#10633 opened
May 1, 2025 -
Update install script and building from source docs
#10652 opened
May 2, 2025 -
[executorch][android] Add Runtime.java to centralize native library l…
#10672 opened
May 3, 2025 -
Introduce PAL function table
#10675 opened
May 4, 2025 -
Clean up eager quant in llm_export
#10684 opened
May 5, 2025 -
Fix preq embedding dtype check
#10699 opened
May 5, 2025 -
Add input size validation to Module.execute
#10701 opened
May 5, 2025 -
Arm backend: Allocate the scratch buffer runtime rather than in the pte
#10714 opened
May 6, 2025 -
openvino_backend doesn't need to be static only
#10732 opened
May 6, 2025 -
[TEST] Split prim ops into its own
#10741 opened
May 7, 2025 -
: constant fold None
#10755 opened
May 7, 2025 -
move pattern
#10756 opened
May 7, 2025 -
Prim ops move 2
#10763 opened
May 7, 2025 -
Qualcomm AI Engine Direct - fix for pytorch uplevel
#10769 opened
May 8, 2025 -
[ET-VK] Removing descriptor pool intialization from DescriptorPool ctor.
#10777 opened
May 8, 2025 -
[ET-VK] Reducing memory wastage by tightening DescriptorPoolConfig values.
#10784 opened
May 9, 2025 -
[ET-VK] Moving device capabilities check to DispatchNode and PrepackNode ctor.
#10785 opened
May 9, 2025 -
Update export and build from source docs
#10807 opened
May 11, 2025 -
Example external model to be used by the ahead of time arm compiler
#10810 opened
May 12, 2025 -
Pass to replace Adaptive Avg. Pool with Aten Avg. Pool
#10818 opened
May 12, 2025 -
Make qwen3 compatible with ao-quantized checkpoints
#10822 opened
May 12, 2025 -
Forward fix on NXP backend
#10829 opened
May 12, 2025 -
[ET-VK] custom memory pools
#10831 opened
May 12, 2025 -
Support prequant qwen3
#10839 opened
May 13, 2025 -
Hook up PreprocessAll flow to EdgeManager
#10842 opened
May 13, 2025 -
Arm backend: Add test for DeiT Tiny for TOSA BI
#10846 opened
May 13, 2025 -
Arm backend: Reenable test_fuse_const_ops_tosa_BI
#10847 opened
May 13, 2025 -
Arm backend: Add DecomposeLinalgVectorNorm pass + tests
#10848 opened
May 13, 2025 -
Arm backend: Refactor misc tests for TOSA V1.0
#10851 opened
May 13, 2025 -
Add a pass to fuse mul.Scalar into dequant
#10853 opened
May 13, 2025 -
[jit] Remove more reference to TorchScript
#10856 opened
May 13, 2025 -
Add pass to convert kwargs to args + populate optional args.
#10857 opened
May 13, 2025 -
Allow setting thread count from Java
#10858 opened
May 13, 2025 -
Fix broken tests
#10866 opened
May 14, 2025 -
Mostly sync BlasKernel.cpp with ATen ReducedPrecisionGemvFastPathKernel
#10868 opened
May 14, 2025 -
Update llama runner README.md
#10869 opened
May 14, 2025 -
Arm backend: Check in tosa.fbs for TOSA 0.80 and 1.0
#10870 opened
May 14, 2025
62 Issues closed by 22 people
-
[CMake] optimized_kernels and quantized_kernel will depend on portable_kernels.
#10677 closed
May 14, 2025 -
Beefing up CONTRIBUTING.md to lower the barrier of entry for external contributors
#9582 closed
May 13, 2025 -
[ET-LLM] Use pytorch-labs/tokenizers in ET
#8376 closed
May 13, 2025 -
Editable mode is error-ing out with flatc message
#8784 closed
May 13, 2025 -
Add complex dtype support to op_sum_dim
#10431 closed
May 13, 2025 -
Add complex dtype support to op_mul
#10430 closed
May 13, 2025 -
Update buck version when there's an April 4 or newer release & clean up after #9890
#9919 closed
May 12, 2025 -
[Build Presets] Create a GitHub workflow foundation
#10715 closed
May 10, 2025 -
where is pytorch_tokenizers.tools.llama2c.convert?
#10571 closed
May 8, 2025 -
How to use tokenizer.json in ExecuTorch Android demo (without tokenizer.model)?
#10745 closed
May 7, 2025 -
Add pcre2 changes to the Llama app
#10555 closed
May 6, 2025 -
Where should debug handle generation live
#10553 closed
May 6, 2025 -
Add HuggingFace tokenizer to executor runner
#9455 closed
May 6, 2025 -
Finalize etLLM components and specs
#10427 closed
May 6, 2025 -
Fix CoreML handling of scalars Rank0 tensors in ET
#10443 closed
May 5, 2025 -
Running executorch on Meta Quest device
#10581 closed
Apr 30, 2025 -
CoreML Partitioner is not able to lower mv3
#10451 closed
Apr 30, 2025 -
custom_ops fails to build when cross-compiling to x64 from arm64
#6839 closed
Apr 30, 2025 -
Dependency failures when installing with cmake v4.0
#10152 closed
Apr 29, 2025 -
ARM baremetal tests are not running properly
#9602 closed
Apr 29, 2025 -
Update Arm Ethos U backend docs using backend template
#8530 closed
Apr 25, 2025 -
Revamp iOS documentations
#7903 closed
Apr 25, 2025 -
Move ExecuTorchDemo iOS app to pytorch-labs/executorch-demo
#8788 closed
Apr 25, 2025 -
Migrate demo apps to executorch-examples repository
#10329 closed
Apr 25, 2025 -
Benchmark: Report memory usage in Android benchmark app
#7988 closed
Apr 24, 2025 -
Support BFloat16 dtype in Android Tensor API
#9881 closed
Apr 24, 2025 -
Remove EXECUTORCH_BUILD_HOST_TARGETS
#9404 closed
Apr 24, 2025 -
Get/set buffer api similar to get/set input/output
#8540 closed
Apr 24, 2025 -
[Android] Add java layer fp16 support
#10371 closed
Apr 24, 2025 -
Add Protected Method Getter in `extension.Module`
#10364 closed
Apr 23, 2025 -
Report avg_inference_latency from Android LLM benchmark app
#8578 closed
Apr 23, 2025 -
Parallelize portable ops if threadpool is available, with fallback to parallel_for-as-for-loop
#8932 closed
Apr 22, 2025 -
Introduce APIs In Bundled Program to Not Take **Method** When Loading Input
#10269 closed
Apr 22, 2025 -
Support Dynamically Quantized Convolutions
#9021 closed
Apr 22, 2025 -
[0.6 Release] Quality testing
#9837 closed
Apr 22, 2025 -
MPS Backend issue - torch._export.verifier.SpecViolationError: Tensor should not be used in dim_order mode
#10215 closed
Apr 22, 2025 -
Does Executorch support BF16 dtype using XNNPACK
#10188 closed
Apr 21, 2025 -
[Android] Improve LlmCallback.onStats()
#10080 closed
Apr 21, 2025 -
Attempting running Minibench on Android, no results generated
#7076 closed
Apr 18, 2025 -
[Android benchmark] Uninstall existing benchmark APK before installing
#8492 closed
Apr 18, 2025 -
[etLLM] extension/llm should have unit tests
#8495 closed
Apr 18, 2025 -
[Release 0.6] wrong inspector output in developer tool tutorial page
#10113 closed
Apr 17, 2025 -
[QCOM] Have a list of pending ops to be supported
#10220 closed
Apr 17, 2025 -
llama iOS demo app documentation page feedback
#10012 closed
Apr 17, 2025 -
Fix stable/ links in readmes for moved pages
#8728 closed
Apr 16, 2025 -
0.6 Release: Building From Source Page
#10014 closed
Apr 16, 2025 -
"Building an ExecuTorch iOS Demo App" Feedback
#10066 closed
Apr 16, 2025 -
[0.6 documentation] Developer Tools: Bundled Program
#10193 closed
Apr 16, 2025 -
[0.6 documentation] Custom kernels section could provide more steps
#10031 closed
Apr 15, 2025 -
Backend Specific Configuration in Runtime
#9459 closed
Apr 15, 2025 -
Verify Mimi's accuracy and performance
#9581 closed
Apr 15, 2025 -
Enable dynamic shapes for torchao:8daXw quant ops
#8981 closed
Apr 15, 2025 -
Release 0.6 llama low-bit kernels
#10166 closed
Apr 15, 2025 -
Recursive initialization of repos
#9783 closed
Apr 14, 2025 -
Improve contributor documentation for Android
#9913 closed
Apr 14, 2025 -
LlaVA Model Loads Sucessfully Failing inference app crashes after image i/p adding logs for Reference
#9233 closed
Apr 14, 2025 -
"There appear to be 1 leaked semaphore objects to clean up at shutdown" while llava export
#5171 closed
Apr 14, 2025 -
Adapting the Qwen 2.5 0.5B model, I encountered a model conversion failure issue on the MTK platform.
#6228 closed
Apr 14, 2025 -
Missing .so lib for training pybinding
#9576 closed
Apr 14, 2025 -
[Release 0.6] arm tests failing
#10111 closed
Apr 14, 2025
82 Issues opened by 30 people
-
Running Qwen3 XNNPACK on Android fails while setting up pretokenizer
#10867 opened
May 14, 2025 -
Make PR label checker non-mandatory
#10864 opened
May 13, 2025 -
Automatically specify params and metadata for export LLM
#10862 opened
May 13, 2025 -
Allow loading downstream HF repos
#10861 opened
May 13, 2025 -
Add first class citizen support for bundling along strings, ints etc. as metadata
#10854 opened
May 13, 2025 -
Create a minimal_executor_runner target
#10830 opened
May 12, 2025 -
Out-variant kernels are lacking BC protection for new default args being added.
#10821 opened
May 12, 2025 -
Consolidate executor_runners
#10819 opened
May 12, 2025 -
Unable to run example script to generate Coreml Models
#10797 opened
May 9, 2025 -
Create a landing page for executorch mobile
#10796 opened
May 9, 2025 -
Remove all instances and usages of torchscript
#10795 opened
May 9, 2025 -
Add timestamps for pte generation in CI
#10761 opened
May 7, 2025 -
Deploying VITA-1.5 Multimodal Model with ExecuTorch
#10757 opened
May 7, 2025 -
[CMake] Duplicated sources in extension_llm_runner and llama_runner
#10746 opened
May 7, 2025 -
Format CMakeLists.txt
#10736 opened
May 6, 2025 -
Make debug handle as first citizen of Export Graph
#10728 opened
May 6, 2025 -
Consolidate debug handle infra from Export Graph, torch.ao to ExecuTorch
#10727 opened
May 6, 2025 -
Python runtime API for operators
#10726 opened
May 6, 2025 -
_load_for_executorch_from_buffer doesn't keep buffer alive
#10725 opened
May 6, 2025 -
[Build Presets] Create a windows-x86_64 preset
#10723 opened
May 6, 2025 -
[Build Presets] Create a linux-x86_64 preset
#10722 opened
May 6, 2025 -
[Build Presets] Create an android-x86_64 preset
#10721 opened
May 6, 2025 -
[Build Presets] Create an android-arm64-v8a preset
#10720 opened
May 6, 2025 -
[Build Presets] Create a ios-simulator-arm64 preset
#10719 opened
May 6, 2025 -
[Build Presets] Create an ios-arm64 preset
#10718 opened
May 6, 2025 -
[Build Presets] Create a macos-arm64 preset
#10717 opened
May 6, 2025 -
[Build Presets] Create foundation and default configurations
#10716 opened
May 6, 2025 -
[Neutron Backend] Move Neutron Backend to Dim Order Representation usage
#10711 opened
May 6, 2025 -
[CMake] Decouple prim_ops from executorch
#10704 opened
May 6, 2025 -
[Android] instrumentation test use models from storage, not bundled with apk
#10696 opened
May 5, 2025 -
Add torchao kernels to xcframework
#10694 opened
May 5, 2025 -
[CMake] Duplicated entries in executorch_srcs.cmake
#10687 opened
May 5, 2025 -
[CMake] Potentially duplicated srcs in llama_runner build
#10686 opened
May 5, 2025 -
[CMake] Enable BUILD_SHARED_LIBS=ON flag
#10676 opened
May 4, 2025 -
Missing out variants: {'torchao::dequantize_affine'}
#10663 opened
May 2, 2025 -
Request for support of ExecuTorch pip package on linux aarch64
#10651 opened
May 2, 2025 -
Support selective build for Android
#10622 opened
May 1, 2025 -
Update quantization overview doc page
#10603 opened
May 1, 2025 -
Question about programmatically running inference on Android-based custom OS with vulkan delegate
#10602 opened
May 1, 2025 -
Advice on how to run the training example in Android
#10593 opened
Apr 30, 2025 -
Need a feature to get etdump while running LLAMA model on qnn with qnn_llama_runner
#10580 opened
Apr 30, 2025 -
runtime oeprator-level numeric issue detector
#10554 opened
Apr 29, 2025 -
Add merge to NamedDataMap API
#10551 opened
Apr 29, 2025 -
Refactor flat_tensor
#10550 opened
Apr 29, 2025 -
Mechanism to detect performance CPU core is not reliable
#10549 opened
Apr 29, 2025 -
Bug in Benchmark Dash UI display
#10524 opened
Apr 28, 2025 -
[exir] MemoryPlanning Verifier assumes that if a model has a user input it has at least 1 tensor input.
#10522 opened
Apr 28, 2025 -
which clang source of bf16 GEMM kernel
#10513 opened
Apr 28, 2025 -
Module.execute does not check input count
#10510 opened
Apr 28, 2025 -
ConvertToLinearPass is not sound when transposes are elided
#10499 opened
Apr 26, 2025 -
[Android] Make QNN backend as a .so library
#10495 opened
Apr 25, 2025 -
[Android] Make vulkan backend as a .so library
#10494 opened
Apr 25, 2025 -
[CMake] Utils.cmake, function name kernel_link_options() and target_link_options_shared_lib() misleading
#10492 opened
Apr 25, 2025 -
[CMake] When should we use static lib and when shared
#10461 opened
Apr 25, 2025 -
[Android] Add an interface to talk to delegates
#10459 opened
Apr 25, 2025 -
[Android] Make libraries hot pluggable
#10457 opened
Apr 24, 2025 -
[Android] Get rid of NativePeer.java
#10456 opened
Apr 24, 2025 -
[Android] Migrate tests to kotlin
#10454 opened
Apr 24, 2025 -
[Android] Add method metadata report API in Module.java
#10453 opened
Apr 24, 2025 -
Extend `PyBundledModule` with `extension.BundledModule`
#10447 opened
Apr 24, 2025 -
[Android] Use generic JNI instead of fbjni
#10444 opened
Apr 24, 2025 -
[Android] Add a Runtime.java
#10439 opened
Apr 24, 2025 -
[Android] Java API for runtime info
#10438 opened
Apr 24, 2025 -
Both optimized and portable arithmetic operators have broadcasting errors
#10421 opened
Apr 23, 2025 -
MTK Buck version
#10407 opened
Apr 23, 2025 -
[minibench] error report when run failed
#10400 opened
Apr 23, 2025 -
Undefined symbol while compiling runtime for MediaTek
#10389 opened
Apr 23, 2025 -
Add `extension.BundledModule` to Wrap `extension.Module` with Bundled Program Logic
#10375 opened
Apr 22, 2025 -
Migrate ExecuTorch documentation to the new theme
#10366 opened
Apr 22, 2025 -
ExecuTorch LLM runner does not stop on phi-4-mini's stop token
#10365 opened
Apr 22, 2025 -
Trying to convert torch.multinomial to PTE model
#10315 opened
Apr 19, 2025 -
executorch-ubuntu-22.04-clang12-android pull docker image is much slower
#10308 opened
Apr 18, 2025 -
executorch model Inference time is higher than the torch model
#10297 opened
Apr 18, 2025 -
[QCOM] Warning when exporting the model
#10281 opened
Apr 17, 2025 -
runtime executor test model export failure on Mac
#10257 opened
Apr 17, 2025 -
[QCOM] [Llama] the size of w4a16 quantized Llama 3.2 1B Pte is too large
#10226 opened
Apr 16, 2025 -
[QCOM] Support stable diffusion 2.1 on SM8750
#10209 opened
Apr 15, 2025 -
Integrate quantize_ into executorch.export
#10203 opened
Apr 15, 2025 -
Add base recipes for XNNPack, CoreML and QNN
#10202 opened
Apr 15, 2025 -
Onboard a streaming mode model similar to Moshi.
#10177 opened
Apr 14, 2025 -
Report device health metrics for AWS leased devices
#10141 opened
Apr 14, 2025
71 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Recipe and Input class definitions with e2e export
#10034 commented on
May 14, 2025 • 52 new comments -
Qualcomm AI Engine Direct - Enable custom operator
#8726 commented on
May 14, 2025 • 5 new comments -
[docs][ez] Fix doc build workflow
#8079 commented on
May 13, 2025 • 0 new comments -
Revert to use mean_out than mean_dim_out
#8021 commented on
May 13, 2025 • 0 new comments -
remove the exec_aten namespace
#8018 commented on
May 13, 2025 • 0 new comments -
Fix comment in memory_planning.py
#8010 commented on
May 13, 2025 • 0 new comments -
fix spec_prop_pass
#7974 commented on
May 13, 2025 • 0 new comments -
use dim order in all backend examples
#7953 commented on
Apr 15, 2025 • 0 new comments -
Remove ExecuTorch copy of Vectorized
#7042 commented on
Apr 28, 2025 • 0 new comments -
[pytorch hash update] update the pinned pytorch hash
#4589 commented on
May 14, 2025 • 0 new comments -
Do not link quantized ops libraries into portable_lib by default
#4039 commented on
May 5, 2025 • 0 new comments -
Building ExecuTorch on RPi5 with Clang 14.0.6 fails due to bfloat incompatibility
#8924 commented on
May 14, 2025 • 0 new comments -
llama3.2 1B model run on QNN backend produce wrong result
#5929 commented on
May 12, 2025 • 0 new comments -
Query regarding support of Executorch for ARM Ethos-U65 backend
#9356 commented on
May 12, 2025 • 0 new comments -
Return "platform not supported" when using PyTorch on intel-based Macbooks
#9772 commented on
May 8, 2025 • 0 new comments -
Query embeddings using executorch generate API
#8517 commented on
May 5, 2025 • 0 new comments -
SM8750 Htp not added in Android apk building
#8454 commented on
May 1, 2025 • 0 new comments -
Windows build failure: Failed to query buck for sources.
#9616 commented on
May 1, 2025 • 0 new comments -
Don't Serialize Scales/ZP in Flatbuffer
#9029 commented on
Apr 15, 2025 • 0 new comments -
[TEST] Try to build Android C++ one pass
#10124 commented on
Apr 21, 2025 • 0 new comments -
[ET-VK] Minor improvement to permute op.
#10117 commented on
Apr 16, 2025 • 0 new comments -
[ET-VK] Modify quantized linear naive shader to linearly dispatch work to improve performance.
#10116 commented on
Apr 16, 2025 • 0 new comments -
Qualcomm AI Engine Direct - Add rewrite function of observer
#10093 commented on
May 1, 2025 • 0 new comments -
Add some basic xnnpack recipes
#10035 commented on
May 14, 2025 • 0 new comments -
[ET-VK] Tuning native layer norm local workgroup size to improve thread occupancy during reduce.
#9984 commented on
Apr 16, 2025 • 0 new comments -
[WIP] Devtool end-to-end tests
#9925 commented on
Apr 18, 2025 • 0 new comments -
[ET-VK] Minor performance improvements to native layer norm.
#9892 commented on
Apr 16, 2025 • 0 new comments -
DO NOT COMMIT test for u55 + mv2
#9830 commented on
May 5, 2025 • 0 new comments -
Migrate elementwise_util callers to the variants with out_dtypes in template arguments
#9741 commented on
Apr 23, 2025 • 0 new comments -
Add SupportedTensorDtypes::{BOOL,REALH}
#9584 commented on
Apr 22, 2025 • 0 new comments -
Added support for bias in optimized linear operation
#9527 commented on
May 5, 2025 • 0 new comments -
Add vectorization in elementwise_util
#9432 commented on
Apr 23, 2025 • 0 new comments -
Add small check when input type is a list
#9186 commented on
May 8, 2025 • 0 new comments -
Neuron buffer allocator decouple from ExecuTorch framework
#8760 commented on
Apr 25, 2025 • 0 new comments -
[devtool] create stream_data_sink
#8604 commented on
May 13, 2025 • 0 new comments -
Adjust tolerance for quantized XNN conv1d tests
#8093 commented on
May 13, 2025 • 0 new comments -
Python apis for loading and saving .ptd from dictionaries
#8542 commented on
Apr 15, 2025 • 0 new comments -
RFC: Decoder only LLM runner API
#9341 commented on
Apr 15, 2025 • 0 new comments -
Qwen2.5 spinquant support
#9127 commented on
Apr 14, 2025 • 0 new comments -
Add dtype selective build for OSS
#9983 commented on
Apr 14, 2025 • 0 new comments -
Add dtype selective build for optimized ops
#10069 commented on
Apr 14, 2025 • 0 new comments -
[Request impl] Gracefully error out in ETDump
#9971 commented on
Apr 14, 2025 • 0 new comments -
Changed name and qnn_executor different path
#9784 commented on
Apr 14, 2025 • 0 new comments -
Upgrade QNN support to latest version
#9806 commented on
Apr 14, 2025 • 0 new comments -
Make ExecuTorch Q/DQ representation default and resilient
#9852 commented on
Apr 14, 2025 • 0 new comments -
CoreML model works with torch.jit.trace, but not torch.export.export
#9506 commented on
Apr 14, 2025 • 0 new comments -
Run arm tests in OSS CI unittest-buck
#9476 commented on
Apr 14, 2025 • 0 new comments -
[etLLM] New config system to export_llama
#9449 commented on
Apr 14, 2025 • 0 new comments -
Recipe and Input class definition for executorch.export API
#9366 commented on
Apr 14, 2025 • 0 new comments -
Llava 1.5 poor output quality in iOS app
#9183 commented on
Apr 14, 2025 • 0 new comments -
Run CoreML backend tests in CI
#9115 commented on
Apr 14, 2025 • 0 new comments -
StablediffusionV2.1 on MI15 device
#9080 commented on
Apr 14, 2025 • 0 new comments -
_load_for_executorch pybinding cannot load model for training
#4908 commented on
Apr 14, 2025 • 0 new comments -
Check tensor's dim order ambiguity in IR verifier
#9942 commented on
Apr 30, 2025 • 0 new comments -
Support CoreML export on Linux
#9800 commented on
Apr 29, 2025 • 0 new comments -
Inconsistent logo between GitHub repo and documentation site
#8163 commented on
Apr 25, 2025 • 0 new comments -
[v0.6.0] Release Tracker
#9253 commented on
Apr 25, 2025 • 0 new comments -
[Android] Error running a transformer decoder-based neural network
#8665 commented on
Apr 24, 2025 • 0 new comments -
Android: Tensor type is not very friendly to BFloat16
#6571 commented on
Apr 23, 2025 • 0 new comments -
insert_write_back_for_buffers_pass should inject copy_ nodes at the earliest possible spot.
#7345 commented on
Apr 23, 2025 • 0 new comments -
Refactor binary op partitioner configs under binary op config class
#9024 commented on
Apr 23, 2025 • 0 new comments -
[Request impl] Devtool end-to-end tests
#9778 commented on
Apr 22, 2025 • 0 new comments -
[Request Impl] Apply Segment Serialization in Bundled Program
#9771 commented on
Apr 22, 2025 • 0 new comments -
Add GenerateFromPoS in Android LLAMA API
#8290 commented on
Apr 21, 2025 • 0 new comments -
Support exporting QNN models with Python wheels out-of-the-box
#9474 commented on
Apr 21, 2025 • 0 new comments -
[Android] Add flavors (XNNPACK, QNN) in android-release-artifacts.yml
#10042 commented on
Apr 21, 2025 • 0 new comments -
Temperature Settings in Android llama demo (Mediatek)
#8878 commented on
Apr 18, 2025 • 0 new comments -
Add Android model E2E test
#9550 commented on
Apr 17, 2025 • 0 new comments -
Issues with deloyment on RP2040
#7177 commented on
Apr 17, 2025 • 0 new comments -
What commercial MCU should I choose to achieve on-device learning with MCU?
#9982 commented on
Apr 16, 2025 • 0 new comments -
Is the QNN backend support the model of Llama 3.2 3B instead of XNNPACK?
#9311 commented on
Apr 16, 2025 • 0 new comments