alexwengg commited on
Commit
f30fb77
·
verified ·
1 Parent(s): 77c9940

Upload 25 files

Browse files
Files changed (25) hide show
  1. iteration_2/README.md +60 -0
  2. iteration_2/packages/bert.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  3. iteration_2/packages/bert.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  4. iteration_2/packages/bert.mlpackage/Manifest.json +18 -0
  5. iteration_2/packages/decoder_pre.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  6. iteration_2/packages/decoder_pre.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  7. iteration_2/packages/decoder_pre.mlpackage/Manifest.json +18 -0
  8. iteration_2/packages/decoder_upsample.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  9. iteration_2/packages/decoder_upsample.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  10. iteration_2/packages/decoder_upsample.mlpackage/Manifest.json +18 -0
  11. iteration_2/packages/duration_predictor.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  12. iteration_2/packages/duration_predictor.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  13. iteration_2/packages/duration_predictor.mlpackage/Manifest.json +18 -0
  14. iteration_2/packages/fused_diffusion_sampler.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  15. iteration_2/packages/fused_diffusion_sampler.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  16. iteration_2/packages/fused_diffusion_sampler.mlpackage/Manifest.json +18 -0
  17. iteration_2/packages/fused_f0n_har_source.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  18. iteration_2/packages/fused_f0n_har_source.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  19. iteration_2/packages/fused_f0n_har_source.mlpackage/Manifest.json +18 -0
  20. iteration_2/packages/ref_encoder.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  21. iteration_2/packages/ref_encoder.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  22. iteration_2/packages/ref_encoder.mlpackage/Manifest.json +18 -0
  23. iteration_2/packages/text_encoder.mlpackage/Data/com.apple.CoreML/model.mlmodel +3 -0
  24. iteration_2/packages/text_encoder.mlpackage/Data/com.apple.CoreML/weights/weight.bin +3 -0
  25. iteration_2/packages/text_encoder.mlpackage/Manifest.json +18 -0
iteration_2/README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # StyleTTS2 → CoreML iteration_2
2
+
3
+ Production-ready fp32 mlpackages adopting Trials 4 + 6 + 8b from
4
+ `coreml/fusions.md`.
5
+
6
+ ## Pipeline (8 stages, 8 dispatches)
7
+
8
+ ```
9
+ text_encoder → CPU_ONLY fp32 21 MB
10
+ bert → ALL fp32 23 MB
11
+ ref_encoder → CPU_AND_GPU fp32 106 MB
12
+ fused_diffusion_sampler → ALL fp32 94 MB ← Trial 4 (replaces diffusion_unet × 8)
13
+ duration_predictor → CPU_ONLY fp32 30 MB
14
+ fused_f0n_har_source → CPU_ONLY fp32 32 MB ← Trial 6 (replaces f0n_predictor + har_source)
15
+ decoder_pre → CPU_AND_NE fp32 128 MB
16
+ decoder_upsample → CPU_ONLY fp32 79 MB
17
+ ```
18
+
19
+ Total: **514 MB**, 8 mlpackages, 8 dispatches per utterance.
20
+
21
+ ## Performance
22
+
23
+ Warm latency on M-series Mac, single-process, no other GPU/ANE workloads:
24
+
25
+ * Pipeline warm: **~480–565 ms** (down from ~1030 ms baseline)
26
+ * Stage count: 9 → 8 (Trials 4 + 6)
27
+ * Dispatches per utterance: 16 → 8 (−50%)
28
+
29
+ See `coreml/fusions.md` for full trial history, latency tables, parity
30
+ chains, and per-stage placement sweep results.
31
+
32
+ ## Adopted trials
33
+
34
+ | Trial | Change | Save |
35
+ |-------|------------------------------------------------------|------|
36
+ | 4 | fused 5-step ADPM2 sampler (8 dispatches → 1) | −437 ms warm |
37
+ | 6 | fused f0n_predictor + har_source | −42 ms warm |
38
+ | 8b | bert→ALL, ref_encoder→CPU_AND_GPU, sampler→ALL | small but stable |
39
+
40
+ ## Skipped / dropped
41
+
42
+ | Trial | Outcome |
43
+ |-------|------------------------------------------------------|
44
+ | 5 | har + decoder_upsample fuse — partition tax (+290 ms) |
45
+ | 7 | ref_encoder + sampler fuse — partition tax (200 MB graph) |
46
+ | 8a | aggressive `decoder_upsample → ALL` — bimodal 322–759 ms |
47
+ | 9 | `_hifigan_shift` fold — sub-1 ms saving, dominated by Trial 8 |
48
+
49
+ ## Usage
50
+
51
+ Drop `packages/` into `models/tts/styletts2/coreml/` (or symlink) and
52
+ run `python -m coreml.inference` from the styletts2 root. The
53
+ `_STAGE_COMPUTE` and `_STAGE_PRECISION` manifests in
54
+ `coreml/inference.py` are wired to load these by default.
55
+
56
+ To compare against the legacy 9-package path:
57
+
58
+ ```bash
59
+ python -m coreml.inference --no-fused
60
+ ```
iteration_2/packages/bert.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8012da86238950009773b88ece84d6de35ba9b5c70fc6168aba3a1eb7cc8064b
3
+ size 75521
iteration_2/packages/bert.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d1ea7dabcc541d818a55e746da625e22bc5937b6fb20ee7a838627145e92ef
3
+ size 24179328
iteration_2/packages/bert.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "1BC2E76C-B9B2-4C7B-AED4-1E889E8F7FBE": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "C180CF2C-D86C-42D0-AD5E-47F193744BC3": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "C180CF2C-D86C-42D0-AD5E-47F193744BC3"
18
+ }
iteration_2/packages/decoder_pre.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:950ae5f552e2d1238b36c3da24a1c63a2983a3ef2969e1a6e42ef2c21fabbb5b
3
+ size 49028
iteration_2/packages/decoder_pre.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ae8235d8dda1e118c01137ee51b8aef02d6e0533831fea5c2bec6177d9fa3f9
3
+ size 134378304
iteration_2/packages/decoder_pre.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "2C253E60-4805-4B39-9657-E2C62926792B": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "9CE7F5C2-332A-49A7-B10C-2522C7E06365": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "9CE7F5C2-332A-49A7-B10C-2522C7E06365"
18
+ }
iteration_2/packages/decoder_upsample.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cef66932900f7c2bd6f15405523542c06f612bb035fd4d41a6ac9e26c6e7b0d3
3
+ size 425250
iteration_2/packages/decoder_upsample.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:625ec116b48cba8e9999fcd45d756327d9726dff343331c29c279137d27193d2
3
+ size 82761984
iteration_2/packages/decoder_upsample.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "24CCEF67-C2F2-4AF9-B311-8CDC60B5FBBB": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Specification",
7
+ "name": "model.mlmodel",
8
+ "path": "com.apple.CoreML/model.mlmodel"
9
+ },
10
+ "29AFBB19-CEF9-48B5-9A99-B3B2152401DA": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Weights",
13
+ "name": "weights",
14
+ "path": "com.apple.CoreML/weights"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "24CCEF67-C2F2-4AF9-B311-8CDC60B5FBBB"
18
+ }
iteration_2/packages/duration_predictor.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13b19f796a2a8b7a7a9eb7be43c6de2d9b6030339c496b5d11ab1ee4f243f426
3
+ size 25605
iteration_2/packages/duration_predictor.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6ce5a0e45ed5d42bbb893191d2e3c27e334d41f30183c5cca87cbf1fc7d62b86
3
+ size 31084928
iteration_2/packages/duration_predictor.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "BDD110CC-1B78-45C1-A80C-1B0B4B2B2CE2": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Specification",
7
+ "name": "model.mlmodel",
8
+ "path": "com.apple.CoreML/model.mlmodel"
9
+ },
10
+ "F5A8FF41-763F-4328-ABB7-AF96AC9711F9": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Weights",
13
+ "name": "weights",
14
+ "path": "com.apple.CoreML/weights"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "BDD110CC-1B78-45C1-A80C-1B0B4B2B2CE2"
18
+ }
iteration_2/packages/fused_diffusion_sampler.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e05e33d4729be4a613135117b85a7aff75116fee83d92621ccf16f252ba9553b
3
+ size 271397
iteration_2/packages/fused_diffusion_sampler.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b786a3779f4c7c4572f5bb16f48756a0ae8988f3b53145307f7afbf08822392
3
+ size 98720192
iteration_2/packages/fused_diffusion_sampler.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "70286131-23BE-4491-8EDF-C1D6B030B687": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "C9EF35A4-A334-4775-85FB-FCE928373822": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "C9EF35A4-A334-4775-85FB-FCE928373822"
18
+ }
iteration_2/packages/fused_f0n_har_source.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c98d56382c336dbea5a864e594022c11ced07372687dc66e56c7c91795e45c63
3
+ size 61645
iteration_2/packages/fused_f0n_har_source.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce01b0523d8e925108b5f9ba113d5ff1721bb3e5e25f3eb6d8a2dfaa56876c59
3
+ size 33640448
iteration_2/packages/fused_f0n_har_source.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "696D6238-15F9-4A4A-AB5B-3957A847EB37": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "8111F480-F3EF-48DE-91B8-5309A11BF779": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "8111F480-F3EF-48DE-91B8-5309A11BF779"
18
+ }
iteration_2/packages/ref_encoder.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c5ca9ec42449e2ffe1c8558f569c78bc1644b73dc91fa1b86390b71c54156cd2
3
+ size 63752
iteration_2/packages/ref_encoder.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:73357e91e4dccfe3159f735e9cca857cb0ebd52d0699fb6e5c87abd08ff56ee3
3
+ size 110767808
iteration_2/packages/ref_encoder.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "01330047-9152-4BED-8B50-2EC2B6B08F4D": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "1501890B-8915-442A-934F-918E83B55691": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "1501890B-8915-442A-934F-918E83B55691"
18
+ }
iteration_2/packages/text_encoder.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:adec64010c7b89aabbce6561eac0d4ffb4fe226025f5475c24ab9b0cee800ff5
3
+ size 14083
iteration_2/packages/text_encoder.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a59f5a30389f40e82f0bb8b44d025f8fcc8914462470db6c6d5d7a83e65eb3c
3
+ size 22414656
iteration_2/packages/text_encoder.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "68619DB6-4DCD-4F09-A236-AEBEF170ADA8": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "F55399A3-FE64-43B0-A99C-197E1861DC1D": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "F55399A3-FE64-43B0-A99C-197E1861DC1D"
18
+ }