Support for images #5681
Replies: 6 comments 15 replies
-
Currently, we are determining the file type by extension. I think we could do the same for images. Let's start with supporting some popular formats, and implement new ones per users requests.
I guess that depends on what we are actually expecting from |
Beta Was this translation helpful? Give feedback.
-
After thinking about it, I lean towards forcing users to be explicit about images. Thinking about a use case like computer vision, there may be lots of image files in the raw data, and then a small number of images used for reporting, plots, etc. It might be important to have dvc only manage the specified images.
Do you see this as being part of |
Beta Was this translation helpful? Give feedback.
-
I don't really see why this needs to be core DVC functionality. DVC should try to be data-type agnostic, and I'm not sure that setting a precedent for handling specific file types is a good idea. To me, this sounds like a use case where a user should be using |
Beta Was this translation helpful? Give feedback.
-
It seems that subject raised in this discussion is also discussed in iterative/enhancement-proposals#4 |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
How should image plots be identified? Here are a few proposals (feel free to suggest others):
These could also be combined so that there is a default behavior but also ways to be more flexible (similar to params reading automatically from |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Image files can be important in dvc projects because:
I initially thought this was a straightforward feature to implement, but I think there are questions to discuss:
dvc images diff
)? Should it support comparison of two images likemetrics diff
or more than two likeplots diff
?image: true
option under advc.yaml
output)? This might be helpful to show/diff all images in a project, and to enable support for apps like viewer to show all images. Could image files instead be inferred from file extensions?Some examples of diffing images:
https://docs.github.com/en/github/managing-files-in-a-repository/rendering-and-diffing-images
https://github.com/ewanmellor/git-diff-image
https://oscarnajera.com/2017/11/git-diff-images-and-pdfs/
@pared @mnrozhkov @JIoJIaJIu @dmpetrov
Beta Was this translation helpful? Give feedback.
All reactions