AutArch is an AI-assisted workflow capable of creating uniform archaeological datasets from heterogeneous published resources. The implementation provided here takes large and unsorted PDF files as input, and uses neural networks to conduct image processing, object detection, and classification.
The recommended way to run AutArch is to use the prepackaged zip file provided here.
1. Download the 'autarch public.zip' file and unpack the contents into one folder (labelled e.g. AutArch).
2. Download Docker Desktop with the WSL 2 backend: https://docs.docker.com/desktop/features/wsl/ . Install Docker Desktop on your computer and restart it. Open the Docker Desktop app and agree to the terms of service.
If Docker Desktop does not start, press Ctrl + Alt + Delete and end any instance of Docker running in the background. If you have not installed WSL before, you will get an error message saying that you need to update WSL. Open Windows PowerShell and download/update WSL by entering the following command:
wsl --update
Reopen Docker Desktop and press 'Finish'. You should see the start page with the container overview.
Make sure that Docker Desktop is running in the background.
3. Open Windows PowerShell and change directory to the unpacked folder you created. Note that unpacking may create an autarch folder within the autarch folder. Check that you are in the right directory by entering dir.
All unpacked files should now be listed in this directory. Create the Docker image by using the following command:
docker compose --progress plain build
4. Make sure that Docker Desktop is running in the background.
Go back to the PowerShell, change to the AutArch directory using cd, and type the following command:
docker compose up
Proceed to running AutArch, leaving the terminal open in the background. You may have to wait a few minutes.
5. Go to your internet browser (Chrome, Edge, etc.) and enter one of the following addresses:
http://localhost:3000 or http://127.0.0.1:3000
You should be at the homepage of the AutArch software.
6. We recommend uploading a new publication that contains grave drawings. Click 'Upload Publication', choose a PDF file to upload, enter the article information, then press 'Create Publication'.
Analysing the PDF may take a while, depending on the document. Note: The Docker environment supplied will only rely on the CPU. Certain aspects of the processing of PDFs will be slower than if the environment had a sufficient GPU available.
7. Once the analysis of the publication is done, proceed to 'Graves' and filter by the publication you just uploaded. The uploaded publication should be available in the list, which is ordered alphabetically. Select it and click 'filter'.
If graves have been successfully detected, these will show in the list below. You can make edits by clicking the 'edit' button. Follow the steps one by one until all graves have been processed or click 'Graves' to return to the list.
8. Click 'Publications', select any publication from the list and click 'Stats' for a graphical overview of some of the results (e.g. orientation of the graves, whole-outline analysis).
AutArch offers many other functionalities, such as comparing publications, mapping results, etc. See Workflow below for more information.
9. Close the AutArch tab in your internet browser. In the PowerShell, press Ctrl + C to stop the process.
10. To restart AutArch, open Docker Desktop, then open the PowerShell and repeat Step 4.
AutArch should run on most systems, but the performance of the ML models depends heavily on the availability of a PyTorch-supported graphics card; please consult the PyTorch manual. The current configuration has been successfully tested on an Nvidia RTX 2060 with 8GB of dedicated GPU memory, an AMD Ryzen 9 7900X3D and 64GB of DDR5 RAM. AutArch falls back to the CPU in case it cannot detect a supported GPU.
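The fallback follows the standard PyTorch device-selection pattern; a minimal sketch (illustrative only, not AutArch's actual code):

```python
import torch

# Prefer a CUDA-capable GPU when PyTorch can detect one, otherwise use the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Models and tensors are then moved to whichever device was selected,
# e.g. model.to(device).
print(device)
```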
Some of the ML models can be replaced; please refer to the AutArch material repository for downloads. To replace the models, please look at scripts/train_object_detection.py#get_model.
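For orientation, a get_model function for a torchvision detection model typically has the following shape. This is a hedged sketch: num_classes, weights_path and the choice of Faster R-CNN are assumptions, so check scripts/train_object_detection.py for the actual definition.

```python
import torch
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

def get_model(num_classes, weights_path=None):
    # Start from a COCO-pretrained Faster R-CNN backbone.
    model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
    # Swap the box-predictor head so it outputs the desired number of classes.
    in_features = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)
    if weights_path is not None:
        # Load replacement weights, e.g. downloaded from the material repository.
        model.load_state_dict(torch.load(weights_path, map_location="cpu"))
    return model
```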
Name | Contribution | Email | GitHub
---|---|---|---
Kevin Klein | Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing | [email protected] | github
Antoine Muller | Data curation, Formal analysis, Investigation, Visualization, Writing | |
Alyssa Wohde | Methodology, Writing | |
Alexander V. Gorelik | Data curation, Investigation, Resources, Writing | |
Volker Heyd | Validation | |
Ralf Lämmel | Validation, Writing | |
Yoan Diekmann | Writing, Validation | |
Maxime Brami | Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Validation, Writing | [email protected] |
Publications can be imported under Publications -> Import.
After the import has completed, the publication is available under Publications. It is recommended to go to the annotation screen and add all false negative objects, i.e. objects that the detector missed.
To review all the graves, go to the grave screen. Use the filter on the top to select the publications just uploaded. Then click the edit button of the first grave on the list.
The ID assigned to the burial by the authors in the source publication is recorded. In case multiple images of the same grave are shown, the software will prevent duplicates in the results using this ID. In this step, the expert also has the option to discard drawings incorrectly classified as a grave.
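Schematically, this ID-based de-duplication amounts to keeping one record per publication and grave ID; the sketch below is illustrative only (the record layout is hypothetical, not AutArch's internal representation):

```python
def deduplicate(graves):
    # Keep one record per (publication, grave ID) pair; later
    # duplicates of the same grave are dropped.
    seen = set()
    unique = []
    for grave in graves:
        key = (grave["publication"], grave["grave_id"])
        if key not in seen:
            seen.add(key)
            unique.append(grave)
    return unique
```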
Graves can be assigned to specific sites. Sites can be added here.
Graves can be given arbitrary tags to discern them and allow for filtering in the overview map. Tags can be added here.
Correcting bounding boxes. The user can manually add, remove or change the bounding box assigned to a specific grave. Potential tasks include selecting a different scale on the page, resizing bounding boxes because they do not fully encapsulate an object, or marking north arrows that were initially missed by object detection. During this step, a manual arrow has to be drawn for every skeleton, following the spine and pointing towards the skull; this is necessary to determine the orientation of the skeleton in the grave. Several automated steps are then performed: the contours are calculated using the new bounding boxes and the resulting changes in measurements are saved; the orientation of the north arrow and the deposition type of the skeleton are updated using their respective neural networks; and the analysis of the scale is performed again.
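For illustration, the orientation implied by such a spine arrow can be computed as a bearing from its two endpoints. The sketch below is hypothetical (the function and argument names are invented) and yields a page-relative angle, which the detected north arrow then relates to a real-world orientation:

```python
import math

def arrow_bearing(tail, head):
    # Bearing in degrees, clockwise from "up" on the page, of the
    # vector from the pelvis end (tail) to the skull end (head).
    dx = head[0] - tail[0]
    dy = tail[1] - head[1]  # image y grows downwards, so flip it
    return math.degrees(math.atan2(dx, dy)) % 360

print(arrow_bearing((100, 200), (100, 100)))  # 0.0: skull towards the page top
```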
All detected outlines in relation to one particular grave are highlighted, allowing the user, if any issue arises, to return to the previous step and fit, for instance, a manual bounding box around the grave or cross-section to indicate the width, length or depth.
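Outline extraction of this kind is commonly done with OpenCV, which AutArch depends on; the following Python sketch shows the general idea rather than the project's actual C++ implementation, and the file name is illustrative:

```python
import cv2

# Load the cropped drawing as a grayscale image.
image = cv2.imread("grave_crop.png", cv2.IMREAD_GRAYSCALE)
# Binarize: drawings are dark ink on a light background.
_, binary = cv2.threshold(image, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
# Extract the outer contours of the drawing.
contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
# The largest contour by area is a reasonable candidate for the outline.
outline = max(contours, key=cv2.contourArea)
```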
The next step is to validate the scale by checking the text indicating the real-world length of the scale. Once this step is completed, all measurements are updated with the new scale information. In case no individual scale is provided and the publication uses a fixed scale, e.g. all drawings are 1:20, a different screen is shown. In this screen, the actual height of the page (in cm) has to be entered manually, together with the ratio of the drawing. This way, all measurements can be calculated in the absence of a scale and the results are fully compatible with scaled publications.
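The underlying conversion is straightforward; a sketch under assumed names (not AutArch's actual code):

```python
def real_world_length_cm(length_px, page_height_px, page_height_cm, drawing_ratio):
    # drawing_ratio is the denominator of the drawing scale,
    # e.g. 20 for a 1:20 drawing.
    cm_per_pixel = page_height_cm / page_height_px
    return length_px * cm_per_pixel * drawing_ratio

# A 150 px long grave on a 3000 px scan of a 29.7 cm page at 1:20
# corresponds to 150 * (29.7 / 3000) * 20 = 29.7 cm in the real world.
print(real_world_length_cm(150, 3000, 29.7, 20))
```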
The angle of the north arrow can be adjusted manually based on a preview. In case an arrow is missing in the drawing, this screen will be skipped and size measurements and contours will still be collected without the orientation.
Finally, the pose of all skeletons has to be validated, which (for now) consists of “unknown”, “flexed on the side” or “supine”. As described above, a neural network will set the initial body position, but it can be adjusted manually. Further positions could easily be added in the future. “Unknown” is used in cases where skeletal remains are visible, but no position can be identified.
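Schematically, this step maps the network's scores onto the label set, so adding a further position is a matter of appending a label and retraining; the interface below is an assumption, not AutArch's actual code:

```python
POSES = ["unknown", "flexed on the side", "supine"]  # current label set

def predict_pose(scores):
    # Pick the pose with the highest network score (argmax).
    return POSES[max(range(len(POSES)), key=lambda i: scores[i])]

print(predict_pose([0.1, 2.3, 0.4]))  # 'flexed on the side'
```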
Under Publications, publications can be analyzed using the analyze link shown for every publication. The resulting page can be used to compare publications by selecting them from the top. Note that only 4 publications can be compared at the same time.
This installation guide is intended for Ubuntu Linux. Windows systems are not natively supported, but WSL is known to work. Installation procedures on other Linux distributions will be similar.
AutArch requires the following packages:
$ sudo apt install libpq-dev postgresql libopencv-dev tesseract-ocr redis-server libvips42 build-essential zlib1g-dev libncurses5-dev libgdbm-dev libnss3-dev libssl-dev libreadline-dev libffi-dev wget libbz2-dev
To manage the installations of Ruby, Python and Node.js, asdf is recommended. Please refer to the asdf guide in case of any problems.
After installing asdf, install these asdf plugins:
$ asdf plugin add nodejs https://github.com/asdf-vm/asdf-nodejs.git
$ asdf plugin add ruby https://github.com/asdf-vm/asdf-ruby.git
$ asdf plugin add python
Clone this repository first. To install the necessary versions of the languages mentioned above:
$ cd autarch
$ asdf install
To install ruby dependencies:
$ gem install bundler
$ bundle install
Change your postgres password:
$ sudo su postgres -c psql
> alter user postgres with password '[your-password]';
Change the password in 'config/database.yml' for the development database settings to the one you chose.
To compile the C++ extensions:
$ cd image_processing
$ ruby extconf.rb
$ make
Download the database dump from the autarch supplementary repo and load the dump into your copy of postgres:
$ chmod a+x bin/rails
$ bin/rails db:create
$ cat autarch.sql | psql -h localhost -U postgres comove_development
Alternatively, to start with an empty database, load the schema instead of the dump:
$ chmod a+x bin/rails
$ bin/rails db:create
$ bin/rails db:schema:load
Install JS dependencies (the cmdtest package ships a conflicting yarn command, so remove it first):
$ sudo apt purge cmdtest
$ npm install --global yarn
$ yarn
AutArch was tested with PyTorch 2.4.1. Other compatible versions may work as well. For the best performance a GPU is highly recommended.
To install Torch, please consult the torch installation guide. Please note that asdf installs Python 3.12.7; consult the asdf documentation in case you want your system Python installation to be used or you want to use a different version.
$ pip install numpy pillow bottle
Download the models from the [autarch supplementary repo](https://github.com/kevin-klein/autarch-material) and copy them to a models folder inside the AutArch folder.
You need to start four different components: the Rails server, Shakapacker, the Python ML service, and the Sidekiq background worker.
To run shakapacker:
$ chmod a+x bin/shakapacker-dev-server
$ bin/shakapacker-dev-server
To run rails:
$ bin/rails s
To start the python ml service:
$ python scripts/torch_service.py
To start background jobs:
$ bundle exec sidekiq
After all the services have successfully loaded, AutArch is accessible at http://localhost:3000.
Currently, the same copyright applies to this code as to the text and the other supplementary material. With the publication of the article, the AutArch source code will be made publicly available under the GPL license.