Name	Name	Last commit message	Last commit date
parent directory ..
python	python
README.md	README.md
main.rs	main.rs

Name

Last commit message

Last commit date

candle-marian-mt

marian-mt is a neural machine translation model. In this example it is used to translate text from French to English. See the associated model card for details on the model itself.

Running an example

cargo run --example marian-mt --release -- \
    --text "Demain, dès l'aube, à l'heure où blanchit la campagne, Je partirai. Vois-tu, je sais que tu m'attends. J'irai par la forêt, j'irai par la montagne. Je ne puis demeurer loin de toi plus longtemps."

<NIL> Tomorrow, at dawn, at the time when the country is whitening, I will go. See,
I know you are waiting for me. I will go through the forest, I will go through the
mountain. I cannot stay far from you any longer.</s>

Changing model and language pairs

$ cargo run --example marian-mt --release -- --text "hello, how are you." --which base --language-pair en-zh

你好,你好吗?

Generating the tokenizer.json files

The tokenizer for each marian-mt model was trained independently, meaning each new model needs unique tokenizer encoders and decoders. You can use the ./python/convert_slow_tokenizer.py script in this directory to generate the tokenizer.json config files from the hf-hub repos. The script requires all the packages in ./python/requirements.txt or ./python/uv.lock to be installed, and has only been tested for python 3.12.7.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

candle-marian-mt

Running an example

Changing model and language pairs

Generating the tokenizer.json files

FilesExpand file tree

marian-mt

Directory actions

More options

Directory actions

More options

Latest commit

History

marian-mt

Folders and files

parent directory

README.md

candle-marian-mt

Running an example

Changing model and language pairs

Generating the tokenizer.json files