💡Doc2Opt

An Effective and Efficient System for Document-Grounded Automatic Optimization Modeling

✨ Overview

The Doc2Opt Framework , which highlights three key designs discussed in the following.

🗂️ 1. Cross-modal retrieval-augmented modeling

The cross-modal retrieval module serves as a semantic filter toretain only relevant document pages for different modeling tasks. The retrieved images of pages and the task description are then fed into a VLM to generate the optimization model as discussed below.

🚀2. Five-element representation

Instead of directly generating the mathematical model, we adopt a more concise and structured five-element formulation as an intermediate representation to improve the reliability and interpretability of the modeling.

🎯 Objective
🔢 Decision Variables
📦 Sets
📊 Parameters
📏 Constraints

Moreover, we instruct the generator to explain the references for the construction of each item in five-element formulation, from which content of the NL description or document pages a specific attribute or constraint is derived, which makes the modeling process tracable and interpretable. Also, it allows an easy refinement of the model by correcting the inappropriate terms based on the explanation.

🔄3. Autonomous Refinement

To achieve more robust and adaptive modeling to improve accuracy, we design a simple yet effective mechanism to automatically evaluate and refine the constructed model.

Inspired bythe LLM-as-a-judge approach, we use another VLM as an evaluator to assess the quality of the model produced by the generator. The evaluator is instructed to analyze the five-element representation and explanation to determine whether the model is required to be refined.

🔧 Usage

To use Doc2Opt, follow these steps:

First, clone the repository:

git clone https://github.com/polarccc/Doc2Opt

Then, install the required dependencies:

pip install -r requirements.txt

We provide two ways to run the project depending on your needs:

🖥️ Option 1: Launch the Interactive App

streamlit run app.py

⚡ Option 2: Run Core Pipeline Directly

python Doc2Opt.py  
    --model "Vision-Language Model" \
    --files "Path/to/your/files" 
    --question "Natural Language Description" \
    --api-key "Your API Key" \
    --base-url "Your API URL"  \

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
files		files
image/README		image/README
prompts		prompts
utils		utils
.gitignore		.gitignore
Doc2Opt.py		Doc2Opt.py
README.md		README.md
app.py		app.py
pipeline.py		pipeline.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

💡Doc2Opt

An Effective and Efficient System for Document-Grounded Automatic Optimization Modeling

✨ Overview

🗂️ 1. Cross-modal retrieval-augmented modeling

🚀2. Five-element representation

🔄3. Autonomous Refinement

🔧 Usage

🖥️ Option 1: Launch the Interactive App

⚡ Option 2: Run Core Pipeline Directly

🖥️ GUI Demo

(a) Document upload for cross-modal retrieval

(b) Task description using natural language

(c) Five-element with explanation

(d) Retrieved pages

(e) Complete model

(f) Manual refinement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

💡Doc2Opt

An Effective and Efficient System for Document-Grounded Automatic Optimization Modeling

✨ Overview

🗂️ 1. Cross-modal retrieval-augmented modeling

🚀2. Five-element representation

🔄3. Autonomous Refinement

🔧 Usage

🖥️ Option 1: Launch the Interactive App

⚡ Option 2: Run Core Pipeline Directly

🖥️ GUI Demo

(a) Document upload for cross-modal retrieval

(b) Task description using natural language

(c) Five-element with explanation

(d) Retrieved pages

(e) Complete model

(f) Manual refinement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages