Conversation

yuyutaotao (Collaborator)

No description provided.

@chatgpt-codex-connector bot left a comment


💡 Codex Review

let imagePayload = screenshotBase64;
let imageWidth = size.width;
let imageHeight = size.height;
const rightLimit = imageWidth;
const bottomLimit = imageHeight;
if (vlMode === 'qwen-vl') {
  const paddedResult = await paddingToMatchBlockByBase64(imagePayload);

P1: Planning requests omit element overlays for non-VL models

The planning pipeline now forwards the original screenshot for non-VL models, handling only the qwen-vl padding case. The prompts used for non-VL planning still tell the model that element IDs are “marked with a rectangle in the screenshot” (prompt/llm-planning.ts, lines ~272–274). With the overlay removed, GPT-4o receives an unannotated screenshot and cannot reliably return element IDs for actions, leading to incorrect or null locate parameters. Reintroduce the overlay, or revise the prompts and their consumers so the model’s assumptions match the data it sees.
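
One way to close the gap would be an explicit non-VL branch next to the qwen-vl padding case, roughly like the sketch below. This is a minimal sketch only: markupImageForLLM’s exact signature and the elementTree argument are assumptions based on this review’s text, not the repo’s verified API.

// Sketch only: restore the rectangle overlay for non-VL planning models.
// Assumes markupImageForLLM(base64, tree, size) returns an annotated base64
// image; the real signature in the repo may differ.
let imagePayload = screenshotBase64;
let imageWidth = size.width;
let imageHeight = size.height;

if (vlMode === 'qwen-vl') {
  // qwen-vl needs only its block-size padding
  const paddedResult = await paddingToMatchBlockByBase64(imagePayload);
  imageWidth = paddedResult.width;
  imageHeight = paddedResult.height;
  imagePayload = paddedResult.imageBase64;
} else if (!vlMode) {
  // non-VL models (e.g. GPT-4o) return element IDs that the prompt says are
  // drawn on the screenshot, so the overlay must be re-applied to match
  imagePayload = await markupImageForLLM(screenshotBase64, elementTree, size);
}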

Comment on lines 176 to 178
  imageWidth = paddedResult.width;
  imageHeight = paddedResult.height;
  imagePayload = paddedResult.imageBase64;


P1: Non-VL element lookup no longer overlays element IDs

The AiLocateElement flow now skips markupImageForLLM whenever vlMode is undefined, so GPT-4o receives the raw screenshot instead of the annotated version. The locator prompt for non-VL models still asserts that non-text elements “have been highlighted” and expects IDs marked in the image (prompt/llm-locator.ts, around lines 83–86). Without those overlays the model cannot map the textual element list to the screenshot, so element searches for default models will degrade or fail outright. Consider restoring the markup step or updating the prompt and downstream logic to align with the unannotated image.
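
If the markup step is restored on the locator side instead, the guard could look roughly like the sketch below. The field names on context are hypothetical stand-ins for whatever AiLocateElement already has in scope, and markupImageForLLM’s signature is assumed as above.

// Sketch only: keep markupImageForLLM for default (non-VL) models in
// AiLocateElement. Field names on `context` are hypothetical.
const needsOverlay = vlMode === undefined;
const imagePayload = needsOverlay
  ? await markupImageForLLM(screenshotBase64, context.tree, context.size)
  : screenshotBase64;
// Alternatively, rewrite prompt/llm-locator.ts so it stops claiming that
// non-text elements “have been highlighted” when no overlay is applied.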

@quanru force-pushed the feat/remove-gpt-4 branch from 30c347e to ff22375 on October 20, 2025, 03:01.
@quanru merged commit 2a98471 into 1.0 on October 20, 2025 (3 checks passed).
@quanru deleted the feat/remove-gpt-4 branch on October 20, 2025, 03:03.