Authors: Yi Yang*, Xiaoxuan He*, Hongkun Pan*, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Minfeng Zhu, Bo Zhang, Wei Chen

First Published on: 2025/02/12

GitHub: https://github.com/Fancy-MLLM/R1-onevision

Dataset: https://huggingface.co/datasets/Fancy-MLLM/R1-OneVision

Benchmark: https://huggingface.co/datasets/Fancy-MLLM/R1-OneVision-Bench

Model: https://huggingface.co/Fancy-MLLM/R1-OneVision-7B

Space: https://huggingface.co/spaces/Fancy-MLLM/R1-OneVision

Introduction

R1-OneVision is a versatile multimodal reasoning large model, designed to tackle complex visual reasoning tasks. It seamlessly integrates visual and textual data to offer precise interpretations of multimodal information, excelling in areas such as mathematics, science, deep image understanding, and logical reasoning. With its robust ability to perform multimodal reasoning, R1-OneVision emerges as a powerful AI assistant capable of addressing a wide range of problem-solving challenges across different domains.

Demo1

img_v3_02je_16e44c0c-1334-4722-bd06-1554b819ad6g.jpg

Question:As shown in the figure, CD is a plane mirror, the light is emitted from point A, reflected by point E on CD, and irradiated to point B, If the incident angle is a, AC L CD. BD .I CD. the feet of perpendicular are C. D. and AC = 3.0, BD= 6.0, CD = 10.0, then the length of the line segment ED is ()

Reasoning Process:

Since the angles of incidence and reflection are equal, the triangles ACE and BDE are similar by AA similarity (both areright triangles and share the same angle at E). Given the similarity of triangles ACE and BDE, the ratios of corresponding sides are equal. Therefore, we can write:

\[\frac{AC}{BD}= frac{CE}{DE}\] Substituting the known values: \[ \fracf{3.0}{6.0}= frac{CE}{DE}\] Simplifying the ratio: [ \fracf1H2]= \frac{CEHDE] This implies: \[ CE =\frac{1}{2} DE\] Since CD = 10.0 units and CD is the sum of CE and ED, we can write:

\[CE+DE=10.0\] Substituting\( CE = \frac{1}{2} DE \] into the equation: \[\frac{1}{2} DE+ DE = 10.0\] Combining like terms: \[ \frac{3}{2} DE= 10.0 \] Solving for DE: \[ DE = \frac{10.0 \times 2}{3} = \frac{20}{3}\] Therefore, the length of ED is \( \frac{20}3\) units.

Final Answer:The length of the line segment ED is \( \boxed{\frac{20}{3}\) units.

Demo2

img_v3_02je_eefda53e-dd67-491c-bb2d-9c6010c81d6g.jpg

Question:As shown in the figure, in ΔABC, the points D, E, F are known to be the midpoints of the sides BC, AD, CE and SΔABC = 4cm^2, then SΔDEF is equal to ()

Reasoning Process:

First, let's consider the properties of midpoints in triangles. As seen in the image, D is the midpoint of BC, so AD is a median. This means that AD divides triangle ABC into two smaller triangles, ABD and ADC, each with half the area of ABC. Therefore, each of these smaller triangles has an area of 2 cm².

Next, E is the midpoint of AD. As seen in the image, connecting E to C forms triangle AEC. Since E is the midpoint of AD, triangle AEC is half the area of triangle ADC. Thus, the area of triangle AEC is 1 cm².

Now, F is the midpoint of CE. As seen in the image, connecting F to D forms triangle DEF. Since F is the midpoint of CE, triangle DEF is half the area of triangle AEC. Therefore, the area of triangle DEF is 0.5 cm².

To verify, let's consider the areas step-by-step:

  1. The area of triangle ABC is 4 cm².
  2. Since D is the midpoint of BC, the area of triangle ABD is 2 cm² and the area of triangle ADC is 2 cm².
  3. Since E is the midpoint of AD, the area of triangle AEC is 1 cm².
  4. Since F is the midpoint of CE, the area of triangle DEF is 0.5 cm².

Thus, the area of triangle DEF is 0.5 cm².

Final Answer:The area of triangle DEF is \boxed{0.5} cm²

Demo3

img_v3_02j9_1f9ea62c-4a91-4393-972e-8b56c09b261g.jpg

Hint: Please answer the question and provide the correct option letter, e.g., A, B, C, D, at the end. Question: Theresa moves a pencil along the line. She starts at the arrow shown. In which order will she go past the shapes? Choices: (A) $\Delta, \square, \bullet$ (B) $\Delta, \bullet, \square$ (C) $\bullet, \Delta, \square$ (D) $\square, \Delta, \bullet$ (E) $\square, \bullet, \Delta$

Reasoning Process:

First, I need to determine the path of the pencil. As seen in the image, the line starts at the arrow and goes through the shapes in a specific sequence. The image shows that the line starts at the arrow, then goes through the triangle, followed by the circle, and ends at the square.

So, the order of the shapes as the pencil moves along the line is triangle, circle, square. Therefore, the correct order is triangle, circle, square.

Looking at the options, option A is Δ, □, •, which corresponds to triangle, square, circle. However, the image shows that the line starts at the arrow, then goes through the triangle, followed by the circle, and ends at the square. So the correct order is triangle, circle, square.

Therefore, the correct answer is A.

The pencil starts at the arrow, moves through the triangle, then the circle, and finally the square. Thus, the order is triangle, circle, square.

Final Answer:A

R1-OneVision Dataset

The dataset used to train R1-OneVision consists of LLaVA-OneVision and a variety of specialized datasets that cater to visual reasoning tasks. The dataset processing involves several key steps: