文本记录English

HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIO (T-RO 2025)

3m 49s560 字数100 segmentsEnglish

完整文本记录

0:01

Autonomous quadrotors rely on accurate

0:03

state estimation to navigate safely.

0:05

Visual inertial model odometry combines

0:08

camera and IMU data with the quadrotor's

0:10

model to estimate the full vehicle

0:12

state. The accuracy of current methods

0:14

degrades in the presence of strong

0:16

disturbances or when the dynamics are

0:18

not well modeled.

0:20

This factor graph shows how a standard

0:22

tightly coupled VIO system operates. It

0:25

fuses vision factors obtained through

0:27

feature tracking and inertial factors

0:29

obtained from the IMU to estimate the

0:31

quadrotor state along with the IMU

0:33

biases.

0:35

Visual inertial models extend the

0:36

traditional VIO framework by

0:38

incorporating a motion prior based on

0:40

quadrotor dynamics.

0:43

While these state-of-the-art systems

0:44

perform well in many scenarios, their

0:46

accuracy degrades in the presence of

0:48

inaccurate vehicle models or persistent

0:51

external disturbances such as wind due

0:53

to the simplified assumptions in the

0:55

dynamics model. In our method, we

0:57

address these limitations by proposing a

0:59

hybrid dynamics model that combines a

1:01

first principles quadrotor model with a

1:03

learningbased component that captures

1:05

residual effects such as aerodynamic

1:08

drag. It models the translational and

1:10

rotational vehicle dynamics and tightly

1:12

integrates them into the VIO system with

1:14

minimal runtime overhead. In contrast to

1:17

prior work, our learn dynamics model

1:19

does not require access to the full

1:21

drone state. Only the commanded thrust

1:23

and torqus as well as the gyroscope

1:25

measurements are needed. These inputs

1:28

are processed by two temporal

1:29

convolutional networks. One predicts a

1:31

residual thrust and the other predicts a

1:34

residual torque. The predicted residuals

1:36

are then added to the measured thrust

1:38

and torque and the resulting signals are

1:40

integrated to estimate updates in

1:42

velocity, position, and relative

1:43

orientation. We validate our learned

1:46

dynamics model with a neurobam data set

1:48

which contains very fast and agile

1:50

trajectories. Our method performs

1:52

remarkably well compared to modelbased,

1:54

learning based and hybrid baselines

1:56

despite not having access to the full

1:58

quadrotor state.

2:00

In terms of trajectory estimation, our

2:02

method outperforms the visual inertial

2:03

and visual inertial model baselines on

2:05

the Blackbird data set. The Blackbird

2:08

data set contains diverse trajectories

2:09

at medium speeds. Compared to the

2:11

baselines, the largest improvements are

2:13

in the faster trajectories where the

2:14

camera motion and rapid yaw changes make

2:16

the tracking of visual features

2:18

challenging.

2:20

Next, we move to real world experiments

2:23

to demonstrate the performance of our

2:24

hybrid dynamics VIO pipeline in the

2:27

presence of disturbances. We equip a

2:29

quadrotor with a dragboard and fly it in

2:31

strong winds up to 25 km/h.

2:35

On the bottom right, the force estimate

2:37

and on the left the position estimate

2:39

from the pipeline are shown.

2:42

Our method is plotted in red and the

2:44

ground truth positions are shown in

2:45

blue.

2:50

Despite the challenging flight

2:51

conditions, our method is very accurate.

3:01

Finally, we want to demonstrate that

3:03

HDVIO2 runs efficiently on board the

3:05

quadrotor and provides real-time state

3:07

estimates for closed loop control.

3:11

The goal of this experiment is to track

3:12

a random trajectory and only use our

3:15

HDVIO2 state estimate for control. The

3:17

plot at the bottom shows the ground

3:19

truth position in blue, the Intel Real

3:21

Sense estimate in yellow, and ours in

3:24

red.

3:30

Notably, HDVIO2 outperforms the

3:32

commercial stereo-based visual inertial

3:34

slam system of the Intel Real Sense T265

3:37

as shown in the plot. Towards the end of

3:40

the 120 seconds flight, the Real Sense

3:42

has drifted 50 cm, whereas our approach

3:45

only drifts about 20 cm.

解锁更多

免费注册以访问高级功能

互动查看器

观看带有同步字幕、可调节叠加层和完整播放控制的视频。

免费注册以解锁

AI 摘要

获取由 AI 立即生成的视频内容摘要、要点和结论。

免费注册以解锁

翻译

一键将字幕翻译成 100 多种语言。以任何格式下载。

免费注册以解锁

思维导图

将字幕可视化为交互式思维导图。一目了然地了解结构。

免费注册以解锁

与字幕聊天

提出关于视频内容的问题。直接从字幕中获取由 AI 驱动的答案。

免费注册以解锁

从您的字幕中获得更多

免费注册并解锁交互式查看器、AI 摘要、翻译、思维导图等。无需信用卡。

    HDVIO2.0: Wind and Disturbance… - 完整文字记录 | YouTubeTranscript.dev