TRANSKRIPTEnglish

HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIO (T-RO 2025)

3m 49s560 ord100 segmentsEnglish

FULLSTÄNDIGT TRANSKRIPT

0:01

Autonomous quadrotors rely on accurate

0:03

state estimation to navigate safely.

0:05

Visual inertial model odometry combines

0:08

camera and IMU data with the quadrotor's

0:10

model to estimate the full vehicle

0:12

state. The accuracy of current methods

0:14

degrades in the presence of strong

0:16

disturbances or when the dynamics are

0:18

not well modeled.

0:20

This factor graph shows how a standard

0:22

tightly coupled VIO system operates. It

0:25

fuses vision factors obtained through

0:27

feature tracking and inertial factors

0:29

obtained from the IMU to estimate the

0:31

quadrotor state along with the IMU

0:33

biases.

0:35

Visual inertial models extend the

0:36

traditional VIO framework by

0:38

incorporating a motion prior based on

0:40

quadrotor dynamics.

0:43

While these state-of-the-art systems

0:44

perform well in many scenarios, their

0:46

accuracy degrades in the presence of

0:48

inaccurate vehicle models or persistent

0:51

external disturbances such as wind due

0:53

to the simplified assumptions in the

0:55

dynamics model. In our method, we

0:57

address these limitations by proposing a

0:59

hybrid dynamics model that combines a

1:01

first principles quadrotor model with a

1:03

learningbased component that captures

1:05

residual effects such as aerodynamic

1:08

drag. It models the translational and

1:10

rotational vehicle dynamics and tightly

1:12

integrates them into the VIO system with

1:14

minimal runtime overhead. In contrast to

1:17

prior work, our learn dynamics model

1:19

does not require access to the full

1:21

drone state. Only the commanded thrust

1:23

and torqus as well as the gyroscope

1:25

measurements are needed. These inputs

1:28

are processed by two temporal

1:29

convolutional networks. One predicts a

1:31

residual thrust and the other predicts a

1:34

residual torque. The predicted residuals

1:36

are then added to the measured thrust

1:38

and torque and the resulting signals are

1:40

integrated to estimate updates in

1:42

velocity, position, and relative

1:43

orientation. We validate our learned

1:46

dynamics model with a neurobam data set

1:48

which contains very fast and agile

1:50

trajectories. Our method performs

1:52

remarkably well compared to modelbased,

1:54

learning based and hybrid baselines

1:56

despite not having access to the full

1:58

quadrotor state.

2:00

In terms of trajectory estimation, our

2:02

method outperforms the visual inertial

2:03

and visual inertial model baselines on

2:05

the Blackbird data set. The Blackbird

2:08

data set contains diverse trajectories

2:09

at medium speeds. Compared to the

2:11

baselines, the largest improvements are

2:13

in the faster trajectories where the

2:14

camera motion and rapid yaw changes make

2:16

the tracking of visual features

2:18

challenging.

2:20

Next, we move to real world experiments

2:23

to demonstrate the performance of our

2:24

hybrid dynamics VIO pipeline in the

2:27

presence of disturbances. We equip a

2:29

quadrotor with a dragboard and fly it in

2:31

strong winds up to 25 km/h.

2:35

On the bottom right, the force estimate

2:37

and on the left the position estimate

2:39

from the pipeline are shown.

2:42

Our method is plotted in red and the

2:44

ground truth positions are shown in

2:45

blue.

2:50

Despite the challenging flight

2:51

conditions, our method is very accurate.

3:01

Finally, we want to demonstrate that

3:03

HDVIO2 runs efficiently on board the

3:05

quadrotor and provides real-time state

3:07

estimates for closed loop control.

3:11

The goal of this experiment is to track

3:12

a random trajectory and only use our

3:15

HDVIO2 state estimate for control. The

3:17

plot at the bottom shows the ground

3:19

truth position in blue, the Intel Real

3:21

Sense estimate in yellow, and ours in

3:24

red.

3:30

Notably, HDVIO2 outperforms the

3:32

commercial stereo-based visual inertial

3:34

slam system of the Intel Real Sense T265

3:37

as shown in the plot. Towards the end of

3:40

the 120 seconds flight, the Real Sense

3:42

has drifted 50 cm, whereas our approach

3:45

only drifts about 20 cm.

LÅS UPP MER

Registrera dig gratis för att få tillgång till premiumfunktioner

INTERAKTIV VISARE

Titta på videon med synkroniserad undertext, justerbart överlägg och fullständig uppspelningskontroll.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

AI-SAMMANFATTNING

Få en omedelbar AI-genererad sammanfattning av videoinnehållet, nyckelpunkter och slutsatser.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

ÖVERSÄTT

Översätt transkriptet till över 100 språk med ett klick. Ladda ner i valfritt format.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

MIND MAP

Visualisera transkriptet som en interaktiv mind map. Förstå strukturen med ett ögonkast.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

CHATTA MED TRANSKRIPT

Ställ frågor om videoinnehållet. Få svar från AI direkt från transkriptet.

REGISTRERA DIG GRATIS FÖR ATT LÅSA UPP

FÅ UT MER AV DINA TRANSKRIPT

Registrera dig gratis och lås upp interaktiv visning, AI-sammanfattningar, översättningar, mind maps och mer. Inget kreditkort krävs.

    HDVIO2.0: Wind… - Fullständigt Transkript | YouTubeTranscript.dev