TRANSCRIPTIONEnglish

HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIO (T-RO 2025)

3m 49s560 mots100 segmentsEnglish

TRANSCRIPTION COMPLÈTE

0:01

Autonomous quadrotors rely on accurate

0:03

state estimation to navigate safely.

0:05

Visual inertial model odometry combines

0:08

camera and IMU data with the quadrotor's

0:10

model to estimate the full vehicle

0:12

state. The accuracy of current methods

0:14

degrades in the presence of strong

0:16

disturbances or when the dynamics are

0:18

not well modeled.

0:20

This factor graph shows how a standard

0:22

tightly coupled VIO system operates. It

0:25

fuses vision factors obtained through

0:27

feature tracking and inertial factors

0:29

obtained from the IMU to estimate the

0:31

quadrotor state along with the IMU

0:33

biases.

0:35

Visual inertial models extend the

0:36

traditional VIO framework by

0:38

incorporating a motion prior based on

0:40

quadrotor dynamics.

0:43

While these state-of-the-art systems

0:44

perform well in many scenarios, their

0:46

accuracy degrades in the presence of

0:48

inaccurate vehicle models or persistent

0:51

external disturbances such as wind due

0:53

to the simplified assumptions in the

0:55

dynamics model. In our method, we

0:57

address these limitations by proposing a

0:59

hybrid dynamics model that combines a

1:01

first principles quadrotor model with a

1:03

learningbased component that captures

1:05

residual effects such as aerodynamic

1:08

drag. It models the translational and

1:10

rotational vehicle dynamics and tightly

1:12

integrates them into the VIO system with

1:14

minimal runtime overhead. In contrast to

1:17

prior work, our learn dynamics model

1:19

does not require access to the full

1:21

drone state. Only the commanded thrust

1:23

and torqus as well as the gyroscope

1:25

measurements are needed. These inputs

1:28

are processed by two temporal

1:29

convolutional networks. One predicts a

1:31

residual thrust and the other predicts a

1:34

residual torque. The predicted residuals

1:36

are then added to the measured thrust

1:38

and torque and the resulting signals are

1:40

integrated to estimate updates in

1:42

velocity, position, and relative

1:43

orientation. We validate our learned

1:46

dynamics model with a neurobam data set

1:48

which contains very fast and agile

1:50

trajectories. Our method performs

1:52

remarkably well compared to modelbased,

1:54

learning based and hybrid baselines

1:56

despite not having access to the full

1:58

quadrotor state.

2:00

In terms of trajectory estimation, our

2:02

method outperforms the visual inertial

2:03

and visual inertial model baselines on

2:05

the Blackbird data set. The Blackbird

2:08

data set contains diverse trajectories

2:09

at medium speeds. Compared to the

2:11

baselines, the largest improvements are

2:13

in the faster trajectories where the

2:14

camera motion and rapid yaw changes make

2:16

the tracking of visual features

2:18

challenging.

2:20

Next, we move to real world experiments

2:23

to demonstrate the performance of our

2:24

hybrid dynamics VIO pipeline in the

2:27

presence of disturbances. We equip a

2:29

quadrotor with a dragboard and fly it in

2:31

strong winds up to 25 km/h.

2:35

On the bottom right, the force estimate

2:37

and on the left the position estimate

2:39

from the pipeline are shown.

2:42

Our method is plotted in red and the

2:44

ground truth positions are shown in

2:45

blue.

2:50

Despite the challenging flight

2:51

conditions, our method is very accurate.

3:01

Finally, we want to demonstrate that

3:03

HDVIO2 runs efficiently on board the

3:05

quadrotor and provides real-time state

3:07

estimates for closed loop control.

3:11

The goal of this experiment is to track

3:12

a random trajectory and only use our

3:15

HDVIO2 state estimate for control. The

3:17

plot at the bottom shows the ground

3:19

truth position in blue, the Intel Real

3:21

Sense estimate in yellow, and ours in

3:24

red.

3:30

Notably, HDVIO2 outperforms the

3:32

commercial stereo-based visual inertial

3:34

slam system of the Intel Real Sense T265

3:37

as shown in the plot. Towards the end of

3:40

the 120 seconds flight, the Real Sense

3:42

has drifted 50 cm, whereas our approach

3:45

only drifts about 20 cm.

DÉBLOQUER PLUS

Inscrivez-vous gratuitement pour accéder aux fonctionnalités premium

VISUALISEUR INTERACTIF

Regardez la vidéo avec des sous-titres synchronisés, une superposition réglable et un contrôle total de la lecture.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

RÉSUMÉ IA

Obtenez un résumé instantané généré par l'IA du contenu de la vidéo, des points clés et des principaux enseignements.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

TRADUIRE

Traduisez la transcription dans plus de 100 langues en un seul clic. Téléchargez dans n'importe quel format.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

CARTE MENTALE

Visualisez la transcription sous forme de carte mentale interactive. Comprenez la structure en un coup d'œil.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

DISCUTER AVEC LA TRANSCRIPTION

Posez des questions sur le contenu de la vidéo. Obtenez des réponses alimentées par l'IA directement à partir de la transcription.

INSCRIVEZ-VOUS GRATUITEMENT POUR DÉBLOQUER

TIREZ LE MEILLEUR PARTI DE VOS TRANSCRIPTIONS

Inscrivez-vous gratuitement et débloquez la visionneuse interactive, les résumés IA, les traductions, les cartes mentales, et plus encore. Aucune carte de crédit requise.

ESSAYEZ YOUTUBETRANSCRIPT.DEV COMMENCER GRATUITEMENT

HDVIO2.0: Wind… - Transcription Complète | YouTubeTranscript.dev