ABSCHRIFTEnglish

HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIO (T-RO 2025)

3m 49s560 Wörter100 segmentsEnglish

VOLLSTÄNDIGE ABSCHRIFT

0:01

Autonomous quadrotors rely on accurate

0:03

state estimation to navigate safely.

0:05

Visual inertial model odometry combines

0:08

camera and IMU data with the quadrotor's

0:10

model to estimate the full vehicle

0:12

state. The accuracy of current methods

0:14

degrades in the presence of strong

0:16

disturbances or when the dynamics are

0:18

not well modeled.

0:20

This factor graph shows how a standard

0:22

tightly coupled VIO system operates. It

0:25

fuses vision factors obtained through

0:27

feature tracking and inertial factors

0:29

obtained from the IMU to estimate the

0:31

quadrotor state along with the IMU

0:33

biases.

0:35

Visual inertial models extend the

0:36

traditional VIO framework by

0:38

incorporating a motion prior based on

0:40

quadrotor dynamics.

0:43

While these state-of-the-art systems

0:44

perform well in many scenarios, their

0:46

accuracy degrades in the presence of

0:48

inaccurate vehicle models or persistent

0:51

external disturbances such as wind due

0:53

to the simplified assumptions in the

0:55

dynamics model. In our method, we

0:57

address these limitations by proposing a

0:59

hybrid dynamics model that combines a

1:01

first principles quadrotor model with a

1:03

learningbased component that captures

1:05

residual effects such as aerodynamic

1:08

drag. It models the translational and

1:10

rotational vehicle dynamics and tightly

1:12

integrates them into the VIO system with

1:14

minimal runtime overhead. In contrast to

1:17

prior work, our learn dynamics model

1:19

does not require access to the full

1:21

drone state. Only the commanded thrust

1:23

and torqus as well as the gyroscope

1:25

measurements are needed. These inputs

1:28

are processed by two temporal

1:29

convolutional networks. One predicts a

1:31

residual thrust and the other predicts a

1:34

residual torque. The predicted residuals

1:36

are then added to the measured thrust

1:38

and torque and the resulting signals are

1:40

integrated to estimate updates in

1:42

velocity, position, and relative

1:43

orientation. We validate our learned

1:46

dynamics model with a neurobam data set

1:48

which contains very fast and agile

1:50

trajectories. Our method performs

1:52

remarkably well compared to modelbased,

1:54

learning based and hybrid baselines

1:56

despite not having access to the full

1:58

quadrotor state.

2:00

In terms of trajectory estimation, our

2:02

method outperforms the visual inertial

2:03

and visual inertial model baselines on

2:05

the Blackbird data set. The Blackbird

2:08

data set contains diverse trajectories

2:09

at medium speeds. Compared to the

2:11

baselines, the largest improvements are

2:13

in the faster trajectories where the

2:14

camera motion and rapid yaw changes make

2:16

the tracking of visual features

2:18

challenging.

2:20

Next, we move to real world experiments

2:23

to demonstrate the performance of our

2:24

hybrid dynamics VIO pipeline in the

2:27

presence of disturbances. We equip a

2:29

quadrotor with a dragboard and fly it in

2:31

strong winds up to 25 km/h.

2:35

On the bottom right, the force estimate

2:37

and on the left the position estimate

2:39

from the pipeline are shown.

2:42

Our method is plotted in red and the

2:44

ground truth positions are shown in

2:45

blue.

2:50

Despite the challenging flight

2:51

conditions, our method is very accurate.

3:01

Finally, we want to demonstrate that

3:03

HDVIO2 runs efficiently on board the

3:05

quadrotor and provides real-time state

3:07

estimates for closed loop control.

3:11

The goal of this experiment is to track

3:12

a random trajectory and only use our

3:15

HDVIO2 state estimate for control. The

3:17

plot at the bottom shows the ground

3:19

truth position in blue, the Intel Real

3:21

Sense estimate in yellow, and ours in

3:24

red.

3:30

Notably, HDVIO2 outperforms the

3:32

commercial stereo-based visual inertial

3:34

slam system of the Intel Real Sense T265

3:37

as shown in the plot. Towards the end of

3:40

the 120 seconds flight, the Real Sense

3:42

has drifted 50 cm, whereas our approach

3:45

only drifts about 20 cm.

MEHR FREISCHALTEN

Melden Sie sich kostenlos an, um Premium-Funktionen zu nutzen

INTERAKTIVER VIEWER

Sehen Sie sich das Video mit synchronisierten Untertiteln, anpassbarer Überlagerung und voller Wiedergabesteuerung an.

KOSTENLOS ANMELDEN ZUM FREISCHALTEN

KI-ZUSAMMENFASSUNG

Erhalten Sie eine sofortige KI-generierte Zusammenfassung des Videoinhalts, der wichtigsten Punkte und Erkenntnisse.

KOSTENLOS ANMELDEN ZUM FREISCHALTEN

ÜBERSETZEN

Übersetzen Sie das Transkript mit einem Klick in über 100 Sprachen. Download in jedem Format.

KOSTENLOS ANMELDEN ZUM FREISCHALTEN

MIND MAP

Visualisieren Sie das Transkript als interaktive Mind Map. Verstehen Sie die Struktur auf einen Blick.

KOSTENLOS ANMELDEN ZUM FREISCHALTEN

CHAT MIT TRANSKRIPT

Stellen Sie Fragen zum Videoinhalt. Erhalten Sie Antworten von der KI direkt aus dem Transkript.

KOSTENLOS ANMELDEN ZUM FREISCHALTEN

HOLEN SIE MEHR AUS IHREN TRANSKRIPTEN HERAUS

Melden Sie sich kostenlos an und schalten Sie interaktiven Viewer, KI-Zusammenfassungen, Übersetzungen, Mind Maps und mehr frei. Keine Kreditkarte erforderlich.

YOUTUBETRANSCRIPT.DEV AUSPROBIEREN KOSTENLOS STARTEN

HDVIO2.0: Win… - Vollständiges Transkript | YouTubeTranscript.dev