No Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Control

Höfer, Sebastian; Raisch, Jörg; Toussaint, Marc; Brock, Oliver

dc.contributor.author	Höfer, Sebastian
dc.contributor.author	Raisch, Jörg
dc.contributor.author	Toussaint, Marc
dc.contributor.author	Brock, Oliver
dc.date.accessioned	2018-05-18T11:26:22Z
dc.date.available	2018-05-18T11:26:22Z
dc.date.issued	2018
dc.description.abstract	How should one run most effectively to catch a projectile, such as a baseball, that is flying in the air for a long period of time? The best solution to the ball catching problem has been the subject of intense scientific debate for almost 50 years. It turns out that this debate is not focused on the ball catching problem alone, but revolves around the research question of what constitutes the ingredients of intelligent decision making. Over time, two opposing views have emerged: the generalist view, which regards intelligence as the ability to solve any task without knowing goal and environment in advance, based on optimal decision making using predictive models; and the specialist view, which argues that intelligent decision making does not have to be based on predictive models or even be optimal, and advocates simple and efficient rules of thumb (heuristics) as the superior way to reach accurate decisions. We study two types of approaches to the ball catching problem, one for each view, and investigate their properties using both a theoretical analysis and a broad set of simulation experiments. Our study shows that neither of the two types of approaches can be regarded as superior in solving all relevant variants of the ball catching problem: each approach is optimal under a different realistic environmental condition. Therefore, predictive models neither guarantee nor prevent success a priori, and we further show that the key difference between the generalist and the specialist approach to ball catching is the type of input representation used to control the agent. From this finding, we conclude that the right solution to a decision making or control problem is orthogonal to the generalist and specialist approaches, and thus requires a reconciliation of the two views in favor of a representation-centric view.	en
dc.description.sponsorship	DFG, SPP 1527, Autonomes Lernen	en
dc.identifier.uri	https://depositonce.tu-berlin.de/handle/11303/7813
dc.identifier.uri	http://dx.doi.org/10.14279/depositonce-6988
dc.language.iso	en	en
dc.relation.issupplementto	https://doi.org/10.1371/journal.pone.0197803	en
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en
dc.subject.ddc	000 Informatik, Informationswissenschaft, allgemeine Werke	de
dc.subject.other	ball catching	en
dc.subject.other	gaze heuristic	en
dc.subject.other	Chapman's strategy	en
dc.subject.other	optical acceleration cancellation	en
dc.subject.other	optimal control	en
dc.subject.other	reinforcement learning	en
dc.subject.other	no free lunch	en
dc.title	No Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Control	en
dc.type	Generic Research Data	en
tub.accessrights.dnb	free	*
tub.affiliation	Fak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Robotics	de
tub.affiliation.faculty	Fak. 4 Elektrotechnik und Informatik	de
tub.affiliation.group	FG Robotics	de
tub.affiliation.institute	Inst. Technische Informatik und Mikroelektronik	de

Files

Original bundle

Name:: README.md
Size:: 1.14 KB
Format:: Unknown data format
Description:: README

Name:: e2e_baseline_cov-io_manual.zip
Size:: 2.23 MB
Format:: ZIP archive format.
Description:: Experimental evaluation of COV-IO baseline (previously trained by supervised learning)

Name:: e2e_baseline_cov-oac_manual.zip
Size:: 2.35 MB
Format:: ZIP archive format.
Description:: Experimental evaluation of COV-OAC baseline (previously trained by supervised learning)

Name:: e2e_cma.zip
Size:: 120.34 MB
Format:: ZIP archive format.
Description:: CMA-ES evaluated on different observation types

Name:: 2DExperiments_fr60.tar.gz
Size:: 2.21 GB
Format:: Unknown data format
Description:: Experiments in 2D environment with frame rate of 60Hz

Name:: 2DExperiments_fr10.tar.gz
Size:: 178.03 MB
Format:: Unknown data format
Description:: Experiments in 2D environment with frame rate of 10Hz (including MPC experiments)

Name:: AdversarialConstCOV.tar.gz
Size:: 1.56 MB
Format:: Unknown data format
Description:: Single experiment, demonstrating adversarial parameter choice for constant COV strategy (Figure 5 in paper)

Name:: 3DExperiments_fr10.tar.gz
Size:: 717.27 MB
Format:: Unknown data format
Description:: Experiments in 3D environment with frame rate of 10Hz (including MPC experiments)

Name:: 3DExperiments_fr60_part1.tar.gz
Size:: 1.76 GB
Format:: Unknown data format
Description:: Experiments in 3D environment with frame rate of 60Hz (1: COV-IO)

Name:: 3DExperiments_fr60_part2.tar.gz
Size:: 1.75 GB
Format:: Unknown data format
Description:: Experiments in 3D environment with frame rate of 60Hz (2: COV-OAC)

Name:: 3DExperiments_fr60_part3.tar.gz
Size:: 1.86 GB
Format:: Unknown data format
Description:: Experiments in 3D environment with frame rate of 60Hz (3: iLQG)

Name:: 3DExperiments_fr60_part4.tar.gz
Size:: 1.81 GB
Format:: Unknown data format
Description:: Experiments in 3D environment with frame rate of 60Hz (4: LQG)

License bundle

Name:: license.txt
Size:: 2.71 KB
Format:: Item-specific license agreed to upon submission
Description:

Collections

Research Data