No Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Control

dc.contributor.authorHöfer, Sebastian
dc.contributor.authorRaisch, Jörg
dc.contributor.authorToussaint, Marc
dc.contributor.authorBrock, Oliver
dc.date.accessioned2018-05-18T11:26:22Z
dc.date.available2018-05-18T11:26:22Z
dc.date.issued2018
dc.description.abstractHow to run most effectively to catch a projectile, such as a baseball, that is flying in the air for a long period of time? The question about the best solution to the ball catching problem has been subject to intense scientific debate for almost 50 years. It turns out that this scientific debate is not focused on the ball catching problem alone, but revolves around the research question what constitutes the ingredients of intelligent decision making. Over time, two opposing views have emerged: the generalist view regarding intelligence as the ability to solve any task without knowing goal and environment in advance, based on optimal decision making using predictive models; and the specialist view which argues that intelligent decision making does not have to be based on predictive models and not even optimal, advocating simple and efficient rules of thumb (heuristics) as superior to enable accurate decisions. We study two types of approaches to the ball catching problem, one for each view, and investigate their properties using both a theoretical analysis and a broad set of simulation experiments. Our study shows that neither of the two types of approaches can be regarded as superior in solving all relevant variants of the ball catching problem: each approach is optimal under a different realistic environmental condition. Therefore, predictive models neither guarantee nor prevent success a priori, and we further show that the key difference between the generalist and the specialist approach to ball catching is the type of input representation used to control the agent. From this finding, we conclude that the right solution to a decision making or control problem is orthogonal to the generalist and specialist approach, and thus requires a reconciliation of the two views in favor of a representation-centric view.en
dc.description.sponsorshipDFG, SPP 1527, Autonomes Lernenen
dc.identifier.urihttps://depositonce.tu-berlin.de/handle/11303/7813
dc.identifier.urihttp://dx.doi.org/10.14279/depositonce-6988
dc.language.isoenen
dc.relation.issupplementtohttps://doi.org/10.1371/journal.pone.0197803en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subject.ddc000 Informatik, Informationswissenschaft, allgemeine Werkede
dc.subject.otherball catchingen
dc.subject.othergaze heuristicen
dc.subject.otherChapman's strategyen
dc.subject.otheroptical acceleration cancellationen
dc.subject.otheroptimal controlen
dc.subject.otherreinforcement learningen
dc.subject.otherno free lunchen
dc.titleNo Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Controlen
dc.typeGeneric Research Dataen
tub.accessrights.dnbfree*
tub.affiliationFak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Roboticsde
tub.affiliation.facultyFak. 4 Elektrotechnik und Informatikde
tub.affiliation.groupFG Roboticsde
tub.affiliation.instituteInst. Technische Informatik und Mikroelektronikde

Files

Original bundle
Now showing 1 - 12 of 12
No Thumbnail Available
Name:
README.md
Size:
1.14 KB
Format:
Unknown data format
Description:
README
No Thumbnail Available
Name:
e2e_baseline_cov-io_manual.zip
Size:
2.23 MB
Format:
ZIP archive format.
Description:
Experimental evaluation of COV-IO baseline (previously trained by supervised learning)
No Thumbnail Available
Name:
e2e_baseline_cov-oac_manual.zip
Size:
2.35 MB
Format:
ZIP archive format.
Description:
Experimental evaluation of COV-OAC baseline (previously trained by supervised learning)
No Thumbnail Available
Name:
e2e_cma.zip
Size:
120.34 MB
Format:
ZIP archive format.
Description:
CMA-ES evaluated to different observation types
No Thumbnail Available
Name:
2DExperiments_fr60.tar.gz
Size:
2.21 GB
Format:
Unknown data format
Description:
Experiments in 2D environment with frame rate of 60Hz
No Thumbnail Available
Name:
2DExperiments_fr10.tar.gz
Size:
178.03 MB
Format:
Unknown data format
Description:
Experiments in 2D environment with frame rate of 10Hz (including MPC experiments)
No Thumbnail Available
Name:
AdversarialConstCOV.tar.gz
Size:
1.56 MB
Format:
Unknown data format
Description:
Single experiment, demonstrating adversarial parameter choice for constant COV strategy (Figure 5 in paper)
No Thumbnail Available
Name:
3DExperiments_fr10.tar.gz
Size:
717.27 MB
Format:
Unknown data format
Description:
Experiments in 3D environment with frame rate of 10Hz (including MPC experiments)
No Thumbnail Available
Name:
3DExperiments_fr60_part1.tar.gz
Size:
1.76 GB
Format:
Unknown data format
Description:
Experiments in 3D environment with frame rate of 60Hz (1: COV-IO)
No Thumbnail Available
Name:
3DExperiments_fr60_part2.tar.gz
Size:
1.75 GB
Format:
Unknown data format
Description:
Experiments in 3D environment with frame rate of 60Hz (2: COV-OAC)
No Thumbnail Available
Name:
3DExperiments_fr60_part3.tar.gz
Size:
1.86 GB
Format:
Unknown data format
Description:
Experiments in 3D environment with frame rate of 60Hz (3: iLQG)
No Thumbnail Available
Name:
3DExperiments_fr60_part4.tar.gz
Size:
1.81 GB
Format:
Unknown data format
Description:
Experiments in 3D environment with frame rate of 60Hz (4: LQG)
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections