References

    • [1] S. Agarwal, N. Snavely, I. Simon, S.M. Seitz, and R. Szeliski. Building Rome in a Day. In ICCV, 2009.

      • [2] C. Arth, D. Wagner, M. Klopschitz, A. Irschara, and D. Schmalstieg. Wide area localization on mobile phones. In ISMAR, 2009.

      • [3] M. Aubry, B. C. Russell, and J. Sivic. Painting-to-3d model alignment via discriminative visual elements. ACM Transactions on Graphics (TOG), 33(2):14, 2014.

      • [4] Y. Avrithis, Y. Kalantidis, G. Tolias, and E. Spyrou. Retrieving Landmark and Non-Landmark Images from Community Photo Collections. In ACM Multimedia, 2010.

      • [5] M. Bujnak, Z. Kukelova, and T. Pajdla. New efficient solution to the absolute pose problem for camera with unknown focal length. In ACCV, 2010.

    • [6] S. Cao and N. Snavely. Graph-Based Discriminative Learning for Location Recognition. In CVPR, 2013.

      • [7] S. Cao and N. Snavely. Minimal Scene Descriptions from Structure from Motion Models. In CVPR, 2014.

      • [8] D.M. Chen, G. Baatz, K. Koeser, S.S. Tsai, R. Vedantham, T. Pylvanainen, K. Roimela, Xin Chen, J. Bach, M. Pollefeys, B. Girod, and R. Grzeszczuk. City-scale landmark identification on mobile devices. In CVPR, 2011.

      • [9] S. Choudhary and P. J. Narayanan. Visibility probability structure from sfm datasets and applications. In ECCV, 2012.

      • [10] Mark Cummins and Paul Newman. Highly Scalable Appearance-Only SLAM - FAB-MAP 2.0. In RSS, 2019.

      • [11] J. Delhumeau, P.-H. Gosselin, H. J ́egou, and P. P ́erez. Revisiting the VLAD image representation. In ACM Multimedia, Barcelona, Spain, October 2013.

      • [12] M. Donoser and D. Schmalstieg. Discriminative Feature-to-Point Matching in Image-Based Locallization. In CVPR, 2014.

      • [13] P. Gronat, G. Obozinski, J. Sivic, and T. Pajdla. Learning per-location classifiers for visual place recognition. In CVPR, 2013.

      • [14] R.M. Haralick, C.-N. Lee, K. Ottenberg, and M. Noelle. Review and analysis of solutions of the three point perspective pose estimation problem. IJCV, 13(3):331–356, 1994.

      • [15] R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge Univ. Press, 2nd edition, 2004.

      • [16] M. Havlena, A. Torii, and T. Pajdla. Efficient structure from motion by graph optimization. In ECCV, 2010.

      • [17] A. Irschara, C. Zach, J.-M. Frahm, and H. Bischof. From Structure-from-Motion Point Clouds to Fast Location Recognition. In CVPR, 2009.

      • [18] H. Jegou, M. Douze, and C. Schmid. On the burstiness of visual elements. In CVPR, 2009.

      • [19] H. Jegou, M. Douze, C. Schmid, and P. P ́erez. Aggregating local descriptors into a compact image representation. In CVPR, pages 3304–3311, jun 2010.

      • [20] H. Jegou, F. Perronnin, M. Douze, J. Sanchez, P. P ́erez, and C. Schmid. Aggregating local image descriptors into compact codes. PAMI, 34(9):1704–1716, September 2012.

      • [21] K. Josephson and M. Byrod. Pose estimation with radial distortion and unknown focal length. In CVPR, 2009.

      • [22] J. Knopp, J. Sivic, and T. Pajdla. Avoding Confusing Features in Place Recognition. In ECCV, 2010.

      • [23] S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR, volume 2, pages 2169–2178. IEEE, 2006.

      • [24] Y. Li, N. Snavely, D. Huttenlocher, and P. Fua. Worldwide Pose Estimation Using 3D Point Clouds. In ECCV, 2012.

      • [25] Y. Li, N. Snavely, and D. P. Huttenlocher. Location Recognition using Prioritized Feature Matching. In ECCV, 2010.

      • [26] H. Lim, S. N. Sinha, M. F. Cohen, and M. Uyttendaele. Real-Time Image-Based 6-DOF Localization in Large-Scale Environments. In CVPR, 2012.

      • [27] S. Middelberg, T. Sattler, O. Untzelmann, and L. Kobbelt. Scalable 6-DOF Localization on Mobile Devices. In ECCV, 2014.

      • [28] M. Muja and D. G. Lowe. Fast approximate nearest neighbors with automatic algorithm configuration. In VISAPP, 2009.

      • [29] D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In CVPR, 2006.

      • [30] K. Nozawa, A. Torii, and M. Okutomi. Stable two view reconstruction using the six-point algorithm. In ACCV, 2012.

      • [31] J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object Retrieval with Large Vocabularies and Fast Spatial Matching. In CVPR, 2007.

    • [32] T. Quack, B. Leibe, and L. Van Gool. World-Scale Mining of Objects and Events from Community Photo Collections. In CIVR, 2008.

      • [33] T. Sattler, B. Leibe, and L. Kobbelt. SCRAMSAC: Improving RANSAC’s Efficiency with a Spatial Consistency Filter. In ICCV, 2009.

      • [34] T. Sattler, B. Leibe, and L. Kobbelt. Fast Image-Based Localization using Direct 2D-to-3D Matching. In ICCV, 2011.

      • [35] T. Sattler, B. Leibe, and L. Kobbelt. Improving Image-Based Localization by Active Correspondence Search. In ECCV, 2012.

      • [36] T. Sattler, B. Leibe, and L. Kobbelt. Towards Fast Image-Based Localization on a City-Scale. In Outdoor and Large-Scale Real-World Scene Analysis, volume 7474 of Lecture Notes in Computer Science, pages 191–211. Springer Berlin Heidelberg, 2012.

      • [37] T. Sattler, C. Sweeney, and M. Pollefeys. On Sampling Focal Length Values to Solve the Absolute Pose Problem. In ECCV, 2014.

      • [38] T. Sattler, T. Weyand, B. Leibe, and L. Kobbelt. Image Retrieval for Image-Based Localization Revisited. In BMVC, 2012.

      • [39] G. Schindler, M. Brown, and R. Szeliski. City-Scale Location Recognition. In CVPR, 2007.

      • [40] D. Sibbing, T. Sattler, B. Leibe, and L. Kobbelt. SIFT-Realistic Rendering. In 3DV, 2013.

      • [41] J. Sivic and A. Zisserman. Video Google: A Text Retrieval Approach to Object Matching in Videos. In ICCV, 2003.

      • [42] N. Snavely, S. M. Seitz, and R. Szeliski. Modeling the world from internet photo collections. IJCV, 80(2):189–210, 2008.

      • [43] L. Svarm, O. Enqvist, M. Oskarsson, and F. Kahl. Accurate Localization and Pose Estimation for Large 3D Models. In CVPR, 2014.

      • [44] A. Torii, M. Havlena, and T. Pajdla. From Google street view to 3D city models. In OMNIVIS, 2009.

      • [45] A. Torii, M. Havlena, T. Pajdla, and B. Leibe. Measuring camera translation by the dominant apical angle. In CVPR, 2008.

      • [46] A. Torii, Z. Kukelova, M. Bujnak, and T. Pajdla. The six point algorithm revisited. In CVVT:E2M, 2010.

      • [47] A. Torii and T. Pajdla. Omnidirectional camera motion estimation. In VISAPP, 2008.

      • [48] A. Torii, J. Sivic, and T. Pajdla. Visual Localization by Linear Combination of Image Descriptors. In MVW, 2011.

      • [49] A. Torii, J. Sivic, T. Pajdla, and M. Okutomi. Visual Place Recognition with Repetitive Structures. In CVPR, 2013.

      • [50] O. Untzelmann, T. Sattler, S. Middelberg, and L. Kobbelt. A Scalable Collaborative Online System for City Reconstruction. In International Conference on Computer Vision (ICCV) Workshops, 2013.

      • [51] K. E. A. van de Sande, C. G. M. Snoek, and A. W. M. Smeulders. Fisher and vlad with flair. In CVPR, June 2014.

      • [52] A. R. Zamir and M. Shah. Accurate Image Localization Based on Google Maps Street View. In ECCV, 2010.

      • [53] W. Zhang and J. Kosecka. Image based localization in urban environments. In 3DPVT, 2006.