Follow
Dustin Lange
Dustin Lange
Stealth mode (ex Amazon)
Verified email at sent.com
Title
Cited by
Cited by
Year
Automating large-scale data quality verification
S Schelter, D Lange, P Schmidt, M Celikel, F Biessmann, A Grafberger
Proceedings of the VLDB Endowment 11 (12), 1781-1794, 2018
1452018
Probabilistic demand forecasting at scale
JH Böse, V Flunkert, J Gasthaus, T Januschowski, D Lange, D Salinas, ...
Proceedings of the VLDB Endowment 10 (12), 1694-1705, 2017
962017
Extracting structured information from Wikipedia articles to populate infoboxes
D Lange, C Böhm, F Naumann
Proceedings of the 19th ACM international conference on Information and …, 2010
812010
" Deep" Learning for Missing Value Imputationin Tables with Non-numerical Data
F Biessmann, D Salinas, S Schelter, P Schmidt, D Lange
Proceedings of the 27th ACM international conference on information and …, 2018
642018
Cross-lingual entity matching and infobox alignment in Wikipedia
D Rinser, D Lange, F Naumann
Information Systems 38 (6), 887-907, 2013
612013
DataWig: Missing Value Imputation for Tables.
F Biessmann, T Rukat, P Schmidt, P Naidu, S Schelter, A Taptunov, ...
J. Mach. Learn. Res. 20 (175), 1-6, 2019
562019
Efficient similarity search in very large string sets
D Fenz, D Lange, A Rheinländer, F Naumann, U Leser
International Conference on Scientific and Statistical Database Management …, 2012
242012
Efficient Similarity Search: Arbitrary Similarity Measures, Arbitrary Composition
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
182011
Reach for gold: An annealing standard to evaluate duplicate detection results
T Vogel, A Heise, U Draisbach, D Lange, F Naumann
Journal of Data and Information Quality (JDIQ) 5 (1-2), 1-25, 2014
162014
Frequency-aware similarity measures: why Arnold Schwarzenegger is always a duplicate
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
152011
Differential data quality verification on partitioned data
S Schelter, S Grafberger, P Schmidt, T Rukat, M Kiessling, A Taptunov, ...
2019 IEEE 35th International Conference on Data Engineering (ICDE), 1940-1945, 2019
112019
Unit testing data with deequ
S Schelter, F Biessmann, D Lange, T Rukat, P Schmidt, S Seufert, ...
Proceedings of the 2019 International Conference on Management of Data, 1993 …, 2019
92019
Automated Data Validation in Machine Learning Systems.
F Biessmann, J Golebiowski, T Rukat, D Lange, P Schmidt
IEEE Data Eng. Bull. 44 (1), 51-65, 2021
82021
Towards automated ml model monitoring: Measure, improve and quantify data quality
T Rukat, D Lange, S Schelter, F Biessmann
ML Ops workshop at MLSys, 2019
72019
Towards automated data quality management for machine learning
T Rukat, D Lange, S Schelter, F Biessmann
ML Ops Work. Conf. Mach. Learn. Syst, 1-3, 2020
62020
Deequ-data quality validation for machine learning pipelines
S Schelter, S Grafberger, P Schmidt, T Rukat, M Kiessling, A Taptunov, ...
Machine Learning Systems Workshop at the Conference on Neural Information …, 2018
62018
Cost-aware query planning for similarity search
D Lange, F Naumann
Information Systems 38 (4), 455-469, 2013
52013
An interpretable latent variable model for attribute applicability in the amazon catalogue
T Rukat, D Lange, C Archambeau
arXiv preprint arXiv:1712.00126, 2017
32017
Projektseminar „Similarity Search Algorithms “
D Lange, T Vogel, U Draisbach, F Naumann
Datenbank-Spektrum, 1-7, 2011
22011
Effective and efficient similarity search in databases
D Lange
Universität Potsdam, 2013
12013
The system can't perform the operation now. Try again later.
Articles 1–20