Dustin Lange
Dustin Lange
Verified email at amazon.com
Title
Cited by
Cited by
Year
Automating large-scale data quality verification
S Schelter, D Lange, P Schmidt, M Celikel, F Biessmann, A Grafberger
Proceedings of the VLDB Endowment 11 (12), 1781-1794, 2018
972018
Extracting structured information from Wikipedia articles to populate infoboxes
D Lange, C Böhm, F Naumann
Proceedings of the 19th ACM international conference on Information and …, 2010
732010
Probabilistic demand forecasting at scale
JH Böse, V Flunkert, J Gasthaus, T Januschowski, D Lange, D Salinas, ...
Proceedings of the VLDB Endowment 10 (12), 1694-1705, 2017
692017
Cross-lingual entity matching and infobox alignment in Wikipedia
D Rinser, D Lange, F Naumann
Information Systems 38 (6), 887-907, 2013
592013
" Deep" Learning for Missing Value Imputationin Tables with Non-Numerical Data
F Biessmann, D Salinas, S Schelter, P Schmidt, D Lange
Proceedings of the 27th ACM International Conference on Information and …, 2018
392018
DataWig: Missing Value Imputation for Tables.
F Biessmann, T Rukat, P Schmidt, P Naidu, S Schelter, A Taptunov, ...
J. Mach. Learn. Res. 20, 175:1-175:6, 2019
262019
Efficient similarity search in very large string sets
D Fenz, D Lange, A Rheinländer, F Naumann, U Leser
International Conference on Scientific and Statistical Database Management …, 2012
202012
Efficient Similarity Search: Arbitrary Similarity Measures, Arbitrary Composition
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
182011
Frequency-aware similarity measures: why Arnold Schwarzenegger is always a duplicate
D Lange, F Naumann
Proceedings of the 20th ACM international conference on Information and …, 2011
152011
Reach for gold: An annealing standard to evaluate duplicate detection results
T Vogel, A Heise, U Draisbach, D Lange, F Naumann
Journal of Data and Information Quality (JDIQ) 5 (1-2), 1-25, 2014
132014
Differential data quality verification on partitioned data
S Schelter, S Grafberger, P Schmidt, T Rukat, M Kiessling, A Taptunov, ...
2019 IEEE 35th International Conference on Data Engineering (ICDE), 1940-1945, 2019
92019
Unit testing data with deequ
S Schelter, F Biessmann, D Lange, T Rukat, P Schmidt, S Seufert, ...
Proceedings of the 2019 International Conference on Management of Data, 1993 …, 2019
82019
Cost-aware query planning for similarity search
D Lange, F Naumann
Information Systems 38 (4), 455-469, 2013
52013
An interpretable latent variable model for attribute applicability in the amazon catalogue
T Rukat, D Lange, C Archambeau
arXiv preprint arXiv:1712.00126, 2017
32017
Towards Automated Data Quality Management for Machine Learning
T Rukat, D Lange, S Schelter, F Biessmann
ML Ops workshop at the Conference on ML and Systems (MLSys), 2020
22020
Towards Automated ML Model Monitoring: Measure, Improve and Quantify Data Quality
T Rukat, D Lange, S Schelter, F Biessmann
ML Ops workshop at MLSys, 2019
22019
Deequ-data quality validation for machine learning pipelines
S Schelter, S Grafberger, P Schmidt, T Rukat, M Kiessling, A Taptunov, ...
Machine Learning Systems workshop at the conference on Neural Information …, 2018
22018
Scientific and Statistical Database Management
D Fenz, D Lange, A Rheinländer, F Naumann, U Leser, A Ailamaki, ...
Lecture Notes in Computer Science 7338, 2012
22012
Projektseminar „Similarity Search Algorithms “
D Lange, T Vogel, U Draisbach, F Naumann
Datenbank-Spektrum, 1-7, 2011
22011
Automated Data Validation in Machine Learning Systems
F Biessmann, J Golebiowski, T Rukat, D Lange, P Schmidt
Bulletin of the IEEE Computer Society Technical Committee on Data …, 2021
12021
The system can't perform the operation now. Try again later.
Articles 1–20