Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms |
Chris Thornton, Frank Hutter, Holger H. Hoos, Kevin LeytonBrown |
|
|
|
code |
623 |
U-Air: when urban air quality inference meets big data |
Yu Zheng, Furui Liu, HsunPing Hsieh |
|
|
|
code |
527 |
Ad click prediction: a view from the trenches |
H. Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, Sharat Chikkerur, Dan Liu, Martin Wattenberg, Arnar Mar Hrafnkelsson, Tom Boulos, Jeremy Kubica |
|
|
|
code |
471 |
FISM: factored item similarity models for top-N recommender systems |
Santosh Kabbur, Xia Ning, George Karypis |
|
|
|
code |
381 |
Learning geographical preferences for point-of-interest recommendation |
Bin Liu, Yanjie Fu, Zijun Yao, Hui Xiong |
|
|
|
code |
281 |
Connecting users across social media sites: a behavioral-modeling approach |
Reza Zafarani, Huan Liu |
|
|
|
code |
261 |
LCARS: a location-content-aware recommender system |
Hongzhi Yin, Yizhou Sun, Bin Cui, Zhiting Hu, Ling Chen |
|
|
|
code |
251 |
Why people hate your app: making sense of user feedback in a mobile app store |
Bin Fu, Jialiu Lin, Lei Li, Christos Faloutsos, Jason I. Hong, Norman M. Sadeh |
|
|
|
code |
246 |
Spotting opinion spammers using behavioral footprints |
Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malú Castellanos, Riddhiman Ghosh |
|
|
|
code |
212 |
Online controlled experiments at large scale |
Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, Nils Pohlmann |
|
|
|
code |
206 |
Collaborative matrix factorization with multiple similarities for predicting drug-target interactions |
Xiaodong Zheng, Hao Ding, Hiroshi Mamitsuka, Shanfeng Zhu |
|
|
|
code |
177 |
Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees |
Charalampos E. Tsourakakis, Francesco Bonchi, Aristides Gionis, Francesco Gullo, Maria A. Tsiarli |
|
|
|
code |
176 |
Geo-spotting: mining online location-based services for optimal retail store placement |
Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, Cecilia Mascolo |
|
|
|
code |
167 |
Who, where, when and what: discover spatio-temporal topics for twitter users |
Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, Nadia MagnenatThalmann |
|
|
|
code |
161 |
Accurate intelligible models with pairwise interactions |
Yin Lou, Rich Caruana, Johannes Gehrke, Giles Hooker |
|
|
|
code |
146 |
TurboGraph: a fast parallel graph engine handling billion-scale graphs in a single PC |
WookShin Han, Sangyeon Lee, Kyungyeol Park, JeongHoon Lee, MinSoo Kim, Jinha Kim, Hwanjo Yu |
|
|
|
code |
146 |
Fast and scalable polynomial kernels via explicit feature maps |
Ninh Pham, Rasmus Pagh |
|
|
|
code |
135 |
Real-time disease surveillance using Twitter data: demonstration on flu and cancer |
Kathy Lee, Ankit Agrawal, Alok N. Choudhary |
|
|
|
code |
117 |
Big data analytics for healthcare |
Jimeng Sun, Chandan K. Reddy |
|
|
|
code |
117 |
Simple and deterministic matrix sketching |
Edo Liberty |
|
|
|
code |
112 |
Combining latent factor model with location features for event-based group recommendation |
Wei Zhang, Jianyong Wang, Wei Feng |
|
|
|
code |
108 |
Subsampling for efficient and effective unsupervised outlier detection ensembles |
Arthur Zimek, Matthew Gaudet, Ricardo J. G. B. Campello, Jörg Sander |
|
|
|
code |
107 |
Discriminant malware distance learning on structural information for automated malware classification |
Deguang Kong, Guanhua Yan |
|
|
|
code |
100 |
The role of information diffusion in the evolution of social networks |
Lilian Weng, Jacob Ratkiewicz, Nicola Perra, Bruno Gonçalves, Carlos Castillo, Francesco Bonchi, Rossano Schifanella, Filippo Menczer, Alessandro Flammini |
|
|
|
code |
96 |
Privacy-preserving data exploration in genome-wide association studies |
Aaron Johnson, Vitaly Shmatikov |
|
|
|
code |
90 |
Cascading outbreak prediction in networks: a data-driven approach |
Peng Cui, Shifei Jin, Linyun Yu, Fei Wang, Wenwu Zhu, Shiqiang Yang |
|
|
|
code |
88 |
Location-aware publish/subscribe |
Guoliang Li, Yang Wang, Ting Wang, Jianhua Feng |
|
|
|
code |
85 |
Linking named entities in Tweets with knowledge base via user interest modeling |
Wei Shen, Jianyong Wang, Ping Luo, Min Wang |
|
|
|
code |
84 |
On the equivalent of low-rank linear regressions and linear discriminant analysis based regressions |
Xiao Cai, Chris H. Q. Ding, Feiping Nie, Heng Huang |
|
|
|
code |
82 |
Graph cluster randomization: network exposure to multiple universes |
Johan Ugander, Brian Karrer, Lars Backstrom, Jon M. Kleinberg |
|
|
|
code |
80 |
Social influence based clustering of heterogeneous information networks |
Yang Zhou, Ling Liu |
|
|
|
code |
75 |
Confluence: conformity influence in large social networks |
Jie Tang, Sen Wu, Jimeng Sun |
|
|
|
code |
75 |
Discovering latent influence in online social activities via shared cascade poisson processes |
Tomoharu Iwata, Amar Shah, Zoubin Ghahramani |
|
|
|
code |
74 |
Cost-sensitive online active learning with application to malicious URL detection |
Peilin Zhao, Steven C. H. Hoi |
|
|
|
code |
73 |
Restreaming graph partitioning: simple versatile algorithms for advanced balancing |
Joel Nishimura, Johan Ugander |
|
|
|
code |
70 |
A space efficient streaming algorithm for triangle counting using the birthday paradox |
Madhav Jha, C. Seshadhri, Ali Pinar |
|
|
|
code |
68 |
SIGMa: simple greedy matching for aligning large knowledge bases |
Simon LacosteJulien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, Zoubin Ghahramani |
|
|
|
code |
65 |
Modeling and probabilistic reasoning of population evacuation during large-scale disaster |
Xuan Song, Quanshi Zhang, Yoshihide Sekimoto, Teerayut Horanont, Satoshi Ueyama, Ryosuke Shibasaki |
|
|
|
code |
62 |
Stochastic collapsed variational Bayesian inference for latent Dirichlet allocation |
James R. Foulds, Levi Boyles, Christopher DuBois, Padhraic Smyth, Max Welling |
|
|
|
code |
60 |
Detecting insider threats in a real corporate database of computer usage activity |
Ted E. Senator, Henry G. Goldberg, Alex Memory, William T. Young, Brad Rees, Robert Pierce, Daniel Huang, Matthew Reardon, David A. Bader, Edmond Chow, Irfan A. Essa, Joshua Jones, Vinay Bettadapura, Duen Horng Chau, Oded Green, Oguz Kaya, Anita Zakrzewska, Erica Briscoe, Rudolph L. Mappus IV, Robert McColl, Lora Weiss, Thomas G. Dietterich, Alan Fern, WengKeen Wong, Shubhomoy Das, Andrew Emmott, Jed Irvine, Jay Yoon Lee, Danai Koutra, Christos Faloutsos, Daniel D. Corkill, Lisa Friedland, Amanda Gentzel, David D. Jensen |
|
|
|
code |
60 |
Recursive regularization for large-scale classification with hierarchical and graphical dependencies |
Siddharth Gopal, Yiming Yang |
|
|
|
code |
59 |
Knowledge discovery from massive healthcare claims data |
Varun Chandola, Sreenivas R. Sukumar, Jack C. Schryver |
|
|
|
code |
59 |
DTW-D: time series semi-supervised learning from a single example |
Yanping Chen, Bing Hu, Eamonn J. Keogh, Gustavo E. A. P. A. Batista |
|
|
|
code |
57 |
Mining evidences for named entity disambiguation |
Yang Li, Chi Wang, Fangqiu Han, Jiawei Han, Dan Roth, Xifeng Yan |
|
|
|
code |
57 |
Information cartography: creating zoomable, large-scale maps of information |
Dafna Shahaf, Jaewon Yang, Caroline Suen, Jeff Jacobs, Heidi Wang, Jure Leskovec |
|
|
|
code |
56 |
Entity resolution for big data |
Lise Getoor, Ashwin Machanavajjhala |
|
|
|
code |
56 |
Mining high utility episodes in complex event sequences |
ChengWei Wu, YuFeng Lin, Philip S. Yu, Vincent S. Tseng |
|
|
|
code |
55 |
Mining frequent graph patterns with differential privacy |
Entong Shen, Ting Yu |
|
|
|
code |
53 |
Assessing team strategy using spatiotemporal data |
Patrick Lucey, Dean Oliver, Peter Carr, Joe Roth, Iain A. Matthews |
|
|
|
code |
53 |
Statistical quality estimation for general crowdsourcing tasks |
Yukino Baba, Hisashi Kashima |
|
|
|
code |
51 |
Model-based kernel for efficient time series analysis |
Huanhuan Chen, Fengzhen Tang, Peter Tiño, Xin Yao |
|
|
|
code |
51 |
A phrase mining framework for recursive construction of a topical hierarchy |
Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, Jiawei Han |
|
|
|
code |
51 |
Evaluating the crowd with confidence |
Manas Joglekar, Hector GarciaMolina, Aditya G. Parameswaran |
|
|
|
code |
50 |
Inferring social roles and statuses in social networks |
Yuchen Zhao, Guan Wang, Philip S. Yu, Shaobo Liu, Simon Zhang |
|
|
|
code |
50 |
The bang for the buck: fair competitive viral marketing from the host perspective |
Wei Lu, Francesco Bonchi, Amit Goyal, Laks V. S. Lakshmanan |
|
|
|
code |
49 |
Multi-label classification by mining label and instance correlations from heterogeneous information networks |
Xiangnan Kong, Bokai Cao, Philip S. Yu |
|
|
|
code |
48 |
Robust principal component analysis via capped norms |
Qian Sun, Shuo Xiang, Jieping Ye |
|
|
|
code |
46 |
Understanding Twitter data with TweetXplorer |
Fred Morstatter, Shamanth Kumar, Huan Liu, Ross Maciejewski |
|
|
|
code |
45 |
Network discovery via constrained tensor analysis of fMRI data |
Ian N. Davidson, Sean Gilpin, Owen T. Carmichael, Peter B. Walker |
|
|
|
code |
45 |
Flexible and robust co-regularized multi-domain graph clustering |
Wei Cheng, Xiang Zhang, Zhishan Guo, Yubao Wu, Patrick F. Sullivan, Wei Wang |
|
|
|
code |
44 |
Making recommendations from multiple domains |
Wei Chen, Wynne Hsu, MongLi Lee |
|
|
|
code |
43 |
Maximizing acceptance probability for active friending in online social networks |
DeNian Yang, HuiJu Hung, WangChien Lee, Wei Chen |
|
|
|
code |
43 |
Silence is also evidence: interpreting dwell time for recommendation from psychological perspective |
Peifeng Yin, Ping Luo, WangChien Lee, Min Wang |
|
|
|
code |
42 |
Redundancy-aware maximal cliques |
Jia Wang, James Cheng, Ada WaiChee Fu |
|
|
|
code |
42 |
Comparing apples to oranges: a scalable solution with heterogeneous hashing |
Mingdong Ou, Peng Cui, Fei Wang, Jun Wang, Wenwu Zhu, Shiqiang Yang |
|
|
|
code |
41 |
Multi-label relational neighbor classification using social context features |
Xi Wang, Gita Sukthankar |
|
|
|
code |
41 |
Trace complexity of network inference |
Bruno D. Abrahao, Flavio Chierichetti, Robert Kleinberg, Alessandro Panconesi |
|
|
|
code |
41 |
STED: semi-supervised targeted-interest event detectionin in twitter |
Ting Hua, Feng Chen, Liang Zhao, ChangTien Lu, Naren Ramakrishnan |
|
|
|
code |
40 |
STRIP: stream learning of influence probabilities |
Konstantin Kutzkov, Albert Bifet, Francesco Bonchi, Aristides Gionis |
|
|
|
code |
40 |
Fast rank-2 nonnegative matrix factorization for hierarchical document clustering |
Da Kuang, Haesun Park |
|
|
|
code |
39 |
A new collaborative filtering approach for increasing the aggregate diversity of recommender systems |
Katja Niemann, Martin Wolpers |
|
|
|
code |
38 |
Guided learning for role discovery (GLRD): framework, algorithms, and applications |
Sean Gilpin, Tina EliassiRad, Ian N. Davidson |
|
|
|
code |
38 |
Scalable all-pairs similarity search in metric spaces |
Ye Wang, Ahmed Metwally, Srinivasan Parthasarathy |
|
|
|
code |
37 |
Diversity maximization under matroid constraints |
Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur |
|
|
|
code |
37 |
Synthetic review spamming and defense |
Huan Sun, Alex Morales, Xifeng Yan |
|
|
|
code |
37 |
An efficient ADMM algorithm for multidimensional anisotropic total variation regularization problems |
Sen Yang, Jie Wang, Wei Fan, Xiatian Zhang, Peter Wonka, Jieping Ye |
|
|
|
code |
35 |
One theme in all views: modeling consensus topics in multiple contexts |
Jian Tang, Ming Zhang, Qiaozhu Mei |
|
|
|
code |
35 |
Multi-source learning with block-wise missing data for Alzheimer's disease prediction |
Shuo Xiang, Lei Yuan, Wei Fan, Yalin Wang, Paul M. Thompson, Jieping Ye |
|
|
|
code |
34 |
Unsupervised link prediction using aggregative statistics on heterogeneous social networks |
TsungTing Kuo, Rui Yan, YuYang Huang, PerngHwa Kung, ShouDe Lin |
|
|
|
code |
34 |
Scalable text and link analysis with mixed-topic link models |
Yaojia Zhu, Xiaoran Yan, Lise Getoor, Cristopher Moore |
|
|
|
code |
33 |
Uncertainty in online experiments with dependent data: an evaluation of bootstrap methods |
Eytan Bakshy, Dean Eckles |
|
|
|
code |
32 |
Cross-task crowdsourcing |
Kaixiang Mo, Erheng Zhong, Qiang Yang |
|
|
|
code |
32 |
Understanding evolution of research themes: a probabilistic generative model for citations |
Xiaolong Wang, Chengxiang Zhai, Dan Roth |
|
|
|
code |
30 |
Debiasing social wisdom |
Abhimanyu Das, Sreenivas Gollapudi, Rina Panigrahy, Mahyar Salek |
|
|
|
code |
30 |
Multi-source deep learning for information trustworthiness estimation |
Liang Ge, Jing Gao, Xiaoyi Li, Aidong Zhang |
|
|
|
code |
30 |
Big data analytics with small footprint: squaring the cloud |
John F. Canny, Huasha Zhao |
|
|
|
code |
30 |
Adaptive collective routing using gaussian process dynamic congestion models |
Siyuan Liu, Yisong Yue, Ramayya Krishnan |
|
|
|
code |
28 |
A "semi-lazy" approach to probabilistic path prediction |
Jingbo Zhou, Anthony K. H. Tung, Wei Wu, Wee Siong Ng |
|
|
|
code |
28 |
Text-based measures of document diversity |
Kevin Bache, David Newman, Padhraic Smyth |
|
|
|
code |
27 |
Forex-foreteller: currency trend modeling using news articles |
Fang Jin, Nathan Self, Parang Saraf, Patrick Butler, Wei Wang, Naren Ramakrishnan |
|
|
|
code |
25 |
Querying discriminative and representative samples for batch mode active learning |
Zheng Wang, Jieping Ye |
|
|
|
code |
24 |
Information cascade at group scale |
Milad Eftekhar, Yashar Ganjali, Nick Koudas |
|
|
|
code |
24 |
JobMiner: a real-time system for mining job-related patterns from social media |
Yu Cheng, Yusheng Xie, Zhengzhang Chen, Ankit Agrawal, Alok N. Choudhary, Songtao Guo |
|
|
|
code |
24 |
Gaussian multiple instance learning approach for mapping the slums of the world using very high resolution imagery |
Ranga Raju Vatsavai |
|
|
|
code |
22 |
Active learning and search on low-rank matrices |
Danica J. Sutherland, Barnabás Póczos, Jeff G. Schneider |
|
|
|
code |
21 |
WiseMarket: a new paradigm for managing wisdom of online social users |
Caleb Chen Cao, Yongxin Tong, Lei Chen, H. V. Jagadish |
|
|
|
code |
21 |
Extracting social events for learning better information diffusion models |
Shuyang Lin, Fengjiao Wang, Qingbo Hu, Philip S. Yu |
|
|
|
code |
21 |
Efficient single-source shortest path and distance queries on large graphs |
Andy Diwen Zhu, Xiaokui Xiao, Sibo Wang, Wenqing Lin |
|
|
|
code |
21 |
An integrated framework for optimizing automatic monitoring systems in large IT infrastructures |
Liang Tang, Tao Li, Larisa Shwartz, Florian Pinel, Genady Grabarnik |
|
|
|
code |
21 |
Psychological advertising: exploring user psychology for click prediction in sponsored search |
Taifeng Wang, Jiang Bian, Shusen Liu, Yuyu Zhang, TieYan Liu |
|
|
|
code |
20 |
Towards never-ending learning from time series streams |
Yuan Hao, Yanping Chen, Jesin Zakaria, Bing Hu, Thanawin Rakthanmanon, Eamonn J. Keogh |
|
|
|
code |
20 |
An integrated framework for suicide risk prediction |
Truyen Tran, Dinh Q. Phung, Wei Luo, Richard Harvey, Michael Berk, Svetha Venkatesh |
|
|
|
code |
20 |
A tool for collecting provenance data in social media |
Pritam Gundecha, Suhas Ranganath, Zhuo Feng, Huan Liu |
|
|
|
code |
20 |
Selective sampling on graphs for classification |
Quanquan Gu, Charu C. Aggarwal, Jialu Liu, Jiawei Han |
|
|
|
code |
19 |
On community detection in real-world networks and the importance of degree assortativity |
Marek Ciglan, Michal Laclavik, Kjetil Nørvåg |
|
|
|
code |
19 |
FIU-Miner: a fast, integrated, and user-friendly system for data mining in distributed environment |
Chunqiu Zeng, Yexi Jiang, Li Zheng, Jingxuan Li, Lei Li, Hongtai Li, Chao Shen, Wubai Zhou, Tao Li, Bing Duan, Ming Lei, Pengnian Wang |
|
|
|
code |
18 |
Mining discriminative subgraphs from global-state networks |
Sayan Ranu, Minh X. Hoang, Ambuj K. Singh |
|
|
|
code |
18 |
Multi-space probabilistic sequence modeling |
Shuo Chen, Jiexun Xu, Thorsten Joachims |
|
|
|
code |
18 |
Active search on graphs |
Xuezhi Wang, Roman Garnett, Jeff G. Schneider |
|
|
|
code |
17 |
Automatic selection of social media responses to news |
Tadej Stajner, Bart Thomee, AnaMaria Popescu, Marco Pennacchiotti, Alejandro Jaimes |
|
|
|
code |
17 |
Approximate graph mining with label costs |
Pranay Anchuri, Mohammed J. Zaki, Omer Barkol, Shahar Golan, Moshe Shamy |
|
|
|
code |
16 |
Using co-visitation networks for detecting large scale online display advertising exchange fraud |
Ori Stitelman, Claudia Perlich, Brian Dalessandro, Rod Hook, Troy Raeder, Foster J. Provost |
|
|
|
code |
16 |
A transfer learning based framework of crowd-selection on twitter |
Zhou Zhao, Da Yan, Wilfred Ng, Shi Gao |
|
|
|
code |
16 |
Heat pump detection from coarse grained smart meter data with positive and unlabeled learning |
Hongliang Fei, Younghun Kim, Sambit Sahu, Milind R. Naphade, Sanjay K. Mamidipalli, John Hutchinson |
|
|
|
code |
16 |
FeaFiner: biomarker identification from medical data through feature generalization and selection |
Jiayu Zhou, Zhaosong Lu, Jimeng Sun, Lei Yuan, Fei Wang, Jieping Ye |
|
|
|
code |
15 |
A unified search federation system based on online user feedback |
Luo Jie, Sudarshan Lamkhede, Rochit Sapra, Evans Hsu, Helen Song, Yi Chang |
|
|
|
code |
14 |
Direct optimization of ranking measures for learning to rank models |
Ming Tan, Tian Xia, Lily Guo, Shaojun Wang |
|
|
|
code |
14 |
Predictive model performance: offline and online evaluations |
Jeonghee Yi, Ye Chen, Jie Li, Swaraj Sett, Tak W. Yan |
|
|
|
code |
14 |
SVMpAUCtight: a new support vector method for optimizing partial AUC based on a tight convex upper bound |
Harikrishna Narasimhan, Shivani Agarwal |
|
|
|
code |
14 |
Summarizing probabilistic frequent patterns: a fast approach |
Chunyang Liu, Ling Chen, Chengqi Zhang |
|
|
|
code |
14 |
Link prediction with social vector clocks |
Conrad Lee, Bobo Nick, Ulrik Brandes, Pádraig Cunningham |
|
|
|
code |
14 |
Empirical bayes model to combine signals of adverse drug reactions |
Rave Harpaz, William DuMouchel, Paea LePendu, Nigam H. Shah |
|
|
|
code |
14 |
Density-based logistic regression |
Wenlin Chen, Yixin Chen, Yi Mao, Baolong Guo |
|
|
|
code |
13 |
Massively parallel expectation maximization using graphics processing units |
Muzaffer Can Altinigneli, Claudia Plant, Christian Böhm |
|
|
|
code |
13 |
Modeling the dynamics of composite social networks |
Erheng Zhong, Wei Fan, Yin Zhu, Qiang Yang |
|
|
|
code |
13 |
Learning to question: leveraging user preferences for shopping advice |
Mahashweta Das, Gianmarco De Francisci Morales, Aristides Gionis, Ingmar Weber |
|
|
|
code |
12 |
Mining evolutionary multi-branch trees from text streams |
Xiting Wang, Shixia Liu, Yangqiu Song, Baining Guo |
|
|
|
code |
12 |
MI2LS: multi-instance learning from multiple informationsources |
Dan Zhang, Jingrui He, Richard D. Lawrence |
|
|
|
code |
12 |
The business impact of deep learning |
Jeremy Howard |
|
|
|
code |
12 |
Towards long-lead forecasting of extreme flood events: a data mining framework for precipitation cluster precursors identification |
Dawei Wang, Wei Ding, Kui Yu, Xindong Wu, Ping Chen, David L. Small, Shafiqul Islam |
|
|
|
code |
12 |
KeySee: supporting keyword search on evolving events in social streams |
Pei Lee, Laks V. S. Lakshmanan, Evangelos E. Milios |
|
|
|
code |
11 |
Collaborative boosting for activity classification in microblogs |
Yangqiu Song, Zhengdong Lu, Cane Wingki Leung, Qiang Yang |
|
|
|
code |
11 |
Learning mixed kronecker product graph models with simulated method of moments |
Sebastián Moreno, Jennifer Neville, Sergey Kirshner |
|
|
|
code |
11 |
iHR: an online recruiting system for Xiamen Talent Service Center |
Wenxing Hong, Lei Li, Tao Li, Wenfu Pan |
|
|
|
code |
11 |
A time-dependent enhanced support vector machine for time series regression |
Goce Ristanoski, Wei Liu, James Bailey |
|
|
|
code |
11 |
A general bootstrap performance diagnostic |
Ariel Kleiner, Ameet Talwalkar, Sameer Agarwal, Ion Stoica, Michael I. Jordan |
|
|
|
code |
10 |
LAICOS: an open source platform for personalized social web search |
Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub |
|
|
|
code |
9 |
Measuring spontaneous devaluations in user preferences |
Komal Kapoor, Nisheeth Srivastava, Jaideep Srivastava, Paul R. Schrater |
|
|
|
code |
9 |
Scalable inference in max-margin topic models |
Jun Zhu, Xun Zheng, Li Zhou, Bo Zhang |
|
|
|
code |
9 |
Improving quality control by early prediction of manufacturing outcomes |
Sholom M. Weiss, Amit Dhurandhar, Robert J. Baseman |
|
|
|
code |
9 |
Palette power: enabling visual search through colors |
Anurag Bhardwaj, Atish Das Sarma, Wei Di, Raffay Hamid, Robinson Piramuthu, Neel Sundaresan |
|
|
|
code |
8 |
EventCube: multi-dimensional search and mining of structured and text data |
Fangbo Tao, Kin Hou Lei, Jiawei Han, Chengxiang Zhai, Xiao Cheng, Marina Danilevsky, Nihit Desai, Bolin Ding, Jing Ge, Heng Ji, Rucha Kanade, Anne Kao, Qi Li, Yanen Li, Cindy Xide Lin, Jialu Liu, Nikunj C. Oza, Ashok N. Srivastava, Rodney Tjoelker, Chi Wang, Duo Zhang, Bo Zhao |
|
|
|
code |
8 |
Amplifying the voice of youth in Africa via text analytics |
Prem Melville, Vijil Chenthamarakshan, Richard D. Lawrence, James Powell, Moses Mugisha, Sharad Sapra, Rajesh Anandan, Solomon Assefa |
|
|
|
code |
8 |
Inferring distant-time location in low-sampling-rate trajectories |
MengFen Chiang, YungHsiang Lin, WenChih Peng, Philip S. Yu |
|
|
|
code |
7 |
AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data |
Marina Danilevsky, Chi Wang, Fangbo Tao, Son Nguyen, Gong Chen, Nihit Desai, Lidan Wang, Jiawei Han |
|
|
|
code |
7 |
Representing documents through their readers |
Khalid ElArini, Min Xu, Emily B. Fox, Carlos Guestrin |
|
|
|
code |
7 |
Risk-O-Meter: an intelligent clinical risk calculator |
Kiyana Zolfaghar, Jayshree Agarwal, Deepthi Sistla, SiChi Chin, Senjuti Basu Roy, Nele Verbiest |
|
|
|
code |
7 |
Query clustering based on bid landscape for sponsored search auction optimization |
Ye Chen, Weiguo Liu, Jeonghee Yi, Anton Schwaighofer, Tak W. Yan |
|
|
|
code |
6 |
Dynamic memory allocation policies for postings in real-time Twitter search |
Nima Asadi, Jimmy Lin, Michael Busch |
|
|
|
code |
6 |
Fast structure learning in generalized stochastic processes with latent factors |
Mohammad Taha Bahadori, Yan Liu, Eric P. Xing |
|
|
|
code |
6 |
SEA: a system for event analysis on chinese tweets |
Yaqiong Wang, Hongfu Liu, Hao Lin, Junjie Wu, Zhiang Wu, Jie Cao |
|
|
|
code |
6 |
Network sampling |
Lise Getoor, Ashwin Machanavajjhala |
|
|
|
code |
5 |
Optimizing parallel belief propagation in junction treesusing regression |
Lu Zheng, Ole J. Mengshoel |
|
|
|
code |
5 |
Cyber security: how visual analytics unlock insight |
Raffael Marty |
|
|
|
code |
5 |
Analysis of advanced meter infrastructure data of water consumption in apartment buildings |
Einat Kermany, Hanna Mazzawi, Dorit Baras, Yehuda Naveh, Hagai Michaelis |
|
|
|
code |
5 |
Mining for geographically disperse communities in social networks by leveraging distance modularity |
Paulo Shakarian, Patrick Roos, Devon Callahan, Cory Kirk |
|
|
|
code |
5 |
Estimating sharer reputation via social data calibration |
Jaewon Yang, BeeChung Chen, Deepak Agarwal |
|
|
|
code |
4 |
Constrained stochastic gradient descent for large-scale least squares problem |
Yang Mu, Wei Ding, Tianyi Zhou, Dacheng Tao |
|
|
|
code |
4 |
Financing lead triggers: empowering sales reps through knowledge discovery and fusion |
Kareem S. Aggour, Bethany Hoogs |
|
|
|
code |
4 |
Exploratory analysis of highly heterogeneous document collections |
Arun S. Maiya, John P. Thompson, Francisco LoaizaLemos, Robert M. Rolfe |
|
|
|
code |
4 |
A data-driven method for in-game decision making in MLB: when to pull a starting pitcher |
Gartheeban Ganeshapillai, John V. Guttag |
|
|
|
code |
4 |
Mining data from mobile devices: a survey of smart sensing and analytics |
Spiros Papadimitriou, Tina EliassiRad |
|
|
|
code |
4 |
Robust sparse estimation of multiresponse regression and inverse covariance matrix via the L2 distance |
Aurélie C. Lozano, Huijing Jiang, Xinwei Deng |
|
|
|
code |
3 |
Beyond myopic inference in big data pipelines |
Karthik Raman, Adith Swaminathan, Johannes Gehrke, Thorsten Joachims |
|
|
|
code |
3 |
Model selection in markovian processes |
Assaf Hallak, Dotan Di Castro, Shie Mannor |
|
|
|
code |
3 |
Mining lines in the sand: on trajectory discovery from untrustworthy data in cyber-physical system |
LuAn Tang, Xiao Yu, Quanquan Gu, Jiawei Han, Alice Leung, Thomas La Porta |
|
|
|
code |
3 |
Nonparametric hierarchal bayesian modeling in non-contractual heterogeneous survival data |
Shouichi Nagano, Yusuke Ichikawa, Noriko Takaya, Tadasu Uchiyama, Makoto Abe |
|
|
|
code |
3 |
Quadratic optimization to identify highly heritable quantitative traits from complex phenotypic features |
Jiangwen Sun, Jinbo Bi, Henry R. Kranzler |
|
|
|
code |
3 |
Adaptive adversaries: building systems to fight fraud and cyber intruders |
Ari Gesher |
|
|
|
code |
3 |
Hadoop: a view from the trenches |
Milind Bhandarkar |
|
|
|
code |
3 |
Scalable supervised dimensionality reduction using clustering |
Troy Raeder, Claudia Perlich, Brian Dalessandro, Ori Stitelman, Foster J. Provost |
|
|
|
code |
3 |
Experience from hosting a corporate prediction market: benefits beyond the forecasts |
Thomas A. Montgomery, Paul M. Stieg, Michael J. Cavaretta, Paul E. Moraal |
|
|
|
code |
3 |
Exploiting user clicks for automatic seed set generation for entity matching |
Xiao Bai, Flavio Paiva Junqueira, Srinivasan H. Sengamedu |
|
|
|
code |
2 |
Speeding up large-scale learning with a social prior |
Deepayan Chakrabarti, Ralf Herbrich |
|
|
|
code |
2 |
LAFT-Explorer: inferring, visualizing and predicting how your social network expands |
Jun Zhang, Chaokun Wang, Yuanchi Ning, Yichi Liu, Jianmin Wang, Philip S. Yu |
|
|
|
code |
2 |
Trial and error in influential social networks |
Xiaohui Bei, Ning Chen, Liyu Dou, Xiangru Huang, Ruixin Qiang |
|
|
|
code |
2 |
Succinct interval-splitting tree for scalable similarity search of compound-protein pairs with property constraints |
Yasuo Tabei, Akihiro Kishimoto, Masaaki Kotera, Yoshihiro Yamanishi |
|
|
|
code |
1 |
The online revolution: education for everyone |
Andrew Y. Ng, Daphne Koller |
|
|
|
code |
1 |
Indexed block coordinate descent for large-scale linear classification with limited memory |
Ian EnHsu Yen, ChunFu Chang, TingWei Lin, ShanWei Lin, ShouDe Lin |
|
|
|
code |
1 |
Exact sparse recovery with L0 projections |
Ping Li, CunHui Zhang |
|
|
|
code |
1 |
Targeting and influencing at scale: from presidential elections to social good |
Rayid Ghani |
|
|
|
code |
1 |
A data mining driven risk profiling method for road asset management |
Daniel Emerson, Justin Weligamage, Richi Nayak |
|
|
|
code |
1 |
Efficiently rewriting large multimedia application execution traces with few event sequences |
Christiane Kamdem Kengne, Léon Constantin Fopa, Alexandre Termier, Noha Ibrahim, MarieChristine Rousset, Takashi Washio, Miguel Santana |
|
|
|
code |
1 |
A privacy preserving framework for managing vehicle data in road pricing systems |
Huayu Wu, Wee Siong Ng, KianLee Tan, Wei Wu, Shili Xiang, Mingqiang Xue |
|
|
|
code |
1 |
Algorithmic techniques for modeling and mining large graphs (AMAzING) |
Alan M. Frieze, Aristides Gionis, Charalampos E. Tsourakakis |
|
|
|
code |
1 |
Predicting the present with search engine data |
Hal R. Varian |
|
|
|
code |
0 |
Mining the digital universe of data to develop personalized cancer therapies |
Eric E. Schadt |
|
|
|
code |
0 |
An online system with end-user services: mining novelty concepts from tv broadcast subtitles |
Mika Rautiainen, Jouni Sarvanko, Arto Heikkinen, Mika Ylianttila, Vassilis Kostakos |
|
|
|
code |
0 |
When TEDDY meets GrizzLY: temporal dependency discovery for triggering road deicing operations |
Céline Robardet, VasileMarian Scuturici, Marc Plantevit, Antoine Fraboulet |
|
|
|
code |
0 |
Scale-out beyond map-reduce |
Raghu Ramakrishnan |
|
|
|
code |
0 |
Optimization in learning and data analysis |
Stephen J. Wright |
|
|
|
code |
0 |
Repetition-aware content placement in navigational networks |
Dóra Erdös, Vatche Ishakian, Azer Bestavros, Evimaria Terzi |
|
|
|
code |
0 |
To buy or not to buy: that is the question |
Oren Etzioni |
|
|
|
code |
0 |
Using "big data" to solve "small data" problems |
Chris Neumann |
|
|
|
code |
0 |
Panel: a data scientist's guide to making money from start-ups |
Foster J. Provost, Geoffrey I. Webb |
|
|
|
code |
0 |
SAE: social analytic engine for large networks |
Yang Yang, Jianfei Wang, Yutao Zhang, Wei Chen, Jing Zhang, Honglei Zhuang, Zhilin Yang, Bo Ma, Zhanpeng Fang, Sen Wu, Xiaoxiao Li, Debing Liu, Jie Tang |
|
|
|
code |
0 |
The dataminer's guide to scalable mixed-membership and nonparametric bayesian models |
Amr Ahmed, Alexander J. Smola |
|
|
|
code |
0 |