Vipin Kumar

Vipin Kumar

University of Minnesota-Twin Cities

H-index: 137

North America-United States

Professor Information

University

University of Minnesota-Twin Cities

Position

___

Citations(all)

138320

Citations(since 2020)

44875

Cited By

106828

hIndex(all)

137

hIndex(since 2020)

77

i10Index(all)

706

i10Index(since 2020)

347

Email

University Profile Page

University of Minnesota-Twin Cities

Research & Interests List

Data mining

parallel computing

high performance computing

Artificial Intelligence

machine learning

Top articles of Vipin Kumar

Probabilistic inverse modeling: An application in hydrology

Rapid advancement in inverse modeling methods have brought into light their susceptibility to imperfect data. This has made it imperative to obtain more explainable and trustworthy estimates from these models. In hydrology, basin characteristics can be noisy or missing, impacting streamflow prediction. We propose a probabilistic inverse model framework that can reconstruct robust hydrology basin characteristics from dynamic input weather driver and streamflow response data. We address two aspects of building more explainable inverse models, uncertainty estimation (uncertainty due to imperfect data and imperfect model) and robustness. This can help improve the trust of water managers, handling of noisy data and reduce costs. We also propose an uncertainty based loss regularization that offers removal of 17% of temporal artifacts in reconstructions, 36% reduction in uncertainty and 4% higher coverage rate …

Authors

Somya Sharma,Rahul Ghosh,Arvind Renganathan,Xiang Li,Snigdhansu Chatterjee,John Nieber,Christopher Duffy,Vipin Kumar

Published Date

2023

A scalable framework for quantifying field-level agricultural carbon outcomes

Agriculture contributes nearly a quarter of global greenhouse gas (GHG) emissions, which is motivating interest in adopting certain farming practices that have the potential to reduce GHG emissions or sequester carbon in soil. The related GHG emission (including N2O and CH4) and changes in soil carbon stock are defined here as “agricultural carbon outcomes”. Accurate quantification of agricultural carbon outcomes is the basis for achieving emission reductions for agriculture, but existing approaches for measuring carbon outcomes (including direct measurements, emission factors, and process-based modeling) fall short of achieving the required accuracy and scalability necessary to support credible, verifiable, and cost-effective measurement and improvement of these carbon outcomes. Here we propose a foundational and scalable framework to quantify field-level carbon outcomes for farmland, which is based …

Authors

Kaiyu Guan,Zhenong Jin,Bin Peng,Jinyun Tang,Evan H DeLucia,Paul West,Chongya Jiang,Sheng Wang,Taegon Kim,Wang Zhou,Tim Griffis,Licheng Liu,Wendy H Yang,Ziqi Qin,Qi Yang,Andrew Margenot,Emily R Stuchiner,Vipin Kumar,Carl Bernacchi,Jonathan Coppess,Kimberly A Novick,James Gerber,Molly Jahn,Madhu Khanna,DoKyoung Lee,Zhangliang Chen,Shang-Jen Yang

Published Date

2023/5/29

Spatiotemporal Classification with limited labels using Constrained Clustering for large datasets

Creating separable representations via representation learning and clustering is critical in analyzing large unstructured datasets with only a few labels. Separable representations can lead to supervised models with better classification capabilities and additionally aid in generating new labeled samples. Most unsupervised and semisupervised methods to analyze large datasets do not leverage the existing small amounts of labels to get better representations. In this paper, we propose a spatiotemporal clustering paradigm that uses spatial and temporal features combined with a constrained loss to produce separable representations. We show the working of this method on the newly published dataset ReaLSAT, a dataset of surface water dynamics for over 680,000 lakes across the world, making it an essential dataset in terms of ecology and sustainability. Using this large un- labelled dataset, we first show how a …

Authors

Praveen Ravirathinam,Rahul Ghosh,Ke Wang,Keyang Xuan,Ankush Khandelwal,Hilary Dugan,Paul Hanson,Vipin Kumar

Published Date

2023

Satellite image classification across multiple resolutions and time using ordering constraint among instances

A method includes receiving a satellite image of an area and classifying each pixel in the satellite image as representing water, land or unknown using a model. For each of a plurality of possible water levels, a cost associated with the water level is determined, wherein determining the cost associated with a water level includes determining a number of pixels for which the model classification must change to be consistent with the water level and determining a difference between the water level and a water level determined for the area at a previous time point. The lowest cost water level is selected and used to reclassify at least one pixel.

Published Date

2023/4/11

Mini-Batch Learning Strategies for modeling long term temporal dependencies: a study in environmental applications

In many environmental applications, recurrent neural networks (RNNs) are often used to model physical variables with long temporal dependencies. However, due to minibatch training, temporal relationships between training segments within the batch (intra-batch) as well as between batches (inter-batch) are not considered, which can lead to limited performance. Stateful RNNs aim to address this issue by passing hidden states between batches. Since Stateful RNNs ignore intra-batch temporal dependency, there exists a trade-off between training stability and capturing temporal dependency. In this paper, we provide a quantitative comparison of different Stateful RNN modeling strategies, and propose two strategies to enforce both intra- and inter-batch temporal dependency. First, we extend Stateful RNNs by defining a batch as a temporally ordered set of training segments, which enables intra-batch sharing of …

Authors

Shaoming Xu,Ankush Khandelwal,Xiang Li,Xiaowei Jia,Licheng Liu,Jared Willard,Rahul Ghosh,Kelly Cutler,Michael Steinbach,Christopher Duffy,John Nieber,Vipin Kumar

Published Date

2023

Near‐term forecasts of stream temperature using deep learning and data assimilation in support of management decisions

Deep learning (DL) models are increasingly used to make accurate hindcasts of management‐relevant variables, but they are less commonly used in forecasting applications. Data assimilation (DA) can be used for forecasts to leverage real‐time observations, where the difference between model predictions and observations today is used to adjust the model to make better predictions tomorrow. In this use case, we developed a process‐guided DL and DA approach to make 7‐day probabilistic forecasts of daily maximum water temperature in the Delaware River Basin in support of water management decisions. Our modeling system produced forecasts of daily maximum water temperature with an average root mean squared error (RMSE) from 1.1 to 1.4°C for 1‐day‐ahead and 1.4 to 1.9°C for 7‐day‐ahead forecasts across all sites. The DA algorithm marginally improved forecast performance when compared with …

Authors

Jacob A Zwart,Samantha K Oliver,William David Watkins,Jeffrey M Sadler,Alison P Appling,Hayley R Corson‐Dosch,Xiaowei Jia,Vipin Kumar,Jordan S Read

Journal

JAWRA Journal of the American Water Resources Association

Published Date

2023/4

Entity aware modelling: A survey

Personalized prediction of responses for individual entities caused by external drivers is vital across many disciplines. Recent machine learning (ML) advances have led to new state-of-the-art response prediction models. Models built at a population level often lead to sub-optimal performance in many personalized prediction settings due to heterogeneity in data across entities (tasks). In personalized prediction, the goal is to incorporate inherent characteristics of different entities to improve prediction performance. In this survey, we focus on the recent developments in the ML community for such entity-aware modeling approaches. ML algorithms often modulate the network using these entity characteristics when they are readily available. However, these entity characteristics are not readily available in many real-world scenarios, and different ML methods have been proposed to infer these characteristics from the data. In this survey, we have organized the current literature on entity-aware modeling based on the availability of these characteristics as well as the amount of training data. We highlight how recent innovations in other disciplines, such as uncertainty quantification, fairness, and knowledge-guided machine learning, can improve entity-aware modeling.

Authors

Rahul Ghosh,Haoyu Yang,Ankush Khandelwal,Erhu He,Arvind Renganathan,Somya Sharma,Xiaowei Jia,Vipin Kumar

Journal

arXiv preprint arXiv:2302.08406

Published Date

2023/2/16

Toward Large-Scale Mapping of Tree Crops with High-Resolution Satellite Imagery and Deep Learning Algorithms: A Case Study of Olive Orchards in Morocco (vol 13, 1740, 2021)

Timely and accurate monitoring of tree crop extent and productivities are necessary for informing policy-making and investments. However, except for a very few tree species (e.g., oil palms) with obvious canopy and extensive planting, most small-crown tree crops are understudied in the remote sensing domain. To conduct large-scale small-crown tree mapping, several key questions remain to be answered, such as the choice of satellite imagery with different spatial and temporal resolution and model generalizability. In this study, we use olive trees in Morocco as an example to explore the two abovementioned questions in mapping small-crown orchard trees using 0.5 m DigitalGlobe (DG) and 3 m Planet imagery and deep learning (DL) techniques. Results show that compared to DG imagery whose mean overall accuracy (OA) can reach 0.94 and 0.92 in two climatic regions, Planet imagery has limited capacity to detect olive orchards even with multi-temporal information. The temporal information of Planet only helps when enough spatial features can be captured, e.g., when olives are with large crown sizes (e.g., >3 m) and small tree spacings (e.g., <3 m). Regarding model generalizability, experiments with DG imagery show a decrease in F1 score up to 5% and OA to 4% when transferring models to new regions with distribution shift in the feature space. Findings from this study can serve as a practical reference for many other similar mapping tasks (e.g., nuts and citrus) around the world.

Authors

Chenxi Lin,Zhenong Jin,David Mulla,Rahul Ghosh,Kaiyu Guan,Vipin Kumar,Yaping Cai

Journal

Remote Sensing

Published Date

2021/4/30

Professor FAQs

What is Vipin Kumar's h-index at University of Minnesota-Twin Cities?

The h-index of Vipin Kumar has been 77 since 2020 and 137 in total.

What are Vipin Kumar's research interests?

The research interests of Vipin Kumar are: Data mining, parallel computing, high performance computing, Artificial Intelligence, machine learning

What is Vipin Kumar's total number of citations?

Vipin Kumar has 138,320 citations in total.

What are the co-authors of Vipin Kumar?

The co-authors of Vipin Kumar are George Karypis, Hui Xiong, Fellow of AAAS and IEEE, Xindong Wu, Shashi Shekhar, Jaideep Srivastava, Arindam Banerjee.

Co-Authors

H-index: 113
George Karypis

George Karypis

University of Minnesota-Twin Cities

H-index: 88
Hui Xiong, Fellow of AAAS and IEEE

Hui Xiong, Fellow of AAAS and IEEE

Rutgers, The State University of New Jersey

H-index: 82
Xindong Wu

Xindong Wu

Hefei University of Technology

H-index: 71
Shashi Shekhar

Shashi Shekhar

University of Minnesota-Twin Cities

H-index: 64
Jaideep Srivastava

Jaideep Srivastava

University of Minnesota-Twin Cities

H-index: 63
Arindam Banerjee

Arindam Banerjee

University of Illinois at Urbana-Champaign

academic-engine

Useful Links