-
Atypical antiferromagnetic ordering in single crystalline quasi-2D honeycomb magnet YbI$_3$
Authors:
Nashra Pistawala,
Luminita Harnagea,
Sitaram Ramakrishnan,
Priyanshi Tiwari,
M. P. Saravanan,
Rajeev Rawat,
Surjeet Singh
Abstract:
Here, we study YbI$_3$, a quasi-2D layered material with Yb atoms arranged on an ideal honeycomb network of edge-sharing YbI$_6$ octahedra, analogous to the low-temperature phase of $α-$RuCl$_3$. High quality single crystals of YbI$_3$ are grown from Yb and I as starting precursors, using the vapor transport technique. The grown crystals are characterized by single crystal x-ray diffraction, Raman…
▽ More
Here, we study YbI$_3$, a quasi-2D layered material with Yb atoms arranged on an ideal honeycomb network of edge-sharing YbI$_6$ octahedra, analogous to the low-temperature phase of $α-$RuCl$_3$. High quality single crystals of YbI$_3$ are grown from Yb and I as starting precursors, using the vapor transport technique. The grown crystals are characterized by single crystal x-ray diffraction, Raman spectroscopy, magnetization, and heat capacity probes. The crystal-field split ground state of Yb$^{3+}$ in \Yb~ is a well-isolated Kramers doublet with an effective moment $\rm J_{eff} = 1/2$. Upon cooling, the low-temperature heat capacity of \Yb~ reveals a broad peak at $\rm T_1 = 0.95$~K due to short-range ordering of the Yb moments, followed by a sharp peak at $\rm T_2 = T_N = 0.6$~K due to long-range ordering. The magnetic behavior is found to be weakly anisotropic with $χ^\parallel > χ^\perp$, where $χ^\parallel$ and $χ^\perp$ refers to the in-plane ($H \parallel ab$) and out-of-plane ($H \perp ab$) susceptibilities. The 2~K isothermal magnetization saturates at $\rm \approx~1.5~μ_B/Yb^{3+}$ (in-plane) and $\rm \approx~1~μ_B/Yb^{3+}$ (out-of-plane), suggesting the anisotropy to be easy-plane type. Low-temperature heat capacity, well below T$_N$, is found to vary as T$^α$ with $α~\approx~2.5$, indicating a possible unconventional magnetic ground state for YbI$_3$.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Multiple Kronecker RLS fusion-based link propagation for drug-side effect prediction
Authors:
Yuqing Qian,
Ziyu Zheng,
Prayag Tiwari,
Yijie Ding,
Quan Zou
Abstract:
Drug-side effect prediction has become an essential area of research in the field of pharmacology. As the use of medications continues to rise, so does the importance of understanding and mitigating the potential risks associated with them. At present, researchers have turned to data-driven methods to predict drug-side effects. Drug-side effect prediction is a link prediction problem, and the rela…
▽ More
Drug-side effect prediction has become an essential area of research in the field of pharmacology. As the use of medications continues to rise, so does the importance of understanding and mitigating the potential risks associated with them. At present, researchers have turned to data-driven methods to predict drug-side effects. Drug-side effect prediction is a link prediction problem, and the related data can be described from various perspectives. To process these kinds of data, a multi-view method, called Multiple Kronecker RLS fusion-based link propagation (MKronRLSF-LP), is proposed. MKronRLSF-LP extends the Kron-RLS by finding the consensus partitions and multiple graph Laplacian constraints in the multi-view setting. Both of these multi-view settings contribute to a higher quality result. Extensive experiments have been conducted on drug-side effect datasets, and our empirical results provide evidence that our approach is effective and robust.
△ Less
Submitted 27 June, 2024;
originally announced July 2024.
-
LMVD: A Large-Scale Multimodal Vlog Dataset for Depression Detection in the Wild
Authors:
Lang He,
Kai Chen,
Junnan Zhao,
Yimeng Wang,
Ercheng Pei,
Haifeng Chen,
Jiewei Jiang,
Shiqing Zhang,
Jie Zhang,
Zhongmin Wang,
Tao He,
Prayag Tiwari
Abstract:
Depression can significantly impact many aspects of an individual's life, including their personal and social functioning, academic and work performance, and overall quality of life. Many researchers within the field of affective computing are adopting deep learning technology to explore potential patterns related to the detection of depression. However, because of subjects' privacy protection con…
▽ More
Depression can significantly impact many aspects of an individual's life, including their personal and social functioning, academic and work performance, and overall quality of life. Many researchers within the field of affective computing are adopting deep learning technology to explore potential patterns related to the detection of depression. However, because of subjects' privacy protection concerns, that data in this area is still scarce, presenting a challenge for the deep discriminative models used in detecting depression. To navigate these obstacles, a large-scale multimodal vlog dataset (LMVD), for depression recognition in the wild is built. In LMVD, which has 1823 samples with 214 hours of the 1475 participants captured from four multimedia platforms (Sina Weibo, Bilibili, Tiktok, and YouTube). A novel architecture termed MDDformer to learn the non-verbal behaviors of individuals is proposed. Extensive validations are performed on the LMVD dataset, demonstrating superior performance for depression detection. We anticipate that the LMVD will contribute a valuable function to the depression detection community. The data and code will released at the link: https://github.com/helang818/LMVD/.
△ Less
Submitted 8 May, 2024;
originally announced July 2024.
-
Flux dependence of redshift distribution and clustering of LOFAR radio sources
Authors:
Nitesh Bhardwaj,
Dominik J. Schwarz,
Catherine L. Hale,
Kenneth J. Duncan,
Stefano Camera,
Caroline S. Heneka,
Szymon J. Nakoneczny,
Huub J. A. Röttgering,
Thilo M. Siewert,
Prabhakar Tiwari,
Jinglan Zheng,
George Miley,
Cyril Tasse
Abstract:
In this work we study the flux density dependence of the redshift distribution of low-frequency radio sources observed in the LOFAR Two-metre Sky Survey (LoTSS) deep fields and apply it to estimate the clustering length of the large-scale structure of the Universe, examining flux density limited samples (1 mJy, 2 mJy, 4 mJy and 8 mJy) of LoTSS wide field radio sources. We utilise and combine the p…
▽ More
In this work we study the flux density dependence of the redshift distribution of low-frequency radio sources observed in the LOFAR Two-metre Sky Survey (LoTSS) deep fields and apply it to estimate the clustering length of the large-scale structure of the Universe, examining flux density limited samples (1 mJy, 2 mJy, 4 mJy and 8 mJy) of LoTSS wide field radio sources. We utilise and combine the posterior probability distributions of photometric redshift determinations for LoTSS deep field observations from three different fields (Boötes, Lockman hole and ELAIS-N1, together about $26$ square degrees of sky), which are available for between $91\%$ to $96\%$ of all sources above the studied flux density thresholds and observed in the area covered by multi-frequency data. We estimate uncertainties by a bootstrap method. We apply the inferred redshift distribution on the LoTSS wide area radio sources from the HETDEX field (LoTSS-DR1; about $424$ square degrees) and make use of the Limber approximation and a power-law model of three dimensional clustering to measure the clustering length, $r_0$, for various models of the evolution of clustering. We find that the redshift distributions from all three LoTSS deep fields agree within expected uncertainties. We show that the radio source population probed by LoTSS at flux densities above $1$ mJy has a median redshift of at least $0.9$. At $2$ mJy, we measure the clustering length of LoTSS radio sources to be $r_0 = (10.1\pm 2.6) \ h^{-1}$Mpc in the context of the comoving clustering model. Our findings are in agreement with measurements at higher flux density thresholds at the same frequency and with measurements at higher frequencies in the context of the comoving clustering model.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Association of neighborhood disadvantage with cognitive function and cortical disorganization in an unimpaired cohort
Authors:
Apoorva Safai,
Erin Jonaitis,
Rebecca E Langhough,
William R Buckingham,
Sterling C. Johnson,
W. Ryan Powell,
Amy J. H. Kind,
Barbara B. Bendlin,
Pallavi Tiwari
Abstract:
Neighborhood disadvantage is associated with worse health and cognitive outcomes. Morphological similarity network (MSN) is a promising approach to elucidate cortical network patterns underlying complex cognitive functions. We hypothesized that MSNs could capture changes in cortical patterns related to neighborhood disadvantage and cognitive function. This cross-sectional study included cognitivel…
▽ More
Neighborhood disadvantage is associated with worse health and cognitive outcomes. Morphological similarity network (MSN) is a promising approach to elucidate cortical network patterns underlying complex cognitive functions. We hypothesized that MSNs could capture changes in cortical patterns related to neighborhood disadvantage and cognitive function. This cross-sectional study included cognitively unimpaired participants from two large Alzheimers studies at University of Wisconsin-Madison. Neighborhood disadvantage status was obtained using the Area Deprivation Index (ADI). Cognitive performance was assessed on memory, processing speed and executive function. Morphological Similarity Networks (MSN) were constructed for each participant based on the similarity in distribution of cortical thickness of brain regions, followed by computation of local and global network features. Association of ADI with cognitive scores and MSN features were examined using linear regression and mediation analysis. ADI showed negative association with category fluency,implicit learning speed, story recall and modified pre-clinical Alzheimers cognitive composite scores, indicating worse cognitive function among those living in more disadvantaged neighborhoods. Local network features of frontal and temporal regions differed based on ADI status. Centrality of left lateral orbitofrontal region showed a partial mediating effect between association of neighborhood disadvantage and story recall performance. Our preliminary findings suggest differences in local cortical organization by neighborhood disadvantage, which partially mediated the relationship between ADI and cognitive performance, providing a possible network-based mechanism to, in-part, explain the risk for poor cognitive functioning associated with disadvantaged neighborhoods.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Expected Grad-CAM: Towards gradient faithfulness
Authors:
Vincenzo Buono,
Peyman Sheikholharam Mashhadi,
Mahmoud Rahat,
Prayag Tiwari,
Stefan Byttner
Abstract:
Although input-gradients techniques have evolved to mitigate and tackle the challenges associated with gradients, modern gradient-weighted CAM approaches still rely on vanilla gradients, which are inherently susceptible to the saturation phenomena. Despite recent enhancements have incorporated counterfactual gradient strategies as a mitigating measure, these local explanation techniques still exhi…
▽ More
Although input-gradients techniques have evolved to mitigate and tackle the challenges associated with gradients, modern gradient-weighted CAM approaches still rely on vanilla gradients, which are inherently susceptible to the saturation phenomena. Despite recent enhancements have incorporated counterfactual gradient strategies as a mitigating measure, these local explanation techniques still exhibit a lack of sensitivity to their baseline parameter. Our work proposes a gradient-weighted CAM augmentation that tackles both the saturation and sensitivity problem by reshaping the gradient computation, incorporating two well-established and provably approaches: Expected Gradients and kernel smoothing. By revisiting the original formulation as the smoothed expectation of the perturbed integrated gradients, one can concurrently construct more faithful, localized and robust explanations which minimize infidelity. Through fine modulation of the perturbation distribution it is possible to regulate the complexity characteristic of the explanation, selectively discriminating stable features. Our technique, Expected Grad-CAM, differently from recent works, exclusively optimizes the gradient computation, purposefully designed as an enhanced substitute of the foundational Grad-CAM algorithm and any method built therefrom. Quantitative and qualitative evaluations have been conducted to assess the effectiveness of our method.
△ Less
Submitted 25 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Deep Network Pruning: A Comparative Study on CNNs in Face Recognition
Authors:
Fernando Alonso-Fernandez,
Kevin Hernandez-Diaz,
Jose Maria Buades Rubio,
Prayag Tiwari,
Josef Bigun
Abstract:
The widespread use of mobile devices for all kind of transactions makes necessary reliable and real-time identity authentication, leading to the adoption of face recognition (FR) via the cameras embedded in such devices. Progress of deep Convolutional Neural Networks (CNNs) has provided substantial advances in FR. Nonetheless, the size of state-of-the-art architectures is unsuitable for mobile dep…
▽ More
The widespread use of mobile devices for all kind of transactions makes necessary reliable and real-time identity authentication, leading to the adoption of face recognition (FR) via the cameras embedded in such devices. Progress of deep Convolutional Neural Networks (CNNs) has provided substantial advances in FR. Nonetheless, the size of state-of-the-art architectures is unsuitable for mobile deployment, since they often encompass hundreds of megabytes and millions of parameters. We address this by studying methods for deep network compression applied to FR. In particular, we apply network pruning based on Taylor scores, where less important filters are removed iteratively. The method is tested on three networks based on the small SqueezeNet (1.24M parameters) and the popular MobileNetv2 (3.5M) and ResNet50 (23.5M) architectures. These have been selected to showcase the method on CNNs with different complexities and sizes. We observe that a substantial percentage of filters can be removed with minimal performance loss. Also, filters with the highest amount of output channels tend to be removed first, suggesting that high-dimensional spaces within popular CNNs are over-dimensionated.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Mixture of Experts Using Tensor Products
Authors:
Zhan Su,
Fengran Mo,
Prayag Tiwari,
Benyou Wang,
Jian-Yun Nie,
Jakob Grue Simonsen
Abstract:
In multi-task learning, the conventional approach involves training a model on multiple tasks simultaneously. However, the training signals from different tasks can interfere with one another, potentially leading to \textit{negative transfer}. To mitigate this, we investigate if modular language models can facilitate positive transfer and systematic generalization. Specifically, we propose a novel…
▽ More
In multi-task learning, the conventional approach involves training a model on multiple tasks simultaneously. However, the training signals from different tasks can interfere with one another, potentially leading to \textit{negative transfer}. To mitigate this, we investigate if modular language models can facilitate positive transfer and systematic generalization. Specifically, we propose a novel modular language model (\texttt{TensorPoly}), that balances parameter efficiency with nuanced routing methods. For \textit{modules}, we reparameterize Low-Rank Adaptation (\texttt{LoRA}) by employing an entangled tensor through the use of tensor product operations and name the resulting approach \texttt{TLoRA}. For \textit{routing function}, we tailor two innovative routing functions according to the granularity: \texttt{TensorPoly-I} which directs to each rank within the entangled tensor while \texttt{TensorPoly-II} offers a finer-grained routing approach targeting each order of the entangled tensor. The experimental results from the multi-task T0-benchmark demonstrate that: 1) all modular LMs surpass the corresponding dense approaches, highlighting the potential of modular language models to mitigate negative inference in multi-task learning and deliver superior outcomes. 2) \texttt{TensorPoly-I} achieves higher parameter efficiency in adaptation and outperforms other modular LMs, which shows the potential of our approach in multi-task transfer learning.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning
Authors:
Zheyuan Zhang,
Elif Keles,
Gorkem Durak,
Yavuz Taktak,
Onkar Susladkar,
Vandan Gorade,
Debesh Jha,
Asli C. Ormeci,
Alpay Medetalibeyoglu,
Lanhong Yao,
Bin Wang,
Ilkin Sevgi Isler,
Linkai Peng,
Hongyi Pan,
Camila Lopes Vendrami,
Amir Bourhani,
Yury Velichko,
Boqing Gong,
Concetto Spampinato,
Ayis Pyrros,
Pallavi Tiwari,
Derk C. F. Klatte,
Megan Engels,
Sanne Hoogenboom,
Candice W. Bolan
, et al. (13 additional authors not shown)
Abstract:
Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective st…
▽ More
Automated volumetric segmentation of the pancreas on cross-sectional imaging is needed for diagnosis and follow-up of pancreatic diseases. While CT-based pancreatic segmentation is more established, MRI-based segmentation methods are understudied, largely due to a lack of publicly available datasets, benchmarking research efforts, and domain-specific deep learning methods. In this retrospective study, we collected a large dataset (767 scans from 499 participants) of T1-weighted (T1W) and T2-weighted (T2W) abdominal MRI series from five centers between March 2004 and November 2022. We also collected CT scans of 1,350 patients from publicly available sources for benchmarking purposes. We developed a new pancreas segmentation method, called PanSegNet, combining the strengths of nnUNet and a Transformer network with a new linear attention module enabling volumetric computation. We tested PanSegNet's accuracy in cross-modality (a total of 2,117 scans) and cross-center settings with Dice and Hausdorff distance (HD95) evaluation metrics. We used Cohen's kappa statistics for intra and inter-rater agreement evaluation and paired t-tests for volume and Dice comparisons, respectively. For segmentation accuracy, we achieved Dice coefficients of 88.3% (std: 7.2%, at case level) with CT, 85.0% (std: 7.9%) with T1W MRI, and 86.3% (std: 6.4%) with T2W MRI. There was a high correlation for pancreas volume prediction with R^2 of 0.91, 0.84, and 0.85 for CT, T1W, and T2W, respectively. We found moderate inter-observer (0.624 and 0.638 for T1W and T2W MRI, respectively) and high intra-observer agreement scores. All MRI data is made available at https://osf.io/kysnj/. Our source code is available at https://github.com/NUBagciLab/PaNSegNet.
△ Less
Submitted 25 May, 2024; v1 submitted 20 May, 2024;
originally announced May 2024.
-
Physics-incorporated Graph Neural Network for Multivariate Time Series Imputation
Authors:
Guojun Liang,
Prayag Tiwari,
Slawomir Nowaczyk,
Stefan Byttner
Abstract:
Exploring the missing values is an essential but challenging issue due to the complex latent spatio-temporal correlation and dynamic nature of time series. Owing to the outstanding performance in dealing with structure learning potentials, Graph Neural Networks (GNNs) and Recurrent Neural Networks (RNNs) are often used to capture such complex spatio-temporal features in multivariate time series. H…
▽ More
Exploring the missing values is an essential but challenging issue due to the complex latent spatio-temporal correlation and dynamic nature of time series. Owing to the outstanding performance in dealing with structure learning potentials, Graph Neural Networks (GNNs) and Recurrent Neural Networks (RNNs) are often used to capture such complex spatio-temporal features in multivariate time series. However, these data-driven models often fail to capture the essential spatio-temporal relationships when significant signal corruption occurs. Additionally, calculating the high-order neighbor nodes in these models is of high computational complexity. To address these problems, we propose a novel higher-order spatio-temporal physics-incorporated GNN (HSPGNN). Firstly, the dynamic Laplacian matrix can be obtained by the spatial attention mechanism. Then, the generic inhomogeneous partial differential equation (PDE) of physical dynamic systems is used to construct the dynamic higher-order spatio-temporal GNN to obtain the missing time series values. Moreover, we estimate the missing impact by Normalizing Flows (NF) to evaluate the importance of each node in the graph for better explainability. Experimental results on four benchmark datasets demonstrate the effectiveness of HSPGNN and the superior performance when combining various order neighbor nodes. Also, graph-like optical flow, dynamic graphs, and missing impact can be obtained naturally by HSPGNN, which provides better dynamic analysis and explanation than traditional data-driven models. Our code is available at https://github.com/gorgen2020/HSPGNN.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Is Your LLM Outdated? Evaluating LLMs at Temporal Generalization
Authors:
Chenghao Zhu,
Nuo Chen,
Yufei Gao,
Yunyi Zhang,
Prayag Tiwari,
Benyou Wang
Abstract:
The rapid advancement of Large Language Models (LLMs) highlights the urgent need for evolving evaluation methodologies that keep pace with improvements in language comprehension and information processing. However, traditional benchmarks, which are often static, fail to capture the continually changing information landscape, leading to a disparity between the perceived and actual effectiveness of…
▽ More
The rapid advancement of Large Language Models (LLMs) highlights the urgent need for evolving evaluation methodologies that keep pace with improvements in language comprehension and information processing. However, traditional benchmarks, which are often static, fail to capture the continually changing information landscape, leading to a disparity between the perceived and actual effectiveness of LLMs in ever-changing real-world scenarios. Our study examines temporal generalization, which includes the ability to understand, predict, and generate text relevant to past, present, and future contexts, revealing significant temporal biases in LLMs. We propose an evaluation framework, for dynamically generating benchmarks from recent real-world predictions. Experiments demonstrate that LLMs struggle with temporal generalization, showing performance decline over time. These findings highlight the necessity for improved training and updating processes to enhance adaptability and reduce biases. Our code, dataset and benchmark are available at https://github.com/FreedomIntelligence/FreshBench.
△ Less
Submitted 10 July, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Deep TOV to characterize Neutron Stars
Authors:
Praveer Tiwari,
Archana Pai
Abstract:
Astrophysical observations, theoretical models, and terrestrial experiments probe different regions of neutron star (NS) interior. Therefore, it is essential to consistently combine the information from these sources. This analysis requires multiple evaluations of Tolman Oppenheimer Volkoff equations which can become computationally expensive with a large number of observations. Further, multi-mes…
▽ More
Astrophysical observations, theoretical models, and terrestrial experiments probe different regions of neutron star (NS) interior. Therefore, it is essential to consistently combine the information from these sources. This analysis requires multiple evaluations of Tolman Oppenheimer Volkoff equations which can become computationally expensive with a large number of observations. Further, multi-messenger astronomy requires rapid NS characterization via gravitational waves for efficient electromagnetic follow-up. In this work, we develop a novel neural network-based map from the EoS curve to the mass and radius of cold non-rotating NS. We estimate a speed-up of an order of magnitude when compared with the state-of-the-art RePrimAnd solver and an average error of 1e-3 when calculating the mass and radius of the neutron star. Additionally, we also develop neural network solvers for obtaining EoS curves from a physics conforming EoS model, FRZ$χ_{1.5}$. We utilize this efficient continuous map to measure the sensitivity of model parameters of FRZ$χ_{1.5}$ towards mass and radius. We show that 8 out of 18 parameters of this model are sensitive by at least three orders of magnitude higher than the remaining 10 parameters. This information will be useful in further speeding up, as well as probing the crucial parameter space, in the parameter estimation from astrophysical observations using this physics-conforming EoS model.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
ResNCT: A Deep Learning Model for the Synthesis of Nephrographic Phase Images in CT Urography
Authors:
Syed Jamal Safdar Gardezi,
Lucas Aronson,
Peter Wawrzyn,
Hongkun Yu,
E. Jason Abel,
Daniel D. Shapiro,
Meghan G. Lubner,
Joshua Warner,
Giuseppe Toia,
Lu Mao,
Pallavi Tiwari,
Andrew L. Wentland
Abstract:
Purpose: To develop and evaluate a transformer-based deep learning model for the synthesis of nephrographic phase images in CT urography (CTU) examinations from the unenhanced and urographic phases.
Materials and Methods: This retrospective study was approved by the local Institutional Review Board. A dataset of 119 patients (mean $\pm$ SD age, 65 $\pm$ 12 years; 75/44 males/females) with three-…
▽ More
Purpose: To develop and evaluate a transformer-based deep learning model for the synthesis of nephrographic phase images in CT urography (CTU) examinations from the unenhanced and urographic phases.
Materials and Methods: This retrospective study was approved by the local Institutional Review Board. A dataset of 119 patients (mean $\pm$ SD age, 65 $\pm$ 12 years; 75/44 males/females) with three-phase CT urography studies was curated for deep learning model development. The three phases for each patient were aligned with an affine registration algorithm. A custom model, coined Residual transformer model for Nephrographic phase CT image synthesis (ResNCT), was developed and implemented with paired inputs of non-contrast and urographic sets of images trained to produce the nephrographic phase images, that were compared with the corresponding ground truth nephrographic phase images. The synthesized images were evaluated with multiple performance metrics, including peak signal to noise ratio (PSNR), structural similarity index (SSIM), normalized cross correlation coefficient (NCC), mean absolute error (MAE), and root mean squared error (RMSE).
Results: The ResNCT model successfully generated synthetic nephrographic images from non-contrast and urographic image inputs. With respect to ground truth nephrographic phase images, the images synthesized by the model achieved high PSNR (27.8 $\pm$ 2.7 dB), SSIM (0.88 $\pm$ 0.05), and NCC (0.98 $\pm$ 0.02), and low MAE (0.02 $\pm$ 0.005) and RMSE (0.042 $\pm$ 0.016).
Conclusion: The ResNCT model synthesized nephrographic phase CT images with high similarity to ground truth images. The ResNCT model provides a means of eliminating the acquisition of the nephrographic phase with a resultant 33% reduction in radiation dose for CTU examinations.
△ Less
Submitted 28 May, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
A hybrid source of quantum light for generation of frequency tunable Fock states
Authors:
Aleksa Krstić,
Priyanshu Tiwari,
Florian Höhe,
Frank Setzpfandt,
Ulf Peschel,
Joachim Ankerhold,
Sina Saravi
Abstract:
We propose a scheme for quantum-light generation in a nonlinear cavity hybridized with a 2-level system. We theoretically show that, when excited by a series of controlled pump pulses, the hybrid source can generate various Fock states with high probabilities, such as near-on-demand generation of 1- and 2-photon states, and above 50% probability for generation of Fock states with up to 7 photons.…
▽ More
We propose a scheme for quantum-light generation in a nonlinear cavity hybridized with a 2-level system. We theoretically show that, when excited by a series of controlled pump pulses, the hybrid source can generate various Fock states with high probabilities, such as near-on-demand generation of 1- and 2-photon states, and above 50% probability for generation of Fock states with up to 7 photons. More importantly, the tailorable nature of the nonlinear cavity and its pumping allows for generating Fock states with arbitrary frequencies, even with a fixed 2-level system, creating fundamentally new opportunities in all areas of quantum technologies.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Predicting Overtakes in Trucks Using CAN Data
Authors:
Talha Hanif Butt,
Prayag Tiwari,
Fernando Alonso-Fernandez
Abstract:
Safe overtakes in trucks are crucial to prevent accidents, reduce congestion, and ensure efficient traffic flow, making early prediction essential for timely and informed driving decisions. Accordingly, we investigate the detection of truck overtakes from CAN data. Three classifiers, Artificial Neural Networks (ANN), Random Forest, and Support Vector Machines (SVM), are employed for the task. Our…
▽ More
Safe overtakes in trucks are crucial to prevent accidents, reduce congestion, and ensure efficient traffic flow, making early prediction essential for timely and informed driving decisions. Accordingly, we investigate the detection of truck overtakes from CAN data. Three classifiers, Artificial Neural Networks (ANN), Random Forest, and Support Vector Machines (SVM), are employed for the task. Our analysis covers up to 10 seconds before the overtaking event, using an overlapping sliding window of 1 second to extract CAN features. We observe that the prediction scores of the overtake class tend to increase as we approach the overtake trigger, while the no-overtake class remain stable or oscillates depending on the classifier. Thus, the best accuracy is achieved when approaching the trigger, making early overtaking prediction challenging. The classifiers show good accuracy in classifying overtakes (Recall/TPR > 93%), but accuracy is suboptimal in classifying no-overtakes (TNR typically 80-90% and below 60% for one SVM variant). We further combine two classifiers (Random Forest and linear SVM) by averaging their output scores. The fusion is observed to improve no-overtake classification (TNR > 92%) at the expense of reducing overtake accuracy (TPR). However, the latter is kept above 91% near the overtake trigger. Therefore, the fusion balances TPR and TNR, providing more consistent performance than individual classifiers.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Pushing The Limit of LLM Capacity for Text Classification
Authors:
Yazhou Zhang,
Mengyao Wang,
Chenyu Ren,
Qiuchi Li,
Prayag Tiwari,
Benyou Wang,
Jing Qin
Abstract:
The value of text classification's future research has encountered challenges and uncertainties, due to the extraordinary efficacy demonstrated by large language models (LLMs) across numerous downstream NLP tasks. In this era of open-ended language modeling, where task boundaries are gradually fading, an urgent question emerges: have we made significant advances in text classification under the fu…
▽ More
The value of text classification's future research has encountered challenges and uncertainties, due to the extraordinary efficacy demonstrated by large language models (LLMs) across numerous downstream NLP tasks. In this era of open-ended language modeling, where task boundaries are gradually fading, an urgent question emerges: have we made significant advances in text classification under the full benefit of LLMs? To answer this question, we propose RGPT, an adaptive boosting framework tailored to produce a specialized text classification LLM by recurrently ensembling a pool of strong base learners. The base learners are constructed by adaptively adjusting the distribution of training samples and iteratively fine-tuning LLMs with them. Such base learners are then ensembled to be a specialized text classification LLM, by recurrently incorporating the historical predictions from the previous learners. Through a comprehensive empirical comparison, we show that RGPT significantly outperforms 8 SOTA PLMs and 7 SOTA LLMs on four benchmarks by 1.36% on average. Further evaluation experiments show a clear surpassing of RGPT over human classification.
△ Less
Submitted 16 February, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Climate Change from Large Language Models
Authors:
Hongyin Zhu,
Prayag Tiwari
Abstract:
Climate change poses grave challenges, demanding widespread understanding and low-carbon lifestyle awareness. Large language models (LLMs) offer a powerful tool to address this crisis, yet comprehensive evaluations of their climate-crisis knowledge are lacking. This paper proposes an automated evaluation framework to assess climate-crisis knowledge within LLMs. We adopt a hybrid approach for data…
▽ More
Climate change poses grave challenges, demanding widespread understanding and low-carbon lifestyle awareness. Large language models (LLMs) offer a powerful tool to address this crisis, yet comprehensive evaluations of their climate-crisis knowledge are lacking. This paper proposes an automated evaluation framework to assess climate-crisis knowledge within LLMs. We adopt a hybrid approach for data acquisition, combining data synthesis and manual collection, to compile a diverse set of questions encompassing various aspects of climate change. Utilizing prompt engineering based on the compiled questions, we evaluate the model's knowledge by analyzing its generated answers. Furthermore, we introduce a comprehensive set of metrics to assess climate-crisis knowledge, encompassing indicators from 10 distinct perspectives. These metrics provide a multifaceted evaluation, enabling a nuanced understanding of the LLMs' climate crisis comprehension. The experimental results demonstrate the efficacy of our proposed method. In our evaluation utilizing diverse high-performing LLMs, we discovered that while LLMs possess considerable climate-related knowledge, there are shortcomings in terms of timeliness, indicating a need for continuous updating and refinement of their climate-related content.
△ Less
Submitted 1 July, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Topological Thermal Hall Conductance of Even Denominator Fractional States
Authors:
Arup Kumar Paul,
Priya Tiwari,
Ron Melcer,
Vladimir Umansky,
Moty Heiblum
Abstract:
The even denominator fractional quantum Hall (FQH) states $ν=5/2$ and $ν=7/2$, have been long predicted to host non-abelian quasiparticles (QPs). The presence of energy-carrying neutral modes cripples customary conductance measurements and thus motivates thermal transport measurements, which already proved to be sensitive to all energy-carrying modes. Each state has a different capacity to carry q…
▽ More
The even denominator fractional quantum Hall (FQH) states $ν=5/2$ and $ν=7/2$, have been long predicted to host non-abelian quasiparticles (QPs). The presence of energy-carrying neutral modes cripples customary conductance measurements and thus motivates thermal transport measurements, which already proved to be sensitive to all energy-carrying modes. Each state has a different capacity to carry quanta of heat - as expressed by the so-called: 'central charge' - identifying the state's topological order. While the 'two-terminal' thermal conductance measurements identified the topological orders of abelian and non-abelian QH states, they are prone to partial thermal equilibration among counter-propagating modes. Here, we report a 'four-terminal' thermal Hall conductance measurement, which separately measures the heat carried by the downstream and upstream chiral modes. This measurement is insensitive to thermal equilibration among modes. We verify that the $ν=5/2$ and $ν=7/2$ states are non-abelian, supporting a single upstream Majorana mode, thus obeying the Particle-Hole Pfaffian topological order. While current numerical works predict a different central charge, this contribution should motivate further theoretical work.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Occluded Person Re-Identification with Deep Learning: A Survey and Perspectives
Authors:
Enhao Ning,
Changshuo Wang,
Huang Zhangc,
Xin Ning,
Prayag Tiwari
Abstract:
Person re-identification (Re-ID) technology plays an increasingly crucial role in intelligent surveillance systems. Widespread occlusion significantly impacts the performance of person Re-ID. Occluded person Re-ID refers to a pedestrian matching method that deals with challenges such as pedestrian information loss, noise interference, and perspective misalignment. It has garnered extensive attenti…
▽ More
Person re-identification (Re-ID) technology plays an increasingly crucial role in intelligent surveillance systems. Widespread occlusion significantly impacts the performance of person Re-ID. Occluded person Re-ID refers to a pedestrian matching method that deals with challenges such as pedestrian information loss, noise interference, and perspective misalignment. It has garnered extensive attention from researchers. Over the past few years, several occlusion-solving person Re-ID methods have been proposed, tackling various sub-problems arising from occlusion. However, there is a lack of comprehensive studies that compare, summarize, and evaluate the potential of occluded person Re-ID methods in detail. In this review, we start by providing a detailed overview of the datasets and evaluation scheme used for occluded person Re-ID. Next, we scientifically classify and analyze existing deep learning-based occluded person Re-ID methods from various perspectives, summarizing them concisely. Furthermore, we conduct a systematic comparison among these methods, identify the state-of-the-art approaches, and present an outlook on the future development of occluded person Re-ID.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Towards Subject Agnostic Affective Emotion Recognition
Authors:
Amit Kumar Jaiswal,
Haiming Liu,
Prayag Tiwari
Abstract:
This paper focuses on affective emotion recognition, aiming to perform in the subject-agnostic paradigm based on EEG signals. However, EEG signals manifest subject instability in subject-agnostic affective Brain-computer interfaces (aBCIs), which led to the problem of distributional shift. Furthermore, this problem is alleviated by approaches such as domain generalisation and domain adaptation. Ty…
▽ More
This paper focuses on affective emotion recognition, aiming to perform in the subject-agnostic paradigm based on EEG signals. However, EEG signals manifest subject instability in subject-agnostic affective Brain-computer interfaces (aBCIs), which led to the problem of distributional shift. Furthermore, this problem is alleviated by approaches such as domain generalisation and domain adaptation. Typically, methods based on domain adaptation confer comparatively better results than the domain generalisation methods but demand more computational resources given new subjects. We propose a novel framework, meta-learning based augmented domain adaptation for subject-agnostic aBCIs. Our domain adaptation approach is augmented through meta-learning, which consists of a recurrent neural network, a classifier, and a distributional shift controller based on a sum-decomposable function. Also, we present that a neural network explicating a sum-decomposable function can effectively estimate the divergence between varied domains. The network setting for augmented domain adaptation follows meta-learning and adversarial learning, where the controller promptly adapts to new domains employing the target data via a few self-adaptation steps in the test phase. Our proposed approach is shown to be effective in experiments on a public aBICs dataset and achieves similar performance to state-of-the-art domain adaptation methods while avoiding the use of additional computational resources.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations
Authors:
Yazhou Zhang,
Mengyao Wang,
Youxi Wu,
Prayag Tiwari,
Qiuchi Li,
Benyou Wang,
Jing Qin
Abstract:
Large language models (LLMs) and their variants have shown extraordinary efficacy across numerous downstream natural language processing (NLP) tasks, which has presented a new vision for the development of NLP. Despite their remarkable performance in natural language generating (NLG), LLMs lack a distinct focus on the emotion understanding domain. As a result, using LLMs for emotion recognition ma…
▽ More
Large language models (LLMs) and their variants have shown extraordinary efficacy across numerous downstream natural language processing (NLP) tasks, which has presented a new vision for the development of NLP. Despite their remarkable performance in natural language generating (NLG), LLMs lack a distinct focus on the emotion understanding domain. As a result, using LLMs for emotion recognition may lead to suboptimal and inadequate precision. Another limitation of LLMs is that they are typical trained without leveraging multi-modal information. To overcome these limitations, we propose DialogueLLM, a context and emotion knowledge tuned LLM that is obtained by fine-tuning LLaMA models with 13,638 multi-modal (i.e., texts and videos) emotional dialogues. The visual information is considered as the supplementary knowledge to construct high-quality instructions. We offer a comprehensive evaluation of our proposed model on three benchmarking emotion recognition in conversations (ERC) datasets and compare the results against the SOTA baselines and other SOTA LLMs. Additionally, DialogueLLM-7B can be easily trained using LoRA on a 40GB A100 GPU in 5 hours, facilitating reproducibility for other researchers.
△ Less
Submitted 16 January, 2024; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Cosmology from LOFAR Two-metre Sky Survey Data Release 2: Cross-correlation with the cosmic microwave background
Authors:
S. J. Nakoneczny,
D. Alonso,
M. Bilicki,
D. J. Schwarz,
C. L. Hale,
A. Pollo,
C. Heneka,
P. Tiwari,
J. Zheng,
M. Brüggen,
M. J. Jarvis,
T. W. Shimwell
Abstract:
We combine the LOw-Frequency ARray (LOFAR) Two-metre Sky Survey (LoTSS) second data release (DR2) catalogue with gravitational lensing maps from the Cosmic Microwave Background (CMB) to place constraints on the bias evolution of LoTSS radio galaxies, and on the amplitude of matter perturbations. We construct a flux-limited catalogue, and analyse its harmonic-space cross-correlation with CMB lensin…
▽ More
We combine the LOw-Frequency ARray (LOFAR) Two-metre Sky Survey (LoTSS) second data release (DR2) catalogue with gravitational lensing maps from the Cosmic Microwave Background (CMB) to place constraints on the bias evolution of LoTSS radio galaxies, and on the amplitude of matter perturbations. We construct a flux-limited catalogue, and analyse its harmonic-space cross-correlation with CMB lensing maps from Planck, $C_\ell^{gκ}$, as well as its auto-correlation, $C_\ell^{gg}$. We explore the models describing the redshift evolution of the large-scale radio galaxy bias, discriminating between them through the combination of both $C_\ell^{gκ}$ and $C_\ell^{gg}$. Fixing the bias evolution, we then use these data to place constraints on the amplitude of large scale density fluctuations. We report the significance of the $C_\ell^{gκ}$ signal at a level of $26.6σ$. We determine that a linear bias evolution of the form $b_g(z) = b_{g,D} / D(z)$, where $D(z)$ is the growth rate, is able to provide a good description of the data, and measure $b_{g,D} = 1.41 \pm 0.06$ for a sample flux-limited at $1.5\,{\rm mJy}$, for scales $\ell < 250$ for $C_\ell^{gg}$, and $\ell < 500$ for $C_\ell^{gκ}$. At the sample's median redshift, we obtain $b(z = 0.82) = 2.34 \pm 0.10$. Using $σ_8$ as a free parameter, while keeping other cosmological parameters fixed to the Planck values, we find fluctuations of $σ_8 = 0.75^{+0.05}_{-0.04}$. The result is in agreement with weak lensing surveys, and at $1σ$ difference with Planck CMB constraints. We also attempt to detect the late-time integrated Sachs-Wolfe effect with LOFAR, but with the current sky coverage, the cross-correlation with CMB temperature maps is consistent with zero. Our results are an important step towards constraining cosmology with radio continuum surveys from LOFAR and other future large radio surveys.
△ Less
Submitted 15 May, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Cosmology from LOFAR Two-metre Sky Survey Data Release 2: Angular Clustering of Radio Sources
Authors:
C. L. Hale,
D. J. Schwarz,
P. N. Best,
S. J. Nakoneczny,
D. Alonso,
D. Bacon,
L. Böhme,
N. Bhardwaj,
M. Bilicki,
S. Camera,
C. S. Heneka,
M. Pashapour-Ahmadabadi,
P. Tiwari,
J. Zheng,
K. J. Duncan,
M. J. Jarvis,
R. Kondapally,
M. Magliocchetti,
H. J. A. Rottgering,
T. W. Shimwell
Abstract:
Covering $\sim$5600 deg$^2$ to rms sensitivities of $\sim$70$-$100 $μ$Jy beam$^{-1}$, the LOFAR Two-metre Sky Survey Data Release 2 (LoTSS-DR2) provides the largest low-frequency ($\sim$150 MHz) radio catalogue to date, making it an excellent tool for large-area radio cosmology studies. In this work, we use LoTSS-DR2 sources to investigate the angular two-point correlation function of galaxies wit…
▽ More
Covering $\sim$5600 deg$^2$ to rms sensitivities of $\sim$70$-$100 $μ$Jy beam$^{-1}$, the LOFAR Two-metre Sky Survey Data Release 2 (LoTSS-DR2) provides the largest low-frequency ($\sim$150 MHz) radio catalogue to date, making it an excellent tool for large-area radio cosmology studies. In this work, we use LoTSS-DR2 sources to investigate the angular two-point correlation function of galaxies within the survey. We discuss systematics in the data and an improved methodology for generating random catalogues, compared to that used for LoTSS-DR1, before presenting the angular clustering for $\sim$900,000 sources $\geq$$1.5$ mJy and a peak signal-to-noise $\geq$$7.5$ across $\sim$$80\%$ of the observed area. Using the clustering we infer the bias assuming two evolutionary models. When fitting {angular scales of $0.5 \leqθ<5\,°$, using a linear bias model, we find LoTSS-DR2 sources are biased tracers of the underlying matter, with a bias of $b_{C}= 2.14^{+0.22}_{-0.20}$ (assuming constant bias) and $b_{E}(z=0)= 1.79^{+0.15}_{-0.14}$ (for an evolving model, inversely proportional to the growth factor), corresponding to $b_E= 2.81^{+0.24}_{-0.22}$ at the median redshift of our sample, assuming the LoTSS Deep Fields redshift distribution is representative of our data. This reduces to $b_{C}= 2.02^{+0.17}_{-0.16}$ and $b_{E}(z=0)= 1.67^{+0.12}_{-0.12}$ when allowing preferential redshift distributions from the Deep Fields to model our data. Whilst the clustering amplitude is slightly lower than LoTSS-DR1 ($\geq$2 mJy), our study benefits from larger samples and improved redshift estimates.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
DyFFPAD: Dynamic Fusion of Convolutional and Handcrafted Features for Fingerprint Presentation Attack Detection
Authors:
Anuj Rai,
Parsheel Kumar Tiwari,
Jyotishna Baishya,
Ram Prakash Sharma,
Somnath Dey
Abstract:
Automatic fingerprint recognition systems suffer from the threat of presentation attacks due to their wide range of applications in areas including national borders and commercial applications. Presentation attacks can be performed by fabricating the fake fingerprint of a user with or without the intention of the subject. This paper presents a dynamic ensemble of deep learning and handcrafted feat…
▽ More
Automatic fingerprint recognition systems suffer from the threat of presentation attacks due to their wide range of applications in areas including national borders and commercial applications. Presentation attacks can be performed by fabricating the fake fingerprint of a user with or without the intention of the subject. This paper presents a dynamic ensemble of deep learning and handcrafted features to detect presentation attacks in known-material and unknown-material protocols. The proposed model is a dynamic ensemble of deep CNN and handcrafted features empowered deep neural networks both of which learn their parameters together. The proposed presentation attack detection model, in this way, utilizes the capabilities of both classification techniques and exhibits better performance than their individual results. The proposed model's performance is validated using benchmark LivDet 2015, 2017, and 2019 databases, with an overall accuracy of 96.10\%, 96.49\%, and 95.99\% attained on them, respectively. The proposed model outperforms state-of-the-art methods in benchmark protocols of presentation attack detection in terms of classification accuracy.
△ Less
Submitted 19 August, 2023;
originally announced August 2023.
-
VISU at WASSA 2023 Shared Task: Detecting Emotions in Reaction to News Stories Leveraging BERT and Stacked Embeddings
Authors:
Vivek Kumar,
Sushmita Singh,
Prayag Tiwari
Abstract:
Our system, VISU, participated in the WASSA 2023 Shared Task (3) of Emotion Classification from essays written in reaction to news articles. Emotion detection from complex dialogues is challenging and often requires context/domain understanding. Therefore in this research, we have focused on developing deep learning (DL) models using the combination of word embedding representations with tailored…
▽ More
Our system, VISU, participated in the WASSA 2023 Shared Task (3) of Emotion Classification from essays written in reaction to news articles. Emotion detection from complex dialogues is challenging and often requires context/domain understanding. Therefore in this research, we have focused on developing deep learning (DL) models using the combination of word embedding representations with tailored prepossessing strategies to capture the nuances of emotions expressed. Our experiments used static and contextual embeddings (individual and stacked) with Bidirectional Long short-term memory (BiLSTM) and Transformer based models. We occupied rank tenth in the emotion detection task by scoring a Macro F1-Score of 0.2717, validating the efficacy of our implemented approaches for small and imbalanced datasets with mixed categories of target emotions.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
An Explainable Model-Agnostic Algorithm for CNN-based Biometrics Verification
Authors:
Fernando Alonso-Fernandez,
Kevin Hernandez-Diaz,
Jose M. Buades,
Prayag Tiwari,
Josef Bigun
Abstract:
This paper describes an adaptation of the Local Interpretable Model-Agnostic Explanations (LIME) AI method to operate under a biometric verification setting. LIME was initially proposed for networks with the same output classes used for training, and it employs the softmax probability to determine which regions of the image contribute the most to classification. However, in a verification setting,…
▽ More
This paper describes an adaptation of the Local Interpretable Model-Agnostic Explanations (LIME) AI method to operate under a biometric verification setting. LIME was initially proposed for networks with the same output classes used for training, and it employs the softmax probability to determine which regions of the image contribute the most to classification. However, in a verification setting, the classes to be recognized have not been seen during training. In addition, instead of using the softmax output, face descriptors are usually obtained from a layer before the classification layer. The model is adapted to achieve explainability via cosine similarity between feature vectors of perturbated versions of the input image. The method is showcased for face biometrics with two CNN models based on MobileNetv2 and ResNet50.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Framework for Multi-messenger Inference from Neutron Stars: Combining Nuclear Theory Priors
Authors:
Praveer Tiwari,
Dake Zhou,
Bhaskar Biswas,
Michael McNeil Forbes,
Sukanta Bose
Abstract:
We construct an efficient parameterization of the pure neutron-matter equation of state (EoS) that incorporates the uncertainties from both chiral effective field theory ($χ$EFT) and phenomenological potential calculations. This parameterization yields a family of EoSs including and extending the forms based purely on these two calculations. In combination with an agnostic inner core EoS, this par…
▽ More
We construct an efficient parameterization of the pure neutron-matter equation of state (EoS) that incorporates the uncertainties from both chiral effective field theory ($χ$EFT) and phenomenological potential calculations. This parameterization yields a family of EoSs including and extending the forms based purely on these two calculations. In combination with an agnostic inner core EoS, this parameterization is used in a Bayesian inference pipeline to obtain constraints on the e os parameters using multi-messenger observations of neutron stars. We specifically considered observations of the massive pulsar J0740+6620, the binary neutron star coalescence GW170817, and the NICER pulsar J0030+0451. Constraints on neutron star mass-radius relations are obtained and compared. The Bayes factors for the different EoS models are also computed. While current constraints do not reveal any significant preference among these models, the framework developed here may enable future observations with more sensitive detectors to discriminate them.
△ Less
Submitted 25 June, 2024; v1 submitted 7 June, 2023;
originally announced June 2023.
-
We never go out of Style: Motion Disentanglement by Subspace Decomposition of Latent Space
Authors:
Rishubh Parihar,
Raghav Magazine,
Piyush Tiwari,
R. Venkatesh Babu
Abstract:
Real-world objects perform complex motions that involve multiple independent motion components. For example, while talking, a person continuously changes their expressions, head, and body pose. In this work, we propose a novel method to decompose motion in videos by using a pretrained image GAN model. We discover disentangled motion subspaces in the latent space of widely used style-based GAN mode…
▽ More
Real-world objects perform complex motions that involve multiple independent motion components. For example, while talking, a person continuously changes their expressions, head, and body pose. In this work, we propose a novel method to decompose motion in videos by using a pretrained image GAN model. We discover disentangled motion subspaces in the latent space of widely used style-based GAN models that are semantically meaningful and control a single explainable motion component. The proposed method uses only a few $(\approx10)$ ground truth video sequences to obtain such subspaces. We extensively evaluate the disentanglement properties of motion subspaces on face and car datasets, quantitatively and qualitatively. Further, we present results for multiple downstream tasks such as motion editing, and selective motion transfer, e.g. transferring only facial expressions without training for it.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Dynamic Causal Explanation Based Diffusion-Variational Graph Neural Network for Spatio-temporal Forecasting
Authors:
Guojun Liang,
Prayag Tiwari,
Sławomir Nowaczyk,
Stefan Byttner,
Fernando Alonso-Fernandez
Abstract:
Graph neural networks (GNNs), especially dynamic GNNs, have become a research hotspot in spatio-temporal forecasting problems. While many dynamic graph construction methods have been developed, relatively few of them explore the causal relationship between neighbour nodes. Thus, the resulting models lack strong explainability for the causal relationship between the neighbour nodes of the dynamical…
▽ More
Graph neural networks (GNNs), especially dynamic GNNs, have become a research hotspot in spatio-temporal forecasting problems. While many dynamic graph construction methods have been developed, relatively few of them explore the causal relationship between neighbour nodes. Thus, the resulting models lack strong explainability for the causal relationship between the neighbour nodes of the dynamically generated graphs, which can easily lead to a risk in subsequent decisions. Moreover, few of them consider the uncertainty and noise of dynamic graphs based on the time series datasets, which are ubiquitous in real-world graph structure networks. In this paper, we propose a novel Dynamic Diffusion-Variational Graph Neural Network (DVGNN) for spatio-temporal forecasting. For dynamic graph construction, an unsupervised generative model is devised. Two layers of graph convolutional network (GCN) are applied to calculate the posterior distribution of the latent node embeddings in the encoder stage. Then, a diffusion model is used to infer the dynamic link probability and reconstruct causal graphs in the decoder stage adaptively. The new loss function is derived theoretically, and the reparameterization trick is adopted in estimating the probability distribution of the dynamic graphs by Evidence Lower Bound during the backpropagation period. After obtaining the generated graphs, dynamic GCN and temporal attention are applied to predict future states. Experiments are conducted on four real-world datasets of different graph structures in different domains. The results demonstrate that the proposed DVGNN model outperforms state-of-the-art approaches and achieves outstanding Root Mean Squared Error result while exhibiting higher robustness. Also, by F1-score and probability distribution analysis, we demonstrate that DVGNN better reflects the causal relationship and uncertainty of dynamic graphs.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Robust Model Predictive Techno-Economic Control of Active Distribution Networks
Authors:
Salish Maharjan,
Prashant Tiwari,
Rui Cheng,
Zhaoyu Wang
Abstract:
Stochastic controllers are perceived as a promising solution for techno-economic operation of distribution networks having higher generation uncertainties at large penetration of renewables. These controllers are supported by forecasters capable of predicting generation uncertainty by means of lower/upper bounds rather than by probability density function (PDF). Hence, the stochastic controller as…
▽ More
Stochastic controllers are perceived as a promising solution for techno-economic operation of distribution networks having higher generation uncertainties at large penetration of renewables. These controllers are supported by forecasters capable of predicting generation uncertainty by means of lower/upper bounds rather than by probability density function (PDF). Hence, the stochastic controller assumes a suitable PDF for scenario creation and optimization, requiring validation of the assumption. To effectively bridge the forecaster's capability and resolve the assumption issues, the paper proposes a robust model prediction-based techno-economic controller, which essentially utilizes only the lower/upper bounds of the forecast, eliminating the necessity of PDF. Both discrete and continuous control resources such as tap-changers and DERs are utilized for regulating the lower/upper bounds of the network states and robustly minimizing the cost of energy import. The proposed controller is implemented for UKGDS network and validated by comparing performance at various confidence levels of lower/upper bound forecast.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
A Novel Deep Learning based Model for Erythrocytes Classification and Quantification in Sickle Cell Disease
Authors:
Manish Bhatia,
Balram Meena,
Vipin Kumar Rathi,
Prayag Tiwari,
Amit Kumar Jaiswal,
Shagaf M Ansari,
Ajay Kumar,
Pekka Marttinen
Abstract:
The shape of erythrocytes or red blood cells is altered in several pathological conditions. Therefore, identifying and quantifying different erythrocyte shapes can help diagnose various diseases and assist in designing a treatment strategy. Machine Learning (ML) can be efficiently used to identify and quantify distorted erythrocyte morphologies. In this paper, we proposed a customized deep convolu…
▽ More
The shape of erythrocytes or red blood cells is altered in several pathological conditions. Therefore, identifying and quantifying different erythrocyte shapes can help diagnose various diseases and assist in designing a treatment strategy. Machine Learning (ML) can be efficiently used to identify and quantify distorted erythrocyte morphologies. In this paper, we proposed a customized deep convolutional neural network (CNN) model to classify and quantify the distorted and normal morphology of erythrocytes from the images taken from the blood samples of patients suffering from Sickle cell disease ( SCD). We chose SCD as a model disease condition due to the presence of diverse erythrocyte morphologies in the blood samples of SCD patients. For the analysis, we used 428 raw microscopic images of SCD blood samples and generated the dataset consisting of 10, 377 single-cell images. We focused on three well-defined erythrocyte shapes, including discocytes, oval, and sickle. We used 18 layered deep CNN architecture to identify and quantify these shapes with 81% accuracy, outperforming other models. We also used SHAP and LIME for further interpretability. The proposed model can be helpful for the quick and accurate analysis of SCD blood samples by the clinicians and help them make the right decision for better management of SCD.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Huatuo-26M, a Large-scale Chinese Medical QA Dataset
Authors:
Jianquan Li,
Xidong Wang,
Xiangbo Wu,
Zhiyi Zhang,
Xiaolong Xu,
Jie Fu,
Prayag Tiwari,
Xiang Wan,
Benyou Wang
Abstract:
In this paper, we release a largest ever medical Question Answering (QA) dataset with 26 million QA pairs. We benchmark many existing approaches in our dataset in terms of both retrieval and generation. Experimental results show that the existing models perform far lower than expected and the released dataset is still challenging in the pre-trained language model era. Moreover, we also experimenta…
▽ More
In this paper, we release a largest ever medical Question Answering (QA) dataset with 26 million QA pairs. We benchmark many existing approaches in our dataset in terms of both retrieval and generation. Experimental results show that the existing models perform far lower than expected and the released dataset is still challenging in the pre-trained language model era. Moreover, we also experimentally show the benefit of the proposed dataset in many aspects: (i) trained models for other QA datasets in a zero-shot fashion; and (ii) as external knowledge for retrieval-augmented generation (RAG); and (iii) improving existing pre-trained language models by using the QA pairs as a pre-training corpus in continued training manner. We believe that this dataset will not only contribute to medical research but also facilitate both the patients and clinical doctors. See \url{https://github.com/FreedomIntelligence/Huatuo-26M}.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Automatized marine vessel monitoring from sentinel-1 data using convolution neural network
Authors:
Surya Prakash Tiwari,
Sudhir Kumar Chaturvedi,
Subhrangshu Adhikary,
Saikat Banerjee,
Sourav Basu
Abstract:
The advancement of multi-channel synthetic aperture radar (SAR) system is considered as an upgraded technology for surveillance activities. SAR sensors onboard provide data for coastal ocean surveillance and a view of the oceanic surface features. Vessel monitoring has earlier been performed using Constant False Alarm Rate (CFAR) algorithm which is not a smart technique as it lacks decision-making…
▽ More
The advancement of multi-channel synthetic aperture radar (SAR) system is considered as an upgraded technology for surveillance activities. SAR sensors onboard provide data for coastal ocean surveillance and a view of the oceanic surface features. Vessel monitoring has earlier been performed using Constant False Alarm Rate (CFAR) algorithm which is not a smart technique as it lacks decision-making capabilities, therefore we introduce wavelet transformation-based Convolution Neural Network approach to recognize objects from SAR images during the heavy naval traffic, which corresponds to the numerous object detection. The utilized information comprises Sentinel-1 SAR-C dual-polarization data acquisitions over the western coastal zones of India and with help of the proposed technique we have obtained 95.46% detection accuracy. Utilizing this model can automatize the monitoring of naval objects and recognition of foreign maritime intruders.
△ Less
Submitted 23 April, 2023;
originally announced April 2023.
-
A Survey on Few-Shot Class-Incremental Learning
Authors:
Songsong Tian,
Lusi Li,
Weijun Li,
Hang Ran,
Xin Ning,
Prayag Tiwari
Abstract:
Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup easily leads to catastrophic forgetting and overfitting problems, severely affecting model p…
▽ More
Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup easily leads to catastrophic forgetting and overfitting problems, severely affecting model performance. Studying FSCIL helps overcome deep learning model limitations on data volume and acquisition time, while improving practicality and adaptability of machine learning models. This paper provides a comprehensive survey on FSCIL. Unlike previous surveys, we aim to synthesize few-shot learning and incremental learning, focusing on introducing FSCIL from two perspectives, while reviewing over 30 theoretical research studies and more than 20 applied research studies. From the theoretical perspective, we provide a novel categorization approach that divides the field into five subcategories, including traditional machine learning methods, meta-learning based methods, feature and feature space-based methods, replay-based methods, and dynamic network structure-based methods. We also evaluate the performance of recent theoretical research on benchmark datasets of FSCIL. From the application perspective, FSCIL has achieved impressive achievements in various fields of computer vision such as image classification, object detection, and image segmentation, as well as in natural language processing and graph. We summarize the important applications. Finally, we point out potential future research directions, including applications, problem setups, and theory development. Overall, this paper offers a comprehensive analysis of the latest advances in FSCIL from a methodological, performance, and application perspective.
△ Less
Submitted 23 October, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Higher-order Bragg gaps in the electronic band structure of bilayer graphene renormalized by recursive supermoiré potential
Authors:
Mohit Kumar Jat,
Priya Tiwari,
Robin Bajaj,
Ishita Shitut,
Shinjan Mandal,
Kenji Watanabe,
Takashi Taniguchi,
H. R. Krishnamurthy,
Manish Jain,
Aveek Bid
Abstract:
This letter presents our findings on the recursive band gap engineering of chiral fermions in bilayer graphene doubly aligned with hBN. By utilizing two interfering moiré potentials, we generate a supermoiré pattern which renormalizes the electronic bands of the pristine bilayer graphene, resulting in higher-order fractal gaps even at very low energies. These Bragg gaps can be mapped using a uniqu…
▽ More
This letter presents our findings on the recursive band gap engineering of chiral fermions in bilayer graphene doubly aligned with hBN. By utilizing two interfering moiré potentials, we generate a supermoiré pattern which renormalizes the electronic bands of the pristine bilayer graphene, resulting in higher-order fractal gaps even at very low energies. These Bragg gaps can be mapped using a unique linear combination of periodic areas within the system. To validate our findings, we used electronic transport measurements to identify the position of these gaps as functions of the carrier density and establish their agreement with the predicted carrier densities and corresponding quantum numbers obtained using the continuum model. Our work provides direct experimental evidence of the quantization of the area of quasi-Brillouin zones in supermoiré systems. It fills essential gaps in understanding the band structure engineering of Dirac fermions by a recursive doubly periodic superlattice potential.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Natural Language Reasoning, A Survey
Authors:
Fei Yu,
Hongbo Zhang,
Prayag Tiwari,
Benyou Wang
Abstract:
This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically. Conceptually, we provide a distinct definition for natural language reasoning in NLP, based on both philosophy and NLP scenarios, discuss what types of tasks require reasoning, and introduce a taxonomy of reasoning. Practically, we conduct a…
▽ More
This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically. Conceptually, we provide a distinct definition for natural language reasoning in NLP, based on both philosophy and NLP scenarios, discuss what types of tasks require reasoning, and introduce a taxonomy of reasoning. Practically, we conduct a comprehensive literature review on natural language reasoning in NLP, mainly covering classical logical reasoning, natural language inference, multi-hop question answering, and commonsense reasoning. The paper also identifies and views backward reasoning, a powerful paradigm for multi-step reasoning, and introduces defeasible reasoning as one of the most important future directions in natural language reasoning research. We focus on single-modality unstructured natural language text, excluding neuro-symbolic techniques and mathematical reasoning.
△ Less
Submitted 13 May, 2023; v1 submitted 26 March, 2023;
originally announced March 2023.
-
F-transforms determined by overlap and grouping maps over a complete lattice
Authors:
Abha Tripathi,
S. P. Tiwari,
Sutapa Mahato
Abstract:
This paper is about the study of F-transforms based on overlap and grouping maps, residual and co-residual implicator over complete lattice from both constructive and axiomatic approaches. Further, the duality, basic properties, and the inverse of proposed F-transforms have been studied, and axiomatic characterizations of proposed direct F-transforms are investigated.
This paper is about the study of F-transforms based on overlap and grouping maps, residual and co-residual implicator over complete lattice from both constructive and axiomatic approaches. Further, the duality, basic properties, and the inverse of proposed F-transforms have been studied, and axiomatic characterizations of proposed direct F-transforms are investigated.
△ Less
Submitted 16 December, 2022;
originally announced January 2023.
-
Probing Cosmology beyond $Λ$CDM using the SKA
Authors:
Shamik Ghosh,
Pankaj Jain,
Rahul Kothari,
Mohit Panwar,
Gurmeet Singh,
Prabhakar Tiwari
Abstract:
The cosmological principle states that the Universe is statistically homogeneous and isotropic at large distance scales. There currently exist many observations which indicate a departure from this principle. It has been shown that many of these observations can be explained by invoking superhorizon cosmological perturbations and may be consistent with the Big Bang paradigm. Remarkably, these mode…
▽ More
The cosmological principle states that the Universe is statistically homogeneous and isotropic at large distance scales. There currently exist many observations which indicate a departure from this principle. It has been shown that many of these observations can be explained by invoking superhorizon cosmological perturbations and may be consistent with the Big Bang paradigm. Remarkably, these modes simultaneously explain the observed Hubble tension, i.e., the discrepancy between the direct and indirect measurements of the Hubble parameter. We propose several tests of the cosmological principle using SKA. In particular, we can reliably extract the signal of dipole anisotropy in the distribution of radio galaxies. The superhorizon perturbations also predict a significant redshift dependence of the dipole signal which can be nicely tested by the study of signals of reionization and the dark ages using SKA. We also propose to study the alignment of radio galaxy axes as well as their integrated polarization vectors over distance scales ranging from a few Mpc to Gpc. We discuss data analysis techniques that can reliably extract these signals from data.
△ Less
Submitted 28 March, 2023; v1 submitted 8 January, 2023;
originally announced January 2023.
-
Observation of time-reversal symmetric Hall effect in graphene-WSe2 heterostructures at room temperature
Authors:
Priya Tiwari,
Divya Sahani,
Atasi Chakraborty,
Kamal Das,
Kenji Watanabe,
Takashi Taniguchi,
Amit Agarwal,
Aveek Bid
Abstract:
In this letter, we provide experimental evidence of the time-reversal symmetric Hall effect in a mesoscopic system, namely high-mobility graphene/WSe$_2$ heterostructures. This linear, dissipative Hall effect, whose sign depends on the sign of the charge carriers, persists up to room temperature. The magnitude and the sign of the Hall signal can be tuned using an external perpendicular electric fi…
▽ More
In this letter, we provide experimental evidence of the time-reversal symmetric Hall effect in a mesoscopic system, namely high-mobility graphene/WSe$_2$ heterostructures. This linear, dissipative Hall effect, whose sign depends on the sign of the charge carriers, persists up to room temperature. The magnitude and the sign of the Hall signal can be tuned using an external perpendicular electric field. Our joint experimental and theoretical study establishes that the strain induced by lattice mismatch, or angle inhomogeneity, produces anisotropic bands in graphene while simultaneously breaking the inversion symmetry. The band anisotropy and reduced spatial symmetry lead to the appearance of a time-reversal symmetric Hall effect. Our study establishes graphene-transition metal dichalcogenide-based heterostructures as an excellent platform for studying the effects of broken symmetry on the physical properties of band-engineered two-dimensional systems.
△ Less
Submitted 18 July, 2023; v1 submitted 5 January, 2023;
originally announced January 2023.
-
On categories of spaces with L-fuzzy partitions, L-fuzzy closure system spaces and coalgebras (dialgebras)
Authors:
Abha Tripathi,
S. P. Tiwari
Abstract:
In this contribution, we aim to introduce and study L-fuzzy partition spaces and L-fuzzy closure system spaces in a categorical framework. Further, we present the concepts of coalgebras and dialgebras corresponding to a direct upper F -transform under certain conditions and show the functorial relationship between the category of spaces with L-fuzzy partition and the category of coalgebras (dialge…
▽ More
In this contribution, we aim to introduce and study L-fuzzy partition spaces and L-fuzzy closure system spaces in a categorical framework. Further, we present the concepts of coalgebras and dialgebras corresponding to a direct upper F -transform under certain conditions and show the functorial relationship between the category of spaces with L-fuzzy partition and the category of coalgebras (dialgebras). Moreover, we show that the categories of coalgebras and dialgebras are isomorphic and introduce a pair of adjoint functors between the coalgebras and dialgebras.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Granular F-transform and its application
Authors:
Abha Tripathi,
S. P. Tiwari,
J. Kavikumar
Abstract:
This contribution introduces the concept of granular F-transform and investigates its basic properties by using the theory of fuzzy numbers and horizontal membership functions. Further, we present a numerical method based on granular F-transform to solve a fuzzy prey-predator model consisting of two prey and one predator due to its natural variability and investigate the existence of the equilibri…
▽ More
This contribution introduces the concept of granular F-transform and investigates its basic properties by using the theory of fuzzy numbers and horizontal membership functions. Further, we present a numerical method based on granular F-transform to solve a fuzzy prey-predator model consisting of two prey and one predator due to its natural variability and investigate the existence of the equilibrium points and their stability
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Experimental observation of spin-split energy dispersion in high-mobility single-layer graphene/WSe2 heterostructures
Authors:
Priya Tiwari,
Mohit Kumar Jat,
Adithi Udupa,
Deepa S. Narang,
Kenji Watanabe,
Takashi Taniguchi,
Diptiman Sen,
Aveek Bid
Abstract:
Proximity-induced spin-orbit coupling in graphene has led to the observation of intriguing phenomena like time-reversal invariant $\mathbb{Z}_2$ topological phase and spin-orbital filtering effects. An understanding of the effect of spin-orbit coupling on the band structure of graphene is essential if these exciting observations are to be transformed into real-world applications. In this research…
▽ More
Proximity-induced spin-orbit coupling in graphene has led to the observation of intriguing phenomena like time-reversal invariant $\mathbb{Z}_2$ topological phase and spin-orbital filtering effects. An understanding of the effect of spin-orbit coupling on the band structure of graphene is essential if these exciting observations are to be transformed into real-world applications. In this research article, we report the experimental determination of the band structure of single-layer graphene (SLG) in the presence of strong proximity-induced spin-orbit coupling. We achieve this in high-mobility hBN-encapsulated SLG/WSe2 heterostructures through measurements of quantum oscillations. We observe clear spin-splitting of the graphene bands along with a substantial increase in the Fermi velocity. Using a theoretical model with realistic parameters to fit our experimental data, we uncover evidence of a band gap opening and band inversion in the SLG. Further, we establish that the deviation of the low-energy band structure from pristine SLG is determined primarily by the valley-Zeeman SOC and Rashba SOC, with the Kane-Mele SOC being inconsequential. Despite robust theoretical predictions and observations of band-splitting, a quantitative measure of the spin-splitting of the valence and the conduction bands and the consequent low-energy dispersion relation in SLG was missing -- our combined experimental and theoretical study fills this lacuna.
△ Less
Submitted 17 October, 2022;
originally announced October 2022.
-
An Overview of Violence Detection Techniques: Current Challenges and Future Directions
Authors:
Nadia Mumtaz,
Naveed Ejaz,
Shabana Habib,
Syed Muhammad Mohsin,
Prayag Tiwari,
Shahab S. Band,
Neeraj Kumar
Abstract:
The Big Video Data generated in today's smart cities has raised concerns from its purposeful usage perspective, where surveillance cameras, among many others are the most prominent resources to contribute to the huge volumes of data, making its automated analysis a difficult task in terms of computation and preciseness. Violence Detection (VD), broadly plunging under Action and Activity recognitio…
▽ More
The Big Video Data generated in today's smart cities has raised concerns from its purposeful usage perspective, where surveillance cameras, among many others are the most prominent resources to contribute to the huge volumes of data, making its automated analysis a difficult task in terms of computation and preciseness. Violence Detection (VD), broadly plunging under Action and Activity recognition domain, is used to analyze Big Video data for anomalous actions incurred due to humans. The VD literature is traditionally based on manually engineered features, though advancements to deep learning based standalone models are developed for real-time VD analysis. This paper focuses on overview of deep sequence learning approaches along with localization strategies of the detected violence. This overview also dives into the initial image processing and machine learning-based VD literature and their possible advantages such as efficiency against the current complex models. Furthermore,the datasets are discussed, to provide an analysis of the current models, explaining their pros and cons with future directions in VD domain derived from an in-depth analysis of the previous methods.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
A study of Dipolar Signal in distant Quasars with various observables
Authors:
Rahul Kothari,
Mohit Panwar,
Gurmeet Singh,
Prabhakar Tiwari,
Pankaj Jain
Abstract:
We study the signal of anisotropy in AGNs/quasars of CatWISE2020 catalogue using different observables. It has been reported earlier that this data shows a strong signal of dipole anisotropy in the source number counts. We test this claim using two independent data analysis procedures and find our number count dipole consistent with the earlier results. In addition to number counts, we test for th…
▽ More
We study the signal of anisotropy in AGNs/quasars of CatWISE2020 catalogue using different observables. It has been reported earlier that this data shows a strong signal of dipole anisotropy in the source number counts. We test this claim using two independent data analysis procedures and find our number count dipole consistent with the earlier results. In addition to number counts, we test for the anisotropy signal in two other observables -- mean spectral index $\barα$ and mean flux density $\bar{B}$. We find a dipole signal of considerable strength both in the mean spectral index and the mean flux density. The dipole in mean flux density points towards the galactic center and becomes very weak after imposing a flux cut to remove sources with flux greater than 1 mJy. This can be attributed to the presence of some bright sources. The signal in mean spectral index, however, is relatively stable as a function of both flux and galactic cuts. The dipole in this observable points roughly opposite to the galactic center and hence most likely arises due to galactic bias. Hence, the signal in both the mean spectral index and mean flux density appears to be consistent with isotropy.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Knowledge Enhancement and Mobile Technology: Improving Effectiveness and Efficiency
Authors:
Siddhartha Paul Tiwari
Abstract:
Mobile technology creates value on three fundamental pillars: productivity, coordination, and transformation. Mobile apps are becoming an increasingly important aspect of teaching and learning in many countries. The use of mobile applications in education is not only beneficial, but also provides students with an enjoyable and interactive experience. For a mobile product launch to be as successful…
▽ More
Mobile technology creates value on three fundamental pillars: productivity, coordination, and transformation. Mobile apps are becoming an increasingly important aspect of teaching and learning in many countries. The use of mobile applications in education is not only beneficial, but also provides students with an enjoyable and interactive experience. For a mobile product launch to be as successful as it can be, it is imperative that a systematic, precise, controlled, and well-established process is in place, which is controlled, efficient, and well-established. Many educational organisation's find themselves in situations where they have to get all departments working effectively and together in order to meet a specific deadline, including marketing, production, and operations, after the organization's product clearance board approves the new product. In many ways, the situation is similar to the software crisis that took place in the middle of the 1970's. As a result of globalisation and communication, oftentimes the effects of globalisation are amplified because of the vast amount of information that must be shared among project team members. Every educational organization has a unique style and way of doing things, and the project management team is no exception. As a consequence, it is commonplace to see that every education entity has its own particular style and way of doing things. In regards to the creation of a mobile application that is going to function efficiently, it is important to remember that it is extremely important to stick to the strategies and requirements that will yield the best results for the education sector.
△ Less
Submitted 18 July, 2022;
originally announced August 2022.
-
The clustering properties of AGNs/quasars in CatWISE2020 catalog
Authors:
Prabhakar Tiwari,
Gong-Bo Zhao,
Adi Nusser
Abstract:
We study the clustering properties of 1,307,530 AGNs/quasars in the CatWISE2020 catalog prepared using the Wide-field Infrared Survey Explorer (WISE) and Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE) survey data. For angular moments $\ell \gtrapprox 10$ ($\lessapprox 18^\circ$) down to non-linear scales, the results are in agreement with the standard $Λ$CDM cosmology, with a gala…
▽ More
We study the clustering properties of 1,307,530 AGNs/quasars in the CatWISE2020 catalog prepared using the Wide-field Infrared Survey Explorer (WISE) and Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE) survey data. For angular moments $\ell \gtrapprox 10$ ($\lessapprox 18^\circ$) down to non-linear scales, the results are in agreement with the standard $Λ$CDM cosmology, with a galaxy bias roughly matching that of the NRAO VLA Sky Survey (NVSS) AGNs. We further explore the redshift dependence of the fraction of infrared bright AGNs on stellar mass, $f_{\rm IB} \sim M_*^{α_0 + α_1 z}$, and find $α_1=1.27^{+0.25}_{-0.30}$, ruling out a non-evolution hypothesis at $\approx 4.6σ$ confidence level. The results are consistent with the measurements obtained with NVSS AGNs, though considerably more precise thanks to the significantly higher number density of objects in CatWISE2020. The excess dipole and high clustering signal above angular scale $\approx 18^\circ$ remain anomalous.
△ Less
Submitted 8 February, 2023; v1 submitted 19 July, 2022;
originally announced July 2022.
-
Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk
Authors:
Benyou Wang,
Xiangbo Wu,
Xiaokang Liu,
Jianquan Li,
Prayag Tiwari,
Qianqian Xie
Abstract:
Language is the principal tool for human communication, in which humor is one of the most attractive parts. Producing natural language like humans using computers, a.k.a, Natural Language Generation (NLG), has been widely used for dialogue systems, chatbots, machine translation, as well as computer-aid creation e.g., idea generations, scriptwriting. However, the humor aspect of natural language is…
▽ More
Language is the principal tool for human communication, in which humor is one of the most attractive parts. Producing natural language like humans using computers, a.k.a, Natural Language Generation (NLG), has been widely used for dialogue systems, chatbots, machine translation, as well as computer-aid creation e.g., idea generations, scriptwriting. However, the humor aspect of natural language is relatively under-investigated, especially in the age of pre-trained language models. In this work, we aim to preliminarily test whether NLG can generate humor as humans do. We build a new dataset consisting of numerous digitized Chinese Comical Crosstalk scripts (called C$^3$ in short), which is for a popular Chinese performing art called `Xiangsheng' since 1800s. (For convenience for non-Chinese speakers, we called `crosstalk' for `Xiangsheng' in this paper.) We benchmark various generation approaches including training-from-scratch Seq2seq, fine-tuned middle-scale PLMs, and large-scale PLMs (with and without fine-tuning). Moreover, we also conduct a human assessment, showing that 1) large-scale pretraining largely improves crosstalk generation quality; and 2) even the scripts generated from the best PLM is far from what we expect, with only 65% quality of human-created crosstalk. We conclude, humor generation could be largely improved using large-scaled PLMs, but it is still in its infancy.
The data and benchmarking code is publicly available in \url{https://github.com/anonNo2/crosstalk-generation}.
△ Less
Submitted 2 July, 2022;
originally announced July 2022.
-
PSL is Dead. Long Live PSL
Authors:
Kevin Smith,
Hai Lin,
Praveen Tiwari,
Marjorie Sayer,
Claudionor Coelho
Abstract:
Property Specification Language (PSL) is a form of temporal logic that has been mainly used in discrete domains (e.g. formal hardware verification). In this paper, we show that by merging machine learning techniques with PSL monitors, we can extend PSL to work on continuous domains. We apply this technique in machine learning-based anomaly detection to analyze scenarios of real-time streaming even…
▽ More
Property Specification Language (PSL) is a form of temporal logic that has been mainly used in discrete domains (e.g. formal hardware verification). In this paper, we show that by merging machine learning techniques with PSL monitors, we can extend PSL to work on continuous domains. We apply this technique in machine learning-based anomaly detection to analyze scenarios of real-time streaming events from continuous variables in order to detect abnormal behaviors of a system. By using machine learning with formal models, we leverage the strengths of both machine learning methods and formal semantics of time. On one hand, machine learning techniques can produce distributions on continuous variables, where abnormalities can be captured as deviations from the distributions. On the other hand, formal methods can characterize discrete temporal behaviors and relations that cannot be easily learned by machine learning techniques. Interestingly, the anomalies detected by machine learning and the underlying time representation used are discrete events. We implemented a temporal monitoring package (TEF) that operates in conjunction with normal data science packages for anomaly detection machine learning systems, and we show that TEF can be used to perform accurate interpretation of temporal correlation between events.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
GeV emission from a compact binary merger
Authors:
Alessio Mei,
Biswajit Banerjee,
Gor Oganesyan,
Om Sharan Salafia,
Stefano Giarratana,
Marica Branchesi,
Paolo D'Avanzo,
Sergio Campana,
Giancarlo Ghirlanda,
Samuele Ronchini,
Amit Shukla,
Pawan Tiwari
Abstract:
An energetic $\rm γ$-ray burst (GRB), GRB 211211A, was observed on 2021 December 11 by the Neil Gehrels Swift Observatory. Despite its long duration, typically associated with bursts produced by the collapse of massive stars, the discovery of an optical-infrared kilonova and a quasi-periodic oscillation during a gamma-ray precursor points to a compact object binary merger origin. The complete unde…
▽ More
An energetic $\rm γ$-ray burst (GRB), GRB 211211A, was observed on 2021 December 11 by the Neil Gehrels Swift Observatory. Despite its long duration, typically associated with bursts produced by the collapse of massive stars, the discovery of an optical-infrared kilonova and a quasi-periodic oscillation during a gamma-ray precursor points to a compact object binary merger origin. The complete understanding of this nearby ($\sim$ 1 billion light-years) burst will significantly impact our knowledge of GRB progenitors and the physical processes that lead to electromagnetic emission in compact binary mergers. Here, we report the discovery of a significant ($\rm >5 σ$) transient-like emission in the high-energy $\rm γ$-rays (HE; E$>0.1$ GeV) observed by Fermi/LAT starting at $10^3$ s after the burst. After an initial phase with a roughly constant flux ($\rm \sim 5\times 10^{-10}\ erg\ s^{-1}\ cm^{-2}$) lasting $\sim 2\times 10^4$ s, the flux started decreasing and soon went undetected. The multi-wavelength afterglow emission observed at such late times is usually in good agreement with synchrotron emission from a relativistic shock wave that arises as the GRB jet decelerates in the interstellar medium. However, our detailed modelling of a rich dataset comprising public and dedicated multi-wavelength observations demonstrates that GeV emission from GRB 211211A is in excess with respect to the expectation of this scenario. We explore the possibility that the GeV excess is inverse Compton emission due to the interaction of a long-lived, low-power jet with an external source of photons. We discover that the kilonova emission can provide the necessary seed photons for GeV emission in binary neutron star mergers.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Knowledge Management Strategies and Emerging Technologies -- An Overview Of the Underpinning Concepts
Authors:
Siddhartha Paul Tiwari
Abstract:
Among the essential elements of knowledge management is the use of information and data, as well as the knowledge, skills, and abilities inherent within communities, as well as their ideas, commitments, and motivations for making good decisions as emerging technologies become more prevalent. Numerous leading social scientists in this field have asserted that organisational knowledge should be rega…
▽ More
Among the essential elements of knowledge management is the use of information and data, as well as the knowledge, skills, and abilities inherent within communities, as well as their ideas, commitments, and motivations for making good decisions as emerging technologies become more prevalent. Numerous leading social scientists in this field have asserted that organisational knowledge should be regarded as a strategic asset. There is a growing awareness of the importance of gathering, locating, capturing, and sharing collective knowledge and expertise of societies, and societies are urged to develop effective and efficient methods of gathering, locating, capturing, and sharing that knowledge in order to deal with problems and to benefit from opportunities. People living in many countries and regions are interested in implementing knowledge management processes and technologies, and many of them have included knowledge management as an integral part of their overall development strategies. The management of knowledge plays an increasingly important role in global economic development (Bell, 1973, 1978). In order to remain relevant in the modern world, organisations should not ignore knowledge management and emerging technologies.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.