Questions tagged [statistics]
Statistics is the mathematical study of using probability to infer characteristics of a population from a limited number of samples or observations. Non-programming statistics questions are off-topic here, and they should be posted at https://stats.stackexchange.com instead.
statistics
16,436
questions
0
votes
0
answers
21
views
Giving a data set additional information
I want to give additional information to fungal_tab data set by combining it with taxa_fungi data set. I want to add all the Mycorrhiza information regardless it is ectomycorrhiza or endomycorrhiza. I ...
-1
votes
0
answers
17
views
How do I compare two big Temperature-Matrix-Datasets and classify the difference/similarities? How do I proceed finding a good method? [closed]
I have the following issue: I want to compare thermal pictures (data is present as a big Matrix/ each Pixel one value) in python and understand weather they are different. What statistical approaches ...
0
votes
0
answers
16
views
The variance of the sum of 2 normal distributions is supposed to be the the sum of the individual variances, but why isn't it below?
Consider the python code below.
import statistics as s
m1,sigma1 = 100,20
m2,sigma2 = 70,20
n1,n2 = 100000,100000
d1 = s.NormalDist(mu = m1, sigma=sigma1)
d2 = s.NormalDist(mu = m2, sigma=sigma2)
...
0
votes
0
answers
20
views
Do I need to 'shift' my new target variable in Time Series prediction?
When I have time series data but I don't want to predict the values of the series. Instead, I create a new target variable that gives me the direction (1 to represent an increase and 0 to represent a ...
1
vote
1
answer
14
views
How to deal with word counts of zero when calculating Pointwise Mutual Information (PMI) for word cooccurrences in Natural Language Processing
I have a co-occurrence matrix of words in a text (two words x and y are considered co-occurring, if they both occur in a context window of w words). I want to calculate the Pointwise Mutual ...
0
votes
0
answers
16
views
How to code an algorithm for Linda Index calculation in SQL (Apache Superset)?
I collided with a real obstacle trying to bring the calculation of Linda index in Apache Superset SQL Notebook. Let's assume we have a table for 6 firms in one region for one year with their market ...
1
vote
2
answers
42
views
Python - Check if the last value in a sequence is relatively higher than the rest
For a list of percentage data, I need to check if the last value (90.2) is somehow higher and somewhat "abnormal" than the rest of the data. Clearly it is in this sequence.
delivery_pct = [...
0
votes
0
answers
8
views
Hausman test report [closed]
I've been working on a research project using a multi-level regression model, and I'm currently figuring out how to present the Hausman test results. I've seen some papers where authors mention doing ...
0
votes
0
answers
10
views
How to prove expected residual time formula
Can anyone help me to show that for a general random variable X with a fixed distribution to model the time interval between the independent events. The expected residual time, which is the time one ...
0
votes
0
answers
14
views
Variation in the total number of samples when performing multiple fisher exact tests [migrated]
I have dataset of 2 time points. In each time point I have to compare the counts of different components at each position (number of positions =1000). I used fisher exact test to compare if there is a ...
0
votes
0
answers
28
views
How to compare variance of distance between two treatments over time?
I am interested in understanding how my variable (distance) changes over time between two treatment groups. I now want to compare measures of variance for distance between the two treatment groups, ...
1
vote
0
answers
19
views
How to check if a variable varies across timestamps or across samples
Let's assume we have some time-series data where every person (sample) has measurements for 2 variables (Var1, Var2) for three timepoints. Following is a dummy representation of the data.
Is there a ...
0
votes
0
answers
9
views
Hovmoller Plot in Python not showing correctly
I am trying to create a Hovmoller plot in Python on Google Colab, but Hovmoller is not showing as it should show. It's showing vertical different color boxes
how to fix this?
Here is the code that I ...
-1
votes
0
answers
29
views
What distribution should I use? [closed]
I'm new to statistics and need to create a probability distribution with the following study:
*Note: Due to proprietary reasons the example given below is the best representation of the real data.
...
0
votes
0
answers
10
views
What is the best test to run to compare presence/absence of something between sampling locations?
I have data on the presence/absence of four different pathogens found in edible crabs in 7 locations over 2 seasons. For this study, 30 crabs were collected from each site and for each crab the ...