Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory332.0 KiB
Average record size in memory34.0 B

Variable types

Numeric2
Categorical1

Dataset

Description농가의 일일 영농활동(교육, 시비 작업 등), 생산, 판매 활동 등 기록 관리시스템으로 영농일지일련번호, 파일일련번호, 파일종류등을 제공합니다.
Author충청북도
URLhttps://www.data.go.kr/data/15050306/fileData.do

Alerts

파일종류 has constant value ""Constant
영농일지일련번호 is highly overall correlated with 파일일련번호High correlation
파일일련번호 is highly overall correlated with 영농일지일련번호High correlation
파일일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:54:51.708881
Analysis finished2023-12-12 19:54:53.001343
Duration1.29 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

영농일지일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct9039
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean150716.48
Minimum2412
Maximum315448
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:54:53.104679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2412
5-th percentile19383.25
Q171338.5
median144358
Q3227095
95-th percentile297232.9
Maximum315448
Range313036
Interquartile range (IQR)155756.5

Descriptive statistics

Standard deviation89376.063
Coefficient of variation (CV)0.59300791
Kurtosis-1.181072
Mean150716.48
Median Absolute Deviation (MAD)76952.5
Skewness0.14509994
Sum1.5071648 × 109
Variance7.9880806 × 109
MonotonicityNot monotonic
2023-12-13T04:54:53.283669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
302621 4
 
< 0.1%
308737 3
 
< 0.1%
268122 3
 
< 0.1%
252768 3
 
< 0.1%
266570 3
 
< 0.1%
252925 3
 
< 0.1%
273598 3
 
< 0.1%
106058 3
 
< 0.1%
7888 3
 
< 0.1%
206383 3
 
< 0.1%
Other values (9029) 9969
99.7%
ValueCountFrequency (%)
2412 1
< 0.1%
3235 1
< 0.1%
3484 1
< 0.1%
5420 1
< 0.1%
5421 1
< 0.1%
6442 1
< 0.1%
6743 1
< 0.1%
6744 1
< 0.1%
6745 1
< 0.1%
6747 1
< 0.1%
ValueCountFrequency (%)
315448 1
< 0.1%
315444 1
< 0.1%
315342 1
< 0.1%
315327 2
< 0.1%
315321 1
< 0.1%
315319 1
< 0.1%
315248 1
< 0.1%
315235 1
< 0.1%
315225 1
< 0.1%
315162 1
< 0.1%

파일일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36750.119
Minimum1
Maximum76889
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:54:53.450453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3068.8
Q115001.75
median38240.5
Q358175.5
95-th percentile73188.75
Maximum76889
Range76888
Interquartile range (IQR)43173.75

Descriptive statistics

Standard deviation23602.487
Coefficient of variation (CV)0.64224247
Kurtosis-1.3856932
Mean36750.119
Median Absolute Deviation (MAD)21777.5
Skewness0.091064509
Sum3.6750119 × 108
Variance5.570774 × 108
MonotonicityNot monotonic
2023-12-13T04:54:53.651336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7857 1
 
< 0.1%
36170 1
 
< 0.1%
63605 1
 
< 0.1%
52325 1
 
< 0.1%
74803 1
 
< 0.1%
11147 1
 
< 0.1%
6663 1
 
< 0.1%
60238 1
 
< 0.1%
38195 1
 
< 0.1%
19826 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
5 1
< 0.1%
7 1
< 0.1%
18 1
< 0.1%
19 1
< 0.1%
35 1
< 0.1%
52 1
< 0.1%
55 1
< 0.1%
58 1
< 0.1%
62 1
< 0.1%
ValueCountFrequency (%)
76889 1
< 0.1%
76886 1
< 0.1%
76859 1
< 0.1%
76841 1
< 0.1%
76839 1
< 0.1%
76836 1
< 0.1%
76831 1
< 0.1%
76826 1
< 0.1%
76825 1
< 0.1%
76822 1
< 0.1%

파일종류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
W
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowW
2nd rowW
3rd rowW
4th rowW
5th rowW

Common Values

ValueCountFrequency (%)
W 10000
100.0%

Length

2023-12-13T04:54:53.844801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:54:53.981447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
w 10000
100.0%

Interactions

2023-12-13T04:54:52.170025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:54:51.894085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:54:52.334458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:54:52.029503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:54:54.067040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영농일지일련번호파일일련번호
영농일지일련번호1.0000.982
파일일련번호0.9821.000
2023-12-13T04:54:54.194675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영농일지일련번호파일일련번호
영농일지일련번호1.0001.000
파일일련번호1.0001.000

Missing values

2023-12-13T04:54:52.846865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:54:52.951583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

영농일지일련번호파일일련번호파일종류
7858384407857W
5251426663066501W
5597328319570566W
3079314507838539W
7978786798W
8476418358477W
129226165012929W
2216310598922197W
4919225292962407W
6074731380076347W
영농일지일련번호파일일련번호파일종류
4878524709861872W
2935514037636742W
5370327039967892W
6054231166576106W
188459275218861W
6061931197176186W
6067631223476241W
3355415611841950W
4661923139260417W
2576612329925804W