Overview

Dataset statistics

Number of variables3
Number of observations5003
Missing cells5003
Missing cells (%)33.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory132.0 KiB
Average record size in memory27.0 B

Variable types

Numeric2
Unsupported1

Dataset

Description농가의 일일 영농활동(교육, 시비 작업 등), 생산, 판매 활동 등 기록 관리시스템으로 교육일련번호, 파일일련번호 비고 등을 제공합니다.
Author충청북도
URLhttps://www.data.go.kr/data/15050305/fileData.do

Alerts

교육일련번호 is highly overall correlated with 파일일련번호High correlation
파일일련번호 is highly overall correlated with 교육일련번호High correlation
비고1 has 5003 (100.0%) missing valuesMissing
파일일련번호 has unique valuesUnique
비고1 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 03:33:31.632546
Analysis finished2023-12-12 03:33:32.182359
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교육일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct2676
Distinct (%)53.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4278.6698
Minimum7
Maximum9474
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T12:33:32.249783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile489.4
Q12060.5
median4162
Q36294
95-th percentile8580.4
Maximum9474
Range9467
Interquartile range (IQR)4233.5

Descriptive statistics

Standard deviation2579.1662
Coefficient of variation (CV)0.60279628
Kurtosis-1.1013563
Mean4278.6698
Median Absolute Deviation (MAD)2122
Skewness0.17732619
Sum21406185
Variance6652098.5
MonotonicityIncreasing
2023-12-12T12:33:32.370442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4378 6
 
0.1%
5099 6
 
0.1%
5333 6
 
0.1%
5368 6
 
0.1%
5022 5
 
0.1%
5233 4
 
0.1%
5542 4
 
0.1%
6138 4
 
0.1%
7823 4
 
0.1%
4923 4
 
0.1%
Other values (2666) 4954
99.0%
ValueCountFrequency (%)
7 2
< 0.1%
9 1
 
< 0.1%
11 1
 
< 0.1%
19 1
 
< 0.1%
21 1
 
< 0.1%
23 1
 
< 0.1%
24 3
0.1%
26 1
 
< 0.1%
27 1
 
< 0.1%
29 2
< 0.1%
ValueCountFrequency (%)
9474 2
< 0.1%
9471 1
 
< 0.1%
9443 3
0.1%
9439 2
< 0.1%
9438 1
 
< 0.1%
9430 1
 
< 0.1%
9429 1
 
< 0.1%
9425 1
 
< 0.1%
9422 1
 
< 0.1%
9421 1
 
< 0.1%

파일일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5003
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44244.937
Minimum33156
Maximum76894
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T12:33:32.712589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33156
5-th percentile33406.1
Q134407.5
median35667
Q352106.5
95-th percentile71306.1
Maximum76894
Range43738
Interquartile range (IQR)17699

Descriptive statistics

Standard deviation12845.355
Coefficient of variation (CV)0.29032372
Kurtosis-0.33480174
Mean44244.937
Median Absolute Deviation (MAD)2292
Skewness1.0112812
Sum2.2135742 × 108
Variance1.6500314 × 108
MonotonicityNot monotonic
2023-12-12T12:33:32.820631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33156 1
 
< 0.1%
48107 1
 
< 0.1%
48129 1
 
< 0.1%
48128 1
 
< 0.1%
48127 1
 
< 0.1%
48111 1
 
< 0.1%
48110 1
 
< 0.1%
48109 1
 
< 0.1%
48108 1
 
< 0.1%
48092 1
 
< 0.1%
Other values (4993) 4993
99.8%
ValueCountFrequency (%)
33156 1
< 0.1%
33157 1
< 0.1%
33158 1
< 0.1%
33159 1
< 0.1%
33160 1
< 0.1%
33161 1
< 0.1%
33162 1
< 0.1%
33163 1
< 0.1%
33164 1
< 0.1%
33165 1
< 0.1%
ValueCountFrequency (%)
76894 1
< 0.1%
76893 1
< 0.1%
76879 1
< 0.1%
76723 1
< 0.1%
76722 1
< 0.1%
76721 1
< 0.1%
76700 1
< 0.1%
76699 1
< 0.1%
76698 1
< 0.1%
76546 1
< 0.1%

비고1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5003
Missing (%)100.0%
Memory size44.1 KiB

Interactions

2023-12-12T12:33:31.897500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:33:31.727877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:33:31.992496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:33:31.811439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:33:32.894158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육일련번호파일일련번호
교육일련번호1.0000.965
파일일련번호0.9651.000
2023-12-12T12:33:32.974341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육일련번호파일일련번호
교육일련번호1.0001.000
파일일련번호1.0001.000

Missing values

2023-12-12T12:33:32.082325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:33:32.151160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교육일련번호파일일련번호비고1
0733156<NA>
1733157<NA>
2933158<NA>
31133159<NA>
41933160<NA>
52133161<NA>
62333162<NA>
72433163<NA>
82433164<NA>
92433165<NA>
교육일련번호파일일련번호비고1
4993943076533<NA>
4994943876698<NA>
4995943976699<NA>
4996943976700<NA>
4997944376721<NA>
4998944376722<NA>
4999944376723<NA>
5000947176879<NA>
5001947476893<NA>
5002947476894<NA>