Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory683.6 KiB
Average record size in memory70.0 B

Variable types

Numeric6
Categorical1

Dataset

Description2021년 공공데이터 기업매칭지원사업을 통해 수집한 제주 지역 내 고품질 감귤 농가의 위치과실당도데이터입니다. 데이터 항목별 설명은 다음과 같습니다. 대상 ID, id(위치키), fc_id(당도키), fcp_position(측정위치), fcp_brix(평균당도), fcp_size(평균크기)
Author제주국제자유도시개발센터
URLhttps://www.data.go.kr/data/15097171/fileData.do

Alerts

구분 is highly overall correlated with 대상 아이디 and 3 other fieldsHigh correlation
대상 아이디 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
아이디 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
당도키 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
평균크기 is highly overall correlated with 구분 and 3 other fieldsHigh correlation
구분 has unique valuesUnique
대상 아이디 has unique valuesUnique
아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:17:15.587085
Analysis finished2023-12-12 04:17:21.691481
Duration6.1 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20815.831
Minimum3
Maximum41733
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:21.768731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile2039.8
Q110636.25
median20739
Q331252.25
95-th percentile39628.1
Maximum41733
Range41730
Interquartile range (IQR)20616

Descriptive statistics

Standard deviation12011.357
Coefficient of variation (CV)0.57702991
Kurtosis-1.1878284
Mean20815.831
Median Absolute Deviation (MAD)10317.5
Skewness0.0043005268
Sum2.0815831 × 108
Variance1.442727 × 108
MonotonicityNot monotonic
2023-12-12T13:17:21.936997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34829 1
 
< 0.1%
2664 1
 
< 0.1%
5648 1
 
< 0.1%
15438 1
 
< 0.1%
33016 1
 
< 0.1%
1556 1
 
< 0.1%
10115 1
 
< 0.1%
37954 1
 
< 0.1%
4856 1
 
< 0.1%
13576 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
4 1
< 0.1%
18 1
< 0.1%
31 1
< 0.1%
32 1
< 0.1%
36 1
< 0.1%
37 1
< 0.1%
40 1
< 0.1%
45 1
< 0.1%
47 1
< 0.1%
ValueCountFrequency (%)
41733 1
< 0.1%
41719 1
< 0.1%
41718 1
< 0.1%
41717 1
< 0.1%
41705 1
< 0.1%
41702 1
< 0.1%
41692 1
< 0.1%
41690 1
< 0.1%
41687 1
< 0.1%
41681 1
< 0.1%

대상 아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3405208 × 109
Minimum1.3405 × 109
Maximum1.3405417 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:22.054924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.3405 × 109
5-th percentile1.340502 × 109
Q11.3405106 × 109
median1.3405207 × 109
Q31.3405313 × 109
95-th percentile1.3405396 × 109
Maximum1.3405417 × 109
Range41730
Interquartile range (IQR)20616

Descriptive statistics

Standard deviation12011.357
Coefficient of variation (CV)8.9602168 × 10-6
Kurtosis-1.1878284
Mean1.3405208 × 109
Median Absolute Deviation (MAD)10317.5
Skewness0.0043005268
Sum1.3405208 × 1013
Variance1.442727 × 108
MonotonicityNot monotonic
2023-12-12T13:17:22.171014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1340534829 1
 
< 0.1%
1340502664 1
 
< 0.1%
1340505648 1
 
< 0.1%
1340515438 1
 
< 0.1%
1340533016 1
 
< 0.1%
1340501556 1
 
< 0.1%
1340510115 1
 
< 0.1%
1340537954 1
 
< 0.1%
1340504856 1
 
< 0.1%
1340513576 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1340500003 1
< 0.1%
1340500004 1
< 0.1%
1340500018 1
< 0.1%
1340500031 1
< 0.1%
1340500032 1
< 0.1%
1340500036 1
< 0.1%
1340500037 1
< 0.1%
1340500040 1
< 0.1%
1340500045 1
< 0.1%
1340500047 1
< 0.1%
ValueCountFrequency (%)
1340541733 1
< 0.1%
1340541719 1
< 0.1%
1340541718 1
< 0.1%
1340541717 1
< 0.1%
1340541705 1
< 0.1%
1340541702 1
< 0.1%
1340541692 1
< 0.1%
1340541690 1
< 0.1%
1340541687 1
< 0.1%
1340541681 1
< 0.1%

아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20815.831
Minimum3
Maximum41733
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:22.279983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile2039.8
Q110636.25
median20739
Q331252.25
95-th percentile39628.1
Maximum41733
Range41730
Interquartile range (IQR)20616

Descriptive statistics

Standard deviation12011.357
Coefficient of variation (CV)0.57702991
Kurtosis-1.1878284
Mean20815.831
Median Absolute Deviation (MAD)10317.5
Skewness0.0043005268
Sum2.0815831 × 108
Variance1.442727 × 108
MonotonicityNot monotonic
2023-12-12T13:17:22.390793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34829 1
 
< 0.1%
2664 1
 
< 0.1%
5648 1
 
< 0.1%
15438 1
 
< 0.1%
33016 1
 
< 0.1%
1556 1
 
< 0.1%
10115 1
 
< 0.1%
37954 1
 
< 0.1%
4856 1
 
< 0.1%
13576 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
4 1
< 0.1%
18 1
< 0.1%
31 1
< 0.1%
32 1
< 0.1%
36 1
< 0.1%
37 1
< 0.1%
40 1
< 0.1%
45 1
< 0.1%
47 1
< 0.1%
ValueCountFrequency (%)
41733 1
< 0.1%
41719 1
< 0.1%
41718 1
< 0.1%
41717 1
< 0.1%
41705 1
< 0.1%
41702 1
< 0.1%
41692 1
< 0.1%
41690 1
< 0.1%
41687 1
< 0.1%
41681 1
< 0.1%

당도키
Real number (ℝ)

HIGH CORRELATION 

Distinct7784
Distinct (%)77.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6938.9419
Minimum1
Maximum13911
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:22.492395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile679.95
Q13545.75
median6913.5
Q310418
95-th percentile13210
Maximum13911
Range13910
Interquartile range (IQR)6872.25

Descriptive statistics

Standard deviation4003.7864
Coefficient of variation (CV)0.57700245
Kurtosis-1.187828
Mean6938.9419
Median Absolute Deviation (MAD)3439.5
Skewness0.0043013004
Sum69389419
Variance16030306
MonotonicityNot monotonic
2023-12-12T13:17:22.605981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9541 3
 
< 0.1%
13387 3
 
< 0.1%
1199 3
 
< 0.1%
155 3
 
< 0.1%
11760 3
 
< 0.1%
4329 3
 
< 0.1%
10882 3
 
< 0.1%
9623 3
 
< 0.1%
49 3
 
< 0.1%
7865 3
 
< 0.1%
Other values (7774) 9970
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
6 1
< 0.1%
11 2
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
15 1
< 0.1%
16 2
< 0.1%
17 2
< 0.1%
ValueCountFrequency (%)
13911 1
< 0.1%
13907 1
< 0.1%
13906 2
< 0.1%
13902 1
< 0.1%
13901 1
< 0.1%
13898 1
< 0.1%
13897 1
< 0.1%
13896 1
< 0.1%
13894 1
< 0.1%
13893 2
< 0.1%

측정위치
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3367 
3343 
3290 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
3367
33.7%
3343
33.4%
3290
32.9%

Length

2023-12-12T13:17:22.726384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:17:22.807377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3367
33.7%
3343
33.4%
3290
32.9%

평균당도
Real number (ℝ)

Distinct262
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.854797
Minimum0
Maximum16.77
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:22.901563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8.83
Q19.93
median10.73
Q311.73
95-th percentile13.2
Maximum16.77
Range16.77
Interquartile range (IQR)1.8

Descriptive statistics

Standard deviation1.3338123
Coefficient of variation (CV)0.12287768
Kurtosis0.58632375
Mean10.854797
Median Absolute Deviation (MAD)0.9
Skewness0.31554831
Sum108547.97
Variance1.7790552
MonotonicityNot monotonic
2023-12-12T13:17:23.013666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.37 122
 
1.2%
10.6 122
 
1.2%
10.63 113
 
1.1%
10.73 111
 
1.1%
10.4 111
 
1.1%
10.07 110
 
1.1%
10.53 110
 
1.1%
10.7 108
 
1.1%
10.5 107
 
1.1%
10.27 104
 
1.0%
Other values (252) 8882
88.8%
ValueCountFrequency (%)
0.0 1
< 0.1%
5.97 1
< 0.1%
6.77 1
< 0.1%
6.83 1
< 0.1%
6.97 1
< 0.1%
7.0 1
< 0.1%
7.03 1
< 0.1%
7.07 1
< 0.1%
7.1 2
< 0.1%
7.27 2
< 0.1%
ValueCountFrequency (%)
16.77 1
< 0.1%
16.53 1
< 0.1%
16.37 1
< 0.1%
16.33 1
< 0.1%
16.3 1
< 0.1%
16.27 1
< 0.1%
16.2 1
< 0.1%
16.07 1
< 0.1%
15.93 1
< 0.1%
15.8 2
< 0.1%

평균크기
Real number (ℝ)

HIGH CORRELATION 

Distinct3198
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57.579571
Minimum39.16
Maximum80.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:17:23.142205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum39.16
5-th percentile42.51
Q151.7
median58.32
Q363.59
95-th percentile70.49
Maximum80.3
Range41.14
Interquartile range (IQR)11.89

Descriptive statistics

Standard deviation8.4782674
Coefficient of variation (CV)0.14724437
Kurtosis-0.58404777
Mean57.579571
Median Absolute Deviation (MAD)5.8
Skewness-0.15311191
Sum575795.71
Variance71.881018
MonotonicityNot monotonic
2023-12-12T13:17:23.284366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
56.72 12
 
0.1%
60.52 11
 
0.1%
56.58 11
 
0.1%
63.11 11
 
0.1%
62.18 11
 
0.1%
63.59 11
 
0.1%
53.91 10
 
0.1%
59.46 10
 
0.1%
59.0 10
 
0.1%
56.57 10
 
0.1%
Other values (3188) 9893
98.9%
ValueCountFrequency (%)
39.16 1
< 0.1%
39.37 1
< 0.1%
39.39 1
< 0.1%
39.41 1
< 0.1%
39.46 1
< 0.1%
39.5 1
< 0.1%
39.56 1
< 0.1%
39.62 1
< 0.1%
39.65 1
< 0.1%
39.66 1
< 0.1%
ValueCountFrequency (%)
80.3 1
< 0.1%
80.23 1
< 0.1%
80.11 1
< 0.1%
80.09 1
< 0.1%
80.08 1
< 0.1%
80.01 1
< 0.1%
79.97 1
< 0.1%
79.93 2
< 0.1%
79.67 1
< 0.1%
79.66 1
< 0.1%

Interactions

2023-12-12T13:17:20.622275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:16.774407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.584706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.280610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.006375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.805584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.779529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:16.863442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.666113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.379895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.146688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.936804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.922360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:16.958311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.769853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.497826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.284383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.062621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:21.042201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.040566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.880825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.607370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.411386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.198705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:21.209290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.136504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.003264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.722914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.567307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.367645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:21.340124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:17.490303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.141466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:18.879129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:19.703876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:17:20.503265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:17:23.365794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분대상 아이디아이디당도키측정위치평균당도평균크기
구분1.0001.0001.0001.0000.0000.2080.650
대상 아이디1.0001.0001.0001.0000.0000.2080.650
아이디1.0001.0001.0001.0000.0000.2080.650
당도키1.0001.0001.0001.0000.0000.2080.650
측정위치0.0000.0000.0000.0001.0000.0280.000
평균당도0.2080.2080.2080.2080.0281.0000.133
평균크기0.6500.6500.6500.6500.0000.1331.000
2023-12-12T13:17:23.686001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분대상 아이디아이디당도키평균당도평균크기측정위치
구분1.0001.0001.0001.0000.076-0.5700.000
대상 아이디1.0001.0001.0001.0000.076-0.5700.000
아이디1.0001.0001.0001.0000.076-0.5700.000
당도키1.0001.0001.0001.0000.076-0.5700.000
평균당도0.0760.0760.0760.0761.000-0.1420.018
평균크기-0.570-0.570-0.570-0.570-0.1421.0000.000
측정위치0.0000.0000.0000.0000.0180.0001.000

Missing values

2023-12-12T13:17:21.507757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:17:21.633894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분대상 아이디아이디당도키측정위치평균당도평균크기
34828348291340534829348291161011.6742.0
290462904713405290472904796839.6341.87
2923729238134052923829238974610.7346.32
39143391441340539144391441304810.0343.97
2664326644134052664426644888213.2757.25
35699357001340535700357001190010.4745.73
2384723848134052384823848795012.2759.29
2516925170134052517025170839012.162.98
2212922130134052213022130737710.0767.72
2805528056134052805628056935210.5350.73
구분대상 아이디아이디당도키측정위치평균당도평균크기
2793427935134052793527935931211.643.41
4013840139134054013940139133809.2757.32
1658416585134051658516585552911.749.43
171917201340501720172057412.1769.81
175417551340501755175558512.8362.95
1958619587134051958719587652910.7360.75
3698136982134053698236982123289.7762.49
7064706513405070657065235512.2766.54
190041900513405190051900563358.858.23
2721227213134052721327213907112.9746.88