Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 726 |
Missing cells | 5030 |
Missing cells (%) | 57.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 72.4 KiB |
Average record size in memory | 102.2 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 6 |
Text | 3 |
Boolean | 1 |
Dataset
Description | 농가경영기록장 시스템에서 사용하는 코드에 대한 정보입니다. 공통코드와 날씨지역정보 및 지역별 우편변호 코드를 포함합니다. 공통코드와 지역코드를 구분컬럼으로 구분하였습니다. |
---|---|
URL | https://www.data.go.kr/data/15119435/fileData.do |
사용여부 has constant value "" | Constant |
코드_구분 is highly overall correlated with 구분 | High correlation |
구분 is highly overall correlated with 기관코드 and 6 other fields | High correlation |
기관코드 is highly overall correlated with 엑스좌표(X) and 2 other fields | High correlation |
엑스좌표(X) is highly overall correlated with 기관코드 and 1 other fields | High correlation |
와이좌표(Y) is highly overall correlated with 우편번호_시작 and 2 other fields | High correlation |
우편번호_시작 is highly overall correlated with 와이좌표(Y) and 2 other fields | High correlation |
우편번호_끝 is highly overall correlated with 기관코드 and 3 other fields | High correlation |
순서번호 is highly overall correlated with 구분 | High correlation |
구분 is highly imbalanced (85.5%) | Imbalance |
기관코드 has 711 (97.9%) missing values | Missing |
지역명 has 711 (97.9%) missing values | Missing |
엑스좌표(X) has 711 (97.9%) missing values | Missing |
와이좌표(Y) has 711 (97.9%) missing values | Missing |
사용여부 has 711 (97.9%) missing values | Missing |
우편번호_시작 has 711 (97.9%) missing values | Missing |
우편번호_끝 has 711 (97.9%) missing values | Missing |
코드_아이디 has 15 (2.1%) missing values | Missing |
코드_이름 has 15 (2.1%) missing values | Missing |
순서번호 has 23 (3.2%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 14:20:28.932103 |
---|---|
Analysis finished | 2023-12-12 14:20:33.596295 |
Duration | 4.66 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.8 KiB |
공통코드 | |
---|---|
날씨지역코드 | 15 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.0413223 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 날씨지역코드 |
---|---|
2nd row | 날씨지역코드 |
3rd row | 날씨지역코드 |
4th row | 날씨지역코드 |
5th row | 날씨지역코드 |
Common Values
Value | Count | Frequency (%) |
공통코드 | 711 | |
날씨지역코드 | 15 | 2.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공통코드 | 711 | |
날씨지역코드 | 15 | 2.1% |
기관코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 100.0% |
Missing | 711 |
Missing (%) | 97.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.3449667 × 109 |
Minimum | 4.3 × 109 |
---|---|
Maximum | 4.38 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 4.3 × 109 |
---|---|
5-th percentile | 4.30777 × 109 |
Q1 | 4.31135 × 109 |
median | 4.372 × 109 |
Q3 | 4.37475 × 109 |
95-th percentile | 4.3779 × 109 |
Maximum | 4.38 × 109 |
Range | 80000000 |
Interquartile range (IQR) | 63400000 |
Descriptive statistics
Standard deviation | 33636219 |
---|---|
Coefficient of variation (CV) | 0.0077414217 |
Kurtosis | -2.2212036 |
Mean | 4.3449667 × 109 |
Median Absolute Deviation (MAD) | 8000000 |
Skewness | -0.17099757 |
Sum | 6.51745 × 1010 |
Variance | 1.1313952 × 1015 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
4300000000 | 1 | 0.1% |
4311100000 | 1 | 0.1% |
4311200000 | 1 | 0.1% |
4311300000 | 1 | 0.1% |
4311400000 | 1 | 0.1% |
4313000000 | 1 | 0.1% |
4315000000 | 1 | 0.1% |
4372000000 | 1 | 0.1% |
4373000000 | 1 | 0.1% |
4374000000 | 1 | 0.1% |
Other values (5) | 5 | 0.7% |
(Missing) | 711 |
Value | Count | Frequency (%) |
4300000000 | 1 | |
4311100000 | 1 | |
4311200000 | 1 | |
4311300000 | 1 | |
4311400000 | 1 | |
4313000000 | 1 | |
4315000000 | 1 | |
4372000000 | 1 | |
4373000000 | 1 | |
4374000000 | 1 |
Value | Count | Frequency (%) |
4380000000 | 1 | |
4377000000 | 1 | |
4376000000 | 1 | |
4375000000 | 1 | |
4374500000 | 1 | |
4374000000 | 1 | |
4373000000 | 1 | |
4372000000 | 1 | |
4315000000 | 1 | |
4313000000 | 1 |
지역명
Text
MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 100.0% |
Missing | 711 |
Missing (%) | 97.9% |
Memory size | 5.8 KiB |
Length
Max length | 12 |
---|---|
Median length | 8 |
Mean length | 8.8 |
Min length | 4 |
Characters and Unicode
Total characters | 132 |
---|---|
Distinct characters | 31 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 15 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 충청북도 |
---|---|
2nd row | 충청북도 청주시 상당구 |
3rd row | 충청북도 청주시 서원구 |
4th row | 충청북도 청주시 흥덕구 |
5th row | 충청북도 청주시 청원구 |
Value | Count | Frequency (%) |
충청북도 | 15 | |
청주시 | 4 | 12.1% |
상당구 | 1 | 3.0% |
서원구 | 1 | 3.0% |
흥덕구 | 1 | 3.0% |
청원구 | 1 | 3.0% |
충주시 | 1 | 3.0% |
제천시 | 1 | 3.0% |
보은군 | 1 | 3.0% |
옥천군 | 1 | 3.0% |
Other values (6) | 6 | 18.2% |
Most occurring characters
Value | Count | Frequency (%) |
청 | 20 | |
18 | ||
충 | 16 | |
북 | 15 | |
도 | 15 | |
군 | 8 | 6.1% |
시 | 6 | 4.5% |
주 | 5 | 3.8% |
구 | 4 | 3.0% |
천 | 3 | 2.3% |
Other values (21) | 22 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 114 | |
Space Separator | 18 | 13.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
청 | 20 | |
충 | 16 | |
북 | 15 | |
도 | 15 | |
군 | 8 | 7.0% |
시 | 6 | 5.3% |
주 | 5 | 4.4% |
구 | 4 | 3.5% |
천 | 3 | 2.6% |
원 | 2 | 1.8% |
Other values (20) | 20 |
Space Separator
Value | Count | Frequency (%) |
18 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 114 | |
Common | 18 | 13.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
청 | 20 | |
충 | 16 | |
북 | 15 | |
도 | 15 | |
군 | 8 | 7.0% |
시 | 6 | 5.3% |
주 | 5 | 4.4% |
구 | 4 | 3.5% |
천 | 3 | 2.6% |
원 | 2 | 1.8% |
Other values (20) | 20 |
Common
Value | Count | Frequency (%) |
18 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 114 | |
ASCII | 18 | 13.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
청 | 20 | |
충 | 16 | |
북 | 15 | |
도 | 15 | |
군 | 8 | 7.0% |
시 | 6 | 5.3% |
주 | 5 | 4.4% |
구 | 4 | 3.5% |
천 | 3 | 2.6% |
원 | 2 | 1.8% |
Other values (20) | 20 |
ASCII
Value | Count | Frequency (%) |
18 |
엑스좌표(X)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 66.7% |
Missing | 711 |
Missing (%) | 97.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 72.466667 |
Minimum | 67 |
---|---|
Maximum | 84 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 67 |
---|---|
5-th percentile | 67.7 |
Q1 | 69 |
median | 71 |
Q3 | 74 |
95-th percentile | 81.9 |
Maximum | 84 |
Range | 17 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 4.8235533 |
---|---|
Coefficient of variation (CV) | 0.066562373 |
Kurtosis | 1.3140841 |
Mean | 72.466667 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.3082537 |
Sum | 1087 |
Variance | 23.266667 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
69 | 4 | 0.6% |
71 | 2 | 0.3% |
74 | 2 | 0.3% |
67 | 1 | 0.1% |
76 | 1 | 0.1% |
81 | 1 | 0.1% |
73 | 1 | 0.1% |
68 | 1 | 0.1% |
72 | 1 | 0.1% |
84 | 1 | 0.1% |
(Missing) | 711 |
Value | Count | Frequency (%) |
67 | 1 | 0.1% |
68 | 1 | 0.1% |
69 | 4 | |
71 | 2 | |
72 | 1 | 0.1% |
73 | 1 | 0.1% |
74 | 2 | |
76 | 1 | 0.1% |
81 | 1 | 0.1% |
84 | 1 | 0.1% |
Value | Count | Frequency (%) |
84 | 1 | 0.1% |
81 | 1 | 0.1% |
76 | 1 | 0.1% |
74 | 2 | |
73 | 1 | 0.1% |
72 | 1 | 0.1% |
71 | 2 | |
69 | 4 | |
68 | 1 | 0.1% |
67 | 1 | 0.1% |
와이좌표(Y)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 73.3% |
Missing | 711 |
Missing (%) | 97.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 108.26667 |
Minimum | 97 |
---|---|
Maximum | 118 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 97 |
---|---|
5-th percentile | 98.4 |
Q1 | 106 |
median | 107 |
Q3 | 112 |
95-th percentile | 115.9 |
Maximum | 118 |
Range | 21 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 5.7875071 |
---|---|
Coefficient of variation (CV) | 0.053456038 |
Kurtosis | -0.14020111 |
Mean | 108.26667 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.35433575 |
Sum | 1624 |
Variance | 33.495238 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
107 | 3 | 0.4% |
106 | 2 | 0.3% |
111 | 2 | 0.3% |
114 | 1 | 0.1% |
118 | 1 | 0.1% |
103 | 1 | 0.1% |
99 | 1 | 0.1% |
97 | 1 | 0.1% |
110 | 1 | 0.1% |
113 | 1 | 0.1% |
(Missing) | 711 |
Value | Count | Frequency (%) |
97 | 1 | 0.1% |
99 | 1 | 0.1% |
103 | 1 | 0.1% |
106 | 2 | |
107 | 3 | |
110 | 1 | 0.1% |
111 | 2 | |
113 | 1 | 0.1% |
114 | 1 | 0.1% |
115 | 1 | 0.1% |
Value | Count | Frequency (%) |
118 | 1 | 0.1% |
115 | 1 | 0.1% |
114 | 1 | 0.1% |
113 | 1 | 0.1% |
111 | 2 | |
110 | 1 | 0.1% |
107 | 3 | |
106 | 2 | |
103 | 1 | 0.1% |
99 | 1 | 0.1% |
사용여부
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 6.7% |
Missing | 711 |
Missing (%) | 97.9% |
Memory size | 1.5 KiB |
True | 15 |
---|---|
(Missing) |
Value | Count | Frequency (%) |
True | 15 | 2.1% |
(Missing) | 711 |
우편번호_시작
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 73.3% |
Missing | 711 |
Missing (%) | 97.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 27940 |
Minimum | 27000 |
---|---|
Maximum | 29100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 27000 |
---|---|
5-th percentile | 27000 |
Q1 | 27450 |
median | 28000 |
Q3 | 28100 |
95-th percentile | 29030 |
Maximum | 29100 |
Range | 2100 |
Interquartile range (IQR) | 650 |
Descriptive statistics
Standard deviation | 682.22326 |
---|---|
Coefficient of variation (CV) | 0.024417439 |
Kurtosis | -0.64131349 |
Mean | 27940 |
Median Absolute Deviation (MAD) | 400 |
Skewness | 0.28425101 |
Sum | 419100 |
Variance | 465428.57 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
28100 | 4 | 0.6% |
27000 | 2 | 0.3% |
27300 | 1 | 0.1% |
27100 | 1 | 0.1% |
28900 | 1 | 0.1% |
29000 | 1 | 0.1% |
29100 | 1 | 0.1% |
27900 | 1 | 0.1% |
27800 | 1 | 0.1% |
28000 | 1 | 0.1% |
(Missing) | 711 |
Value | Count | Frequency (%) |
27000 | 2 | |
27100 | 1 | 0.1% |
27300 | 1 | 0.1% |
27600 | 1 | 0.1% |
27800 | 1 | 0.1% |
27900 | 1 | 0.1% |
28000 | 1 | 0.1% |
28100 | 4 | |
28900 | 1 | 0.1% |
29000 | 1 | 0.1% |
Value | Count | Frequency (%) |
29100 | 1 | 0.1% |
29000 | 1 | 0.1% |
28900 | 1 | 0.1% |
28100 | 4 | |
28000 | 1 | 0.1% |
27900 | 1 | 0.1% |
27800 | 1 | 0.1% |
27600 | 1 | 0.1% |
27300 | 1 | 0.1% |
27100 | 1 | 0.1% |
우편번호_끝
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 80.0% |
Missing | 711 |
Missing (%) | 97.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28445.667 |
Minimum | 27099 |
---|---|
Maximum | 29999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 27099 |
---|---|
5-th percentile | 27239 |
Q1 | 27849 |
median | 28899 |
Q3 | 28949 |
95-th percentile | 29439 |
Maximum | 29999 |
Range | 2900 |
Interquartile range (IQR) | 1100 |
Descriptive statistics
Standard deviation | 820.16259 |
---|---|
Coefficient of variation (CV) | 0.028832602 |
Kurtosis | -0.74834861 |
Mean | 28445.667 |
Median Absolute Deviation (MAD) | 800 |
Skewness | -0.018666381 |
Sum | 426685 |
Variance | 672666.67 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
28899 | 4 | 0.6% |
29999 | 1 | 0.1% |
27599 | 1 | 0.1% |
27299 | 1 | 0.1% |
28999 | 1 | 0.1% |
29099 | 1 | 0.1% |
29199 | 1 | 0.1% |
27999 | 1 | 0.1% |
27899 | 1 | 0.1% |
28099 | 1 | 0.1% |
Other values (2) | 2 | 0.3% |
(Missing) | 711 |
Value | Count | Frequency (%) |
27099 | 1 | 0.1% |
27299 | 1 | 0.1% |
27599 | 1 | 0.1% |
27799 | 1 | 0.1% |
27899 | 1 | 0.1% |
27999 | 1 | 0.1% |
28099 | 1 | 0.1% |
28899 | 4 | |
28999 | 1 | 0.1% |
29099 | 1 | 0.1% |
Value | Count | Frequency (%) |
29999 | 1 | 0.1% |
29199 | 1 | 0.1% |
29099 | 1 | 0.1% |
28999 | 1 | 0.1% |
28899 | 4 | |
28099 | 1 | 0.1% |
27999 | 1 | 0.1% |
27899 | 1 | 0.1% |
27799 | 1 | 0.1% |
27599 | 1 | 0.1% |
코드_구분
Categorical
HIGH CORRELATION
 
Distinct | 42 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.8 KiB |
CROP_C_CD | |
---|---|
YEAR_T_CD | |
CROP_B_CD | |
BANK_T_CD | 26 |
CROP_W_CD | 19 |
Other values (37) |
Length
Max length | 13 |
---|---|
Median length | 9 |
Mean length | 8.9283747 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
CROP_C_CD | 404 | |
YEAR_T_CD | 52 | 7.2% |
CROP_B_CD | 49 | 6.7% |
BANK_T_CD | 26 | 3.6% |
CROP_W_CD | 19 | 2.6% |
PACK_T_CD | 17 | 2.3% |
<NA> | 15 | 2.1% |
MT_T_CD | 12 | 1.7% |
CROP_A_CD | 12 | 1.7% |
SKY_T_CD | 10 | 1.4% |
Other values (32) | 110 | 15.2% |
Length
Value | Count | Frequency (%) |
crop_c_cd | 404 | |
year_t_cd | 52 | 7.2% |
crop_b_cd | 49 | 6.7% |
bank_t_cd | 26 | 3.6% |
crop_w_cd | 19 | 2.6% |
pack_t_cd | 17 | 2.3% |
na | 15 | 2.1% |
mt_t_cd | 12 | 1.7% |
crop_a_cd | 12 | 1.7% |
sky_t_cd | 10 | 1.4% |
Other values (32) | 110 | 15.2% |
코드_아이디
Text
MISSING
 
Distinct | 545 |
---|---|
Distinct (%) | 76.7% |
Missing | 15 |
Missing (%) | 2.1% |
Memory size | 5.8 KiB |
Value | Count | Frequency (%) |
1 | 23 | 3.2% |
2 | 23 | 3.2% |
3 | 18 | 2.5% |
4 | 14 | 2.0% |
5 | 11 | 1.5% |
6 | 7 | 1.0% |
7 | 7 | 1.0% |
d | 7 | 1.0% |
8 | 6 | 0.8% |
9 | 6 | 0.8% |
Other values (535) | 589 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 327 | |
2 | 326 | |
0 | 313 | |
5 | 312 | |
3 | 224 | |
4 | 222 | |
6 | 110 | 5.1% |
7 | 110 | 5.1% |
8 | 80 | 3.7% |
9 | 74 | 3.4% |
Other values (21) | 64 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2098 | |
Uppercase Letter | 64 | 3.0% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
D | 7 | 10.9% |
F | 6 | 9.4% |
B | 6 | 9.4% |
C | 6 | 9.4% |
I | 5 | 7.8% |
A | 4 | 6.2% |
L | 3 | 4.7% |
S | 3 | 4.7% |
H | 3 | 4.7% |
P | 3 | 4.7% |
Other values (11) | 18 |
Decimal Number
Value | Count | Frequency (%) |
1 | 327 | |
2 | 326 | |
0 | 313 | |
5 | 312 | |
3 | 224 | |
4 | 222 | |
6 | 110 | 5.2% |
7 | 110 | 5.2% |
8 | 80 | 3.8% |
9 | 74 | 3.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2098 | |
Latin | 64 | 3.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
D | 7 | 10.9% |
F | 6 | 9.4% |
B | 6 | 9.4% |
C | 6 | 9.4% |
I | 5 | 7.8% |
A | 4 | 6.2% |
L | 3 | 4.7% |
S | 3 | 4.7% |
H | 3 | 4.7% |
P | 3 | 4.7% |
Other values (11) | 18 |
Common
Value | Count | Frequency (%) |
1 | 327 | |
2 | 326 | |
0 | 313 | |
5 | 312 | |
3 | 224 | |
4 | 222 | |
6 | 110 | 5.2% |
7 | 110 | 5.2% |
8 | 80 | 3.8% |
9 | 74 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2162 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 327 | |
2 | 326 | |
0 | 313 | |
5 | 312 | |
3 | 224 | |
4 | 222 | |
6 | 110 | 5.1% |
7 | 110 | 5.1% |
8 | 80 | 3.7% |
9 | 74 | 3.4% |
Other values (21) | 64 | 3.0% |
코드_이름
Text
MISSING
 
Distinct | 684 |
---|---|
Distinct (%) | 96.2% |
Missing | 15 |
Missing (%) | 2.1% |
Memory size | 5.8 KiB |
Value | Count | Frequency (%) |
기타 | 9 | 1.2% |
대변 | 3 | 0.4% |
추가 | 3 | 0.4% |
현금 | 3 | 0.4% |
차변 | 3 | 0.4% |
흐리고 | 3 | 0.4% |
가끔 | 3 | 0.4% |
과세 | 2 | 0.3% |
비 | 2 | 0.3% |
눈 | 2 | 0.3% |
Other values (675) | 687 |
Most occurring characters
Value | Count | Frequency (%) |
류 | 70 | 2.9% |
리 | 55 | 2.3% |
기 | 51 | 2.1% |
2 | 50 | 2.1% |
0 | 49 | 2.0% |
아 | 47 | 1.9% |
타 | 43 | 1.8% |
1 | 41 | 1.7% |
스 | 37 | 1.5% |
9 | 36 | 1.5% |
Other values (427) | 1953 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2113 | |
Decimal Number | 225 | 9.3% |
Open Punctuation | 28 | 1.2% |
Close Punctuation | 28 | 1.2% |
Other Punctuation | 10 | 0.4% |
Lowercase Letter | 10 | 0.4% |
Space Separator | 9 | 0.4% |
Uppercase Letter | 9 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
류 | 70 | 3.3% |
리 | 55 | 2.6% |
기 | 51 | 2.4% |
아 | 47 | 2.2% |
타 | 43 | 2.0% |
스 | 37 | 1.8% |
나 | 36 | 1.7% |
무 | 30 | 1.4% |
라 | 28 | 1.3% |
고 | 27 | 1.3% |
Other values (399) | 1689 |
Decimal Number
Value | Count | Frequency (%) |
2 | 50 | |
0 | 49 | |
1 | 41 | |
9 | 36 | |
8 | 16 | 7.1% |
3 | 8 | 3.6% |
4 | 7 | 3.1% |
5 | 6 | 2.7% |
7 | 6 | 2.7% |
6 | 6 | 2.7% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 2 | |
C | 2 | |
L | 1 | |
Y | 1 | |
N | 1 | |
F | 1 | |
B | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 3 | |
g | 3 | |
k | 2 | |
l | 1 | 10.0% |
w | 1 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 6 | |
. | 3 | |
, | 1 | 10.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 28 |
Close Punctuation
Value | Count | Frequency (%) |
) | 28 |
Space Separator
Value | Count | Frequency (%) |
9 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2111 | |
Common | 300 | 12.3% |
Latin | 19 | 0.8% |
Han | 2 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
류 | 70 | 3.3% |
리 | 55 | 2.6% |
기 | 51 | 2.4% |
아 | 47 | 2.2% |
타 | 43 | 2.0% |
스 | 37 | 1.8% |
나 | 36 | 1.7% |
무 | 30 | 1.4% |
라 | 28 | 1.3% |
고 | 27 | 1.3% |
Other values (397) | 1687 |
Common
Value | Count | Frequency (%) |
2 | 50 | |
0 | 49 | |
1 | 41 | |
9 | 36 | |
( | 28 | |
) | 28 | |
8 | 16 | 5.3% |
9 | 3.0% | |
3 | 8 | 2.7% |
4 | 7 | 2.3% |
Other values (6) | 28 |
Latin
Value | Count | Frequency (%) |
m | 3 | |
g | 3 | |
S | 2 | |
C | 2 | |
k | 2 | |
l | 1 | 5.3% |
w | 1 | 5.3% |
L | 1 | 5.3% |
Y | 1 | 5.3% |
N | 1 | 5.3% |
Other values (2) | 2 |
Han
Value | Count | Frequency (%) |
蜂 | 1 | |
羊 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2110 | |
ASCII | 319 | 13.1% |
CJK | 2 | 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
류 | 70 | 3.3% |
리 | 55 | 2.6% |
기 | 51 | 2.4% |
아 | 47 | 2.2% |
타 | 43 | 2.0% |
스 | 37 | 1.8% |
나 | 36 | 1.7% |
무 | 30 | 1.4% |
라 | 28 | 1.3% |
고 | 27 | 1.3% |
Other values (396) | 1686 |
ASCII
Value | Count | Frequency (%) |
2 | 50 | |
0 | 49 | |
1 | 41 | |
9 | 36 | |
( | 28 | |
) | 28 | |
8 | 16 | 5.0% |
9 | 2.8% | |
3 | 8 | 2.5% |
4 | 7 | 2.2% |
Other values (18) | 47 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
CJK
Value | Count | Frequency (%) |
蜂 | 1 | |
羊 | 1 |
순서번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 407 |
---|---|
Distinct (%) | 57.9% |
Missing | 23 |
Missing (%) | 3.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.1394 |
Minimum | 0 |
---|---|
Maximum | 416 |
Zeros | 1 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 9 |
median | 56 |
Q3 | 240.5 |
95-th percentile | 380.9 |
Maximum | 416 |
Range | 416 |
Interquartile range (IQR) | 231.5 |
Descriptive statistics
Standard deviation | 134.24 |
---|---|
Coefficient of variation (CV) | 1.0558489 |
Kurtosis | -0.9492711 |
Mean | 127.1394 |
Median Absolute Deviation (MAD) | 54 |
Skewness | 0.72682817 |
Sum | 89379 |
Variance | 18020.377 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 40 | 5.5% |
2 | 39 | 5.4% |
3 | 26 | 3.6% |
4 | 19 | 2.6% |
5 | 17 | 2.3% |
6 | 13 | 1.8% |
7 | 11 | 1.5% |
8 | 9 | 1.2% |
9 | 9 | 1.2% |
10 | 9 | 1.2% |
Other values (397) | 511 | |
(Missing) | 23 | 3.2% |
Value | Count | Frequency (%) |
0 | 1 | 0.1% |
1 | 40 | |
2 | 39 | |
3 | 26 | |
4 | 19 | |
5 | 17 | |
6 | 13 | 1.8% |
7 | 11 | 1.5% |
8 | 9 | 1.2% |
9 | 9 | 1.2% |
Value | Count | Frequency (%) |
416 | 1 | |
415 | 1 | |
414 | 1 | |
413 | 1 | |
412 | 1 | |
411 | 1 | |
410 | 1 | |
409 | 1 | |
408 | 1 | |
407 | 1 |
구분 | 기관코드 | 지역명 | 엑스좌표(X) | 와이좌표(Y) | 우편번호_시작 | 우편번호_끝 | 코드_구분 | 순서번호 | |
---|---|---|---|---|---|---|---|---|---|
구분 | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
기관코드 | NaN | 1.000 | 1.000 | 0.922 | 0.918 | 0.629 | 0.383 | NaN | NaN |
지역명 | NaN | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | NaN |
엑스좌표(X) | NaN | 0.922 | 1.000 | 1.000 | 0.829 | 0.904 | 0.844 | NaN | NaN |
와이좌표(Y) | NaN | 0.918 | 1.000 | 0.829 | 1.000 | 0.989 | 0.930 | NaN | NaN |
우편번호_시작 | NaN | 0.629 | 1.000 | 0.904 | 0.989 | 1.000 | 0.924 | NaN | NaN |
우편번호_끝 | NaN | 0.383 | 1.000 | 0.844 | 0.930 | 0.924 | 1.000 | NaN | NaN |
코드_구분 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | 0.510 |
순서번호 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.510 | 1.000 |
코드_구분 | 구분 | |
---|---|---|
코드_구분 | 1.000 | 1.000 |
구분 | 1.000 | 1.000 |
기관코드 | 엑스좌표(X) | 와이좌표(Y) | 우편번호_시작 | 우편번호_끝 | 순서번호 | 구분 | 코드_구분 | |
---|---|---|---|---|---|---|---|---|
기관코드 | 1.000 | 0.502 | 0.375 | -0.148 | -0.501 | NaN | 1.000 | 0.000 |
엑스좌표(X) | 0.502 | 1.000 | 0.443 | -0.275 | -0.447 | NaN | 1.000 | 0.000 |
와이좌표(Y) | 0.375 | 0.443 | 1.000 | -0.863 | -0.904 | NaN | 1.000 | 0.000 |
우편번호_시작 | -0.148 | -0.275 | -0.863 | 1.000 | 0.644 | NaN | 1.000 | 0.000 |
우편번호_끝 | -0.501 | -0.447 | -0.904 | 0.644 | 1.000 | NaN | 1.000 | 0.000 |
순서번호 | NaN | NaN | NaN | NaN | NaN | 1.000 | 1.000 | 0.183 |
구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
코드_구분 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.183 | 1.000 | 1.000 |
구분 | 기관코드 | 지역명 | 엑스좌표(X) | 와이좌표(Y) | 사용여부 | 우편번호_시작 | 우편번호_끝 | 코드_구분 | 코드_아이디 | 코드_이름 | 순서번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 날씨지역코드 | 4300000000 | 충청북도 | 69 | 107 | Y | 27000 | 29999 | <NA> | <NA> | <NA> | <NA> |
1 | 날씨지역코드 | 4311100000 | 충청북도 청주시 상당구 | 69 | 106 | Y | 28100 | 28899 | <NA> | <NA> | <NA> | <NA> |
2 | 날씨지역코드 | 4311200000 | 충청북도 청주시 서원구 | 69 | 107 | Y | 28100 | 28899 | <NA> | <NA> | <NA> | <NA> |
3 | 날씨지역코드 | 4311300000 | 충청북도 청주시 흥덕구 | 67 | 106 | Y | 28100 | 28899 | <NA> | <NA> | <NA> | <NA> |
4 | 날씨지역코드 | 4311400000 | 충청북도 청주시 청원구 | 69 | 107 | Y | 28100 | 28899 | <NA> | <NA> | <NA> | <NA> |
5 | 날씨지역코드 | 4313000000 | 충청북도 충주시 | 76 | 114 | Y | 27300 | 27599 | <NA> | <NA> | <NA> | <NA> |
6 | 날씨지역코드 | 4315000000 | 충청북도 제천시 | 81 | 118 | Y | 27100 | 27299 | <NA> | <NA> | <NA> | <NA> |
7 | 날씨지역코드 | 4372000000 | 충청북도 보은군 | 73 | 103 | Y | 28900 | 28999 | <NA> | <NA> | <NA> | <NA> |
8 | 날씨지역코드 | 4373000000 | 충청북도 옥천군 | 71 | 99 | Y | 29000 | 29099 | <NA> | <NA> | <NA> | <NA> |
9 | 날씨지역코드 | 4374000000 | 충청북도 영동군 | 74 | 97 | Y | 29100 | 29199 | <NA> | <NA> | <NA> | <NA> |
구분 | 기관코드 | 지역명 | 엑스좌표(X) | 와이좌표(Y) | 사용여부 | 우편번호_시작 | 우편번호_끝 | 코드_구분 | 코드_아이디 | 코드_이름 | 순서번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
716 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3306 | 연 | 133 |
717 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3307 | 토란 | 134 |
718 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3308 | 기타근채류 | 135 |
719 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3401 | 양파 | 136 |
720 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3402 | 파 | 137 |
721 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3403 | 마늘 | 138 |
722 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3404 | 생강 | 139 |
723 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3405 | 고추냉이 | 140 |
724 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3406 | 기타조미채소류 | 141 |
725 | 공통코드 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | CROP_C_CD | 3501 | 고들빼기 | 142 |