Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 894 |
Missing cells | 184 |
Missing cells (%) | 3.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 43.8 KiB |
Average record size in memory | 50.1 B |
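The missing-cell percentage follows from the counts in the table above, since the number of cells is variables × observations. A minimal sketch of that arithmetic, using only the figures reported here (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 894
missing_cells = 184

total_cells = n_variables * n_observations        # every row has one cell per variable
missing_pct = 100 * missing_cells / total_cells   # share of all cells that are missing

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 184 missing cells come from a single column (상위조직번호), which is why the per-column missing rate (20.6%) is much higher than the dataset-wide rate.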
Variable types
Numeric | 2 |
---|---|
Boolean | 3 |
Text | 1 |
Dataset
Description | Gyeonggi Province Gyeonggi Statistics System institution information (경기도 경기통계시스템 기관정보) |
---|---|
Author | Gyeonggi Province (경기도) |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=E000SKXSHDMI8GTE9TRG33424815&infSeq=1 |
Alerts
통계작성여부 has constant value "False" | Constant |
표준통계작성기관여부 has constant value "False" | Constant |
조직번호 is highly overall correlated with 상위조직번호 and 1 other fields | High correlation |
상위조직번호 is highly overall correlated with 조직번호 and 1 other fields | High correlation |
부서여부 is highly overall correlated with 조직번호 and 1 other fields | High correlation |
상위조직번호 has 184 (20.6%) missing values | Missing |
조직번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 22:14:09.077247 |
---|---|
Analysis finished | 2023-12-10 22:14:09.746964 |
Duration | 0.67 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
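A report of this kind can typically be regenerated with the `ProfileReport` class from ydata-profiling. The sketch below uses default settings only; it does not reproduce the `config.json` referenced above, and the function name `build_report` and the default file name `report.html` are illustrative assumptions, not part of the original report.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str = "report.html") -> Path:
    """Profile a CSV file and write an HTML report (sketch, default settings)."""
    # Imported inside the function so this module can still be loaded on a
    # machine where the third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)  # output format is chosen from the file extension
    return Path(html_path)
```

Calling `build_report("institutions.csv")` would profile a local copy of the dataset, where `institutions.csv` is a hypothetical file name for a download from the URL listed under Dataset.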
조직번호 (organization number)
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 894 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 212491.27 |
Minimum | 0 |
---|---|
Maximum | 994000 |
Zeros | 1 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 190.15 |
Q1 | 107010.5 |
median | 154002.5 |
Q3 | 331012.75 |
95-th percentile | 395000.35 |
Maximum | 994000 |
Range | 994000 |
Interquartile range (IQR) | 224002.25 |
Descriptive statistics
Standard deviation | 195840.25 |
---|---|
Coefficient of variation (CV) | 0.92163901 |
Kurtosis | 4.8638492 |
Mean | 212491.27 |
Median Absolute Deviation (MAD) | 153061 |
Skewness | 1.8066324 |
Sum | 1.899672 × 10^8 |
Variance | 3.8353403 × 10^10 |
Monotonicity | Not monotonic |
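Several of the figures above are derived from one another. As a quick consistency check, the following sketch recomputes the range, IQR, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation. The inputs are the rounded values printed in this report, so the results agree only up to rounding.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 0, 994000
q1, q3 = 107010.5, 331012.75
mean, std = 212491.27, 195840.25

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance is the squared standard deviation

print(value_range, iqr, round(cv, 4), f"{variance:.4e}")
```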
Common values
Value | Count | Frequency (%) |
201002 | 1 | 0.1% |
390000 | 1 | 0.1% |
385001 | 1 | 0.1% |
386000 | 1 | 0.1% |
386001 | 1 | 0.1% |
387000 | 1 | 0.1% |
387001 | 1 | 0.1% |
387002 | 1 | 0.1% |
388000 | 1 | 0.1% |
388001 | 1 | 0.1% |
Other values (884) | 884 | 98.9% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 1 | 0.1% |
101 | 1 | 0.1% |
102 | 1 | 0.1% |
103 | 1 | 0.1% |
105 | 1 | 0.1% |
106 | 1 | 0.1% |
109 | 1 | 0.1% |
110 | 1 | 0.1% |
111 | 1 | 0.1% |
112 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
994000 | 1 | 0.1% |
993000 | 1 | 0.1% |
989000 | 1 | 0.1% |
987000 | 1 | 0.1% |
986000 | 1 | 0.1% |
985000 | 1 | 0.1% |
979000 | 1 | 0.1% |
971001 | 1 | 0.1% |
971000 | 1 | 0.1% |
969001 | 1 | 0.1% |
상위조직번호 (parent organization number)
Real number (ℝ)
HIGH CORRELATION, MISSING
Distinct | 169 |
---|---|
Distinct (%) | 23.8% |
Missing | 184 |
Missing (%) | 20.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 267.46197 |
Minimum | 101 |
---|---|
Maximum | 994 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 106 |
Q1 | 118.5 |
median | 231 |
Q3 | 345 |
95-th percentile | 621 |
Maximum | 994 |
Range | 893 |
Interquartile range (IQR) | 226.5 |
Descriptive statistics
Standard deviation | 183.30984 |
---|---|
Coefficient of variation (CV) | 0.68536786 |
Kurtosis | 6.1934205 |
Mean | 267.46197 |
Median Absolute Deviation (MAD) | 113 |
Skewness | 2.2504088 |
Sum | 189898 |
Variance | 33602.497 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
117 | 32 | 3.6% |
116 | 24 | 2.7% |
361 | 22 | 2.5% |
115 | 19 | 2.1% |
118 | 19 | 2.1% |
301 | 18 | 2.0% |
101 | 18 | 2.0% |
123 | 18 | 2.0% |
114 | 17 | 1.9% |
345 | 17 | 1.9% |
Other values (159) | 506 | 56.6% |
(Missing) | 184 | 20.6% |
Minimum 10 values
Value | Count | Frequency (%) |
101 | 18 | 2.0% |
102 | 4 | 0.4% |
105 | 3 | 0.3% |
106 | 15 | 1.7% |
110 | 11 | 1.2% |
111 | 5 | 0.6% |
112 | 3 | 0.3% |
113 | 8 | 0.9% |
114 | 17 | 1.9% |
115 | 19 | 2.1% |
Maximum 10 values
Value | Count | Frequency (%) |
994 | 1 | 0.1% |
993 | 1 | 0.1% |
989 | 1 | 0.1% |
987 | 1 | 0.1% |
986 | 1 | 0.1% |
985 | 1 | 0.1% |
979 | 1 | 0.1% |
971 | 2 | 0.2% |
969 | 2 | 0.2% |
967 | 1 | 0.1% |
부서여부 (whether the unit is a department)
Boolean
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count | Frequency (%) |
True | 710 | 79.4% |
False | 184 | 20.6% |
조직명 (organization name)
Text
Distinct | 704 |
---|---|
Distinct (%) | 78.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.1 KiB |
Length
Max length | 25 |
---|---|
Median length | 22 |
Mean length | 8.5693512 |
Min length | 2 |
Characters and Unicode
Total characters | 7661 |
---|---|
Distinct characters | 291 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 3 |
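The reported mean length can be cross-checked against the total character count: with no missing values, it should equal total characters divided by the number of rows. A small sketch of that check, using only figures from this report:

```python
total_characters = 7661   # "Total characters" in the table above
n_observations = 894      # number of rows; 조직명 has no missing values

mean_length = total_characters / n_observations
print(round(mean_length, 7))
```

The result matches the "Mean length" entry in the Length table to the precision shown there.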
Unique
Unique | 694 |
---|---|
Unique (%) | 77.6% |
Sample
1st row | 교통국 교통계획과 |
---|---|
2nd row | 교통국 전산정보센터 |
3rd row | 보건복지국 사회복지과 |
4th row | 보건사회국 사회과 |
5th row | 복지건강국 보건정책과 |
Words
Value | Count | Frequency (%) |
기타 | 169 | 12.0% |
기획관리실 | 19 | 1.3% |
경제통계국 | 16 | 1.1% |
조사부 | 16 | 1.1% |
기획관실 | 10 | 0.7% |
교육정보화과 | 8 | 0.6% |
통계팀 | 7 | 0.5% |
경영조사팀 | 7 | 0.5% |
조사통계팀 | 7 | 0.5% |
정보화담당관실 | 7 | 0.5% |
Other values (918) | 1147 | 81.2% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 524 | 6.8% |
기 | 336 | 4.4% |
국 | 284 | 3.7% |
정 | 267 | 3.5% |
과 | 241 | 3.1% |
사 | 228 | 3.0% |
팀 | 190 | 2.5% |
부 | 182 | 2.4% |
타 | 171 | 2.2% |
관 | 170 | 2.2% |
Other values (281) | 5068 | 66.2% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7088 | 92.5% |
Space Separator | 528 | 6.9% |
Decimal Number | 19 | 0.2% |
Uppercase Letter | 14 | 0.2% |
Other Punctuation | 4 | 0.1% |
Close Punctuation | 3 | < 0.1% |
Open Punctuation | 3 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
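The category names in this table correspond to Unicode general categories (for example `Lo` for Other Letter and `Zs` for Space Separator). The sketch below shows one way to produce such counts with Python's standard-library `unicodedata` module; it is not necessarily how the profiler computes them, and the helper name `category_counts` and the mapping table are illustrative.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two of the sample rows shown earlier in this section.
sample = ["교통국 교통계획과", "교통국 전산정보센터"]
print(category_counts(sample))
```

Hangul syllables fall under Other Letter, which is why that category dominates the column.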
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 336 | 4.7% |
국 | 284 | 4.0% |
정 | 267 | 3.8% |
과 | 241 | 3.4% |
사 | 228 | 3.2% |
팀 | 190 | 2.7% |
부 | 182 | 2.6% |
타 | 171 | 2.4% |
관 | 170 | 2.4% |
조 | 161 | 2.3% |
Other values (260) | 4858 | 68.5% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 3 | 21.4% |
D | 2 | 14.3% |
R | 2 | 14.3% |
H | 2 | 14.3% |
K | 2 | 14.3% |
G | 1 | 7.1% |
B | 1 | 7.1% |
I | 1 | 7.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 7 | 36.8% |
2 | 6 | 31.6% |
0 | 5 | 26.3% |
1 | 1 | 5.3% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 | 50.0% |
& | 1 | 25.0% |
· | 1 | 25.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 524 | 99.2% |
(non-ASCII space) | 4 | 0.8% |
Close Punctuation
Value | Count | Frequency (%) |
) | 3 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 3 | 100.0% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7088 | 92.5% |
Common | 558 | 7.3% |
Latin | 15 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 336 | 4.7% |
국 | 284 | 4.0% |
정 | 267 | 3.8% |
과 | 241 | 3.4% |
사 | 228 | 3.2% |
팀 | 190 | 2.7% |
부 | 182 | 2.6% |
타 | 171 | 2.4% |
관 | 170 | 2.4% |
조 | 161 | 2.3% |
Other values (260) | 4858 | 68.5% |
Common
Value | Count | Frequency (%) |
(space) | 524 | 93.9% |
1 | 7 | 1.3% |
2 | 6 | 1.1% |
0 | 5 | 0.9% |
(non-ASCII space) | 4 | 0.7% |
) | 3 | 0.5% |
( | 3 | 0.5% |
/ | 2 | 0.4% |
1 | 1 | 0.2% |
& | 1 | 0.2% |
Other values (2) | 2 | 0.4% |
Latin
Value | Count | Frequency (%) |
T | 3 | 20.0% |
D | 2 | 13.3% |
R | 2 | 13.3% |
H | 2 | 13.3% |
K | 2 | 13.3% |
G | 1 | 6.7% |
B | 1 | 6.7% |
I | 1 | 6.7% |
e | 1 | 6.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7088 | 92.5% |
ASCII | 567 | 7.4% |
None | 6 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 524 | 92.4% |
1 | 7 | 1.2% |
2 | 6 | 1.1% |
0 | 5 | 0.9% |
T | 3 | 0.5% |
) | 3 | 0.5% |
( | 3 | 0.5% |
/ | 2 | 0.4% |
D | 2 | 0.4% |
R | 2 | 0.4% |
Other values (8) | 10 | 1.8% |
Hangul
Value | Count | Frequency (%) |
기 | 336 | 4.7% |
국 | 284 | 4.0% |
정 | 267 | 3.8% |
과 | 241 | 3.4% |
사 | 228 | 3.2% |
팀 | 190 | 2.7% |
부 | 182 | 2.6% |
타 | 171 | 2.4% |
관 | 170 | 2.4% |
조 | 161 | 2.3% |
Other values (260) | 4858 | 68.5% |
None
Value | Count | Frequency (%) |
(non-ASCII space) | 4 | 66.7% |
1 | 1 | 16.7% |
· | 1 | 16.7% |
통계작성여부 (whether statistics are compiled)
Boolean
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count | Frequency (%) |
False | 894 | 100.0% |
표준통계작성기관여부 (whether a standard statistics-compiling institution)
Boolean
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count | Frequency (%) |
False | 894 | 100.0% |
Correlations
Variable | 조직번호 | 상위조직번호 | 부서여부 |
---|---|---|---|
조직번호 | 1.000 | 0.997 | 1.000 |
상위조직번호 | 0.997 | 1.000 | NaN |
부서여부 | 1.000 | NaN | 1.000 |
Variable | 조직번호 | 상위조직번호 | 부서여부 |
---|---|---|---|
조직번호 | 1.000 | 1.000 | 0.997 |
상위조직번호 | 1.000 | 1.000 | 1.000 |
부서여부 | 0.997 | 1.000 | 1.000 |
First rows
Index | 조직번호 | 상위조직번호 | 부서여부 | 조직명 | 통계작성여부 | 표준통계작성기관여부 |
---|---|---|---|---|---|---|
0 | 201002 | 201 | Y | 교통국 교통계획과 | N | N |
1 | 201003 | 201 | Y | 교통국 전산정보센터 | N | N |
2 | 201004 | 201 | Y | 보건복지국 사회복지과 | N | N |
3 | 201005 | 201 | Y | 보건사회국 사회과 | N | N |
4 | 201006 | 201 | Y | 복지건강국 보건정책과 | N | N |
5 | 201007 | 201 | Y | 복지건강국 장애인복지과 | N | N |
6 | 201008 | 201 | Y | 복지여성국 보건과 | N | N |
7 | 201009 | 201 | Y | 송파구 기획예산과 | N | N |
8 | 201010 | 201 | Y | 전산정보담당관실 | N | N |
9 | 201011 | 201 | Y | 정보화기획단장 정보화기획담당관실 | N | N |
Last rows
Index | 조직번호 | 상위조직번호 | 부서여부 | 조직명 | 통계작성여부 | 표준통계작성기관여부 |
---|---|---|---|---|---|---|
884 | 132002 | 132 | Y | 교통관리관실 교통안전과 | N | N |
885 | 133000 | 133 | Y | 기타 | N | N |
886 | 133001 | 133 | Y | 부동산납세관리국 종합부동산세과 | N | N |
887 | 134000 | 134 | Y | 기타 | N | N |
888 | 134001 | 134 | Y | 통관지원국 통관기획과 | N | N |
889 | 135000 | 135 | Y | 기타 | N | N |
890 | 135001 | 135 | Y | 총무부 기획과 | N | N |
891 | 136000 | 136 | Y | 기타 | N | N |
892 | 136001 | 136 | Y | 사유림자원국 산림소득과 | N | N |
893 | 136002 | 136 | Y | 사유림지원국 산림소득과 | N | N |
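In the sample rows above, 상위조직번호 always equals 조직번호 with its last three digits dropped (for example 201002 and 201), which may help explain the high correlation reported between the two columns. The sketch below checks that pattern on a handful of the rows shown. The encoding is an assumption inferred from these samples only, and the helper name `inferred_parent` is hypothetical.

```python
# (조직번호, 상위조직번호) pairs copied from the first and last rows above.
sample_rows = [
    (201002, 201), (201003, 201), (201011, 201),
    (132002, 132), (133000, 133), (136002, 136),
]


def inferred_parent(org_number: int) -> int:
    """Drop the last three digits (assumed encoding, seen only in the samples)."""
    return org_number // 1000


# Collect any sample row where the assumed rule does not hold.
mismatches = [(org, parent) for org, parent in sample_rows
              if inferred_parent(org) != parent]
print(mismatches)
```

The rule cannot apply to the 184 rows where 상위조직번호 is missing, so any use of it on the full dataset would need to treat those rows separately.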