Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 424 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 17.5 KiB |
Average record size in memory | 42.3 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | KT |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=43 |
DO_NM(시도명) has constant value "" | Constant |
H_SDNG_CD(통계청행정동코드) is highly overall correlated with H_DNG_CD(행자부행정동코드) and 1 other fields | High correlation |
H_DNG_CD(행자부행정동코드) is highly overall correlated with H_SDNG_CD(통계청행정동코드) and 1 other fields | High correlation |
CT_NM(시군구명) is highly overall correlated with H_SDNG_CD(통계청행정동코드) and 1 other fields | High correlation |
H_SDNG_CD(통계청행정동코드) has unique values | Unique |
H_DNG_CD(행자부행정동코드) has unique values | Unique |
Reproduction
Analysis started | 2024-01-14 06:49:08.061699 |
---|---|
Analysis finished | 2024-01-14 06:49:08.821442 |
Duration | 0.76 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
H_SDNG_CD(통계청행정동코드)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 424 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1113663.9 |
Minimum | 1101053 |
---|---|
Maximum | 1125074 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.9 KiB |
Quantile statistics
Minimum | 1101053 |
---|---|
5-th percentile | 1102058.1 |
Q1 | 1107069.8 |
median | 1114068.5 |
Q3 | 1120317.8 |
95-th percentile | 1124078.9 |
Maximum | 1125074 |
Range | 24021 |
Interquartile range (IQR) | 13248 |
Descriptive statistics
Standard deviation | 7411.4876 |
---|---|
Coefficient of variation (CV) | 0.006655049 |
Kurtosis | -1.2527027 |
Mean | 1113663.9 |
Median Absolute Deviation (MAD) | 6991 |
Skewness | -0.084974899 |
Sum | 4.7219348 × 108 |
Variance | 54930149 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1101053 | 1 | 0.2% |
1119066 | 1 | 0.2% |
1119063 | 1 | 0.2% |
1119062 | 1 | 0.2% |
1119061 | 1 | 0.2% |
1119056 | 1 | 0.2% |
1119055 | 1 | 0.2% |
1119054 | 1 | 0.2% |
1118061 | 1 | 0.2% |
1118060 | 1 | 0.2% |
Other values (414) | 414 |
Value | Count | Frequency (%) |
1101053 | 1 | |
1101054 | 1 | |
1101055 | 1 | |
1101056 | 1 | |
1101057 | 1 | |
1101058 | 1 | |
1101060 | 1 | |
1101061 | 1 | |
1101063 | 1 | |
1101064 | 1 |
Value | Count | Frequency (%) |
1125074 | 1 | |
1125073 | 1 | |
1125072 | 1 | |
1125071 | 1 | |
1125070 | 1 | |
1125067 | 1 | |
1125066 | 1 | |
1125065 | 1 | |
1125063 | 1 | |
1125061 | 1 |
H_DNG_CD(행자부행정동코드)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 424 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11433195 |
Minimum | 11110515 |
---|---|
Maximum | 11740700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.9 KiB |
Quantile statistics
Minimum | 11110515 |
---|---|
5-th percentile | 11140582 |
Q1 | 11260649 |
median | 11440620 |
Q3 | 11598141 |
95-th percentile | 11710678 |
Maximum | 11740700 |
Range | 630185 |
Interquartile range (IQR) | 337492.5 |
Descriptive statistics
Standard deviation | 191894.28 |
---|---|
Coefficient of variation (CV) | 0.016783959 |
Kurtosis | -1.2660373 |
Mean | 11433195 |
Median Absolute Deviation (MAD) | 179942.5 |
Skewness | -0.011417148 |
Sum | 4.8476747 × 109 |
Variance | 3.6823416 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11110530 | 1 | 0.2% |
11560660 | 1 | 0.2% |
11560630 | 1 | 0.2% |
11560620 | 1 | 0.2% |
11560610 | 1 | 0.2% |
11560560 | 1 | 0.2% |
11560550 | 1 | 0.2% |
11560540 | 1 | 0.2% |
11545710 | 1 | 0.2% |
11545700 | 1 | 0.2% |
Other values (414) | 414 |
Value | Count | Frequency (%) |
11110515 | 1 | |
11110530 | 1 | |
11110540 | 1 | |
11110550 | 1 | |
11110560 | 1 | |
11110570 | 1 | |
11110580 | 1 | |
11110600 | 1 | |
11110615 | 1 | |
11110630 | 1 |
Value | Count | Frequency (%) |
11740700 | 1 | |
11740690 | 1 | |
11740685 | 1 | |
11740660 | 1 | |
11740650 | 1 | |
11740640 | 1 | |
11740620 | 1 | |
11740610 | 1 | |
11740600 | 1 | |
11740590 | 1 |
DO_NM(시도명)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.4 KiB |
서울 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
서울 | 424 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 424 |
CT_NM(시군구명)
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 5.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.4 KiB |
송파구 | 27 |
---|---|
강남구 | 22 |
관악구 | 21 |
성북구 | 20 |
강서구 | 20 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0731132 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 종로구 |
---|---|
2nd row | 종로구 |
3rd row | 종로구 |
4th row | 종로구 |
5th row | 종로구 |
Common Values
Value | Count | Frequency (%) |
송파구 | 27 | 6.4% |
강남구 | 22 | 5.2% |
관악구 | 21 | 5.0% |
성북구 | 20 | 4.7% |
강서구 | 20 | 4.7% |
노원구 | 19 | 4.5% |
강동구 | 18 | 4.2% |
서초구 | 18 | 4.2% |
영등포구 | 18 | 4.2% |
양천구 | 18 | 4.2% |
Other values (15) | 223 |
Length
Value | Count | Frequency (%) |
송파구 | 27 | 6.4% |
강남구 | 22 | 5.2% |
관악구 | 21 | 5.0% |
성북구 | 20 | 4.7% |
강서구 | 20 | 4.7% |
노원구 | 19 | 4.5% |
강동구 | 18 | 4.2% |
서초구 | 18 | 4.2% |
영등포구 | 18 | 4.2% |
양천구 | 18 | 4.2% |
Other values (15) | 223 |
H_DNG_NM(행정동명)
Text
Distinct | 423 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.4 KiB |
Value | Count | Frequency (%) |
신사동 | 2 | 0.5% |
노량진1동 | 1 | 0.2% |
독산3동 | 1 | 0.2% |
양평2동 | 1 | 0.2% |
양평1동 | 1 | 0.2% |
당산2동 | 1 | 0.2% |
당산1동 | 1 | 0.2% |
여의동 | 1 | 0.2% |
시흥5동 | 1 | 0.2% |
시흥4동 | 1 | 0.2% |
Other values (413) | 413 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 426 | |
2 | 97 | 6.0% |
1 | 97 | 6.0% |
3 | 43 | 2.7% |
신 | 38 | 2.4% |
4 | 26 | 1.6% |
가 | 23 | 1.4% |
곡 | 18 | 1.1% |
계 | 17 | 1.1% |
화 | 17 | 1.1% |
Other values (178) | 806 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1307 | |
Decimal Number | 292 | 18.2% |
Other Punctuation | 9 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 426 | |
신 | 38 | 2.9% |
가 | 23 | 1.8% |
곡 | 18 | 1.4% |
계 | 17 | 1.3% |
화 | 17 | 1.3% |
산 | 16 | 1.2% |
방 | 16 | 1.2% |
성 | 16 | 1.2% |
상 | 16 | 1.2% |
Other values (167) | 704 |
Decimal Number
Value | Count | Frequency (%) |
2 | 97 | |
1 | 97 | |
3 | 43 | |
4 | 26 | 8.9% |
5 | 11 | 3.8% |
6 | 7 | 2.4% |
7 | 6 | 2.1% |
8 | 3 | 1.0% |
0 | 1 | 0.3% |
9 | 1 | 0.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1307 | |
Common | 301 | 18.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 426 | |
신 | 38 | 2.9% |
가 | 23 | 1.8% |
곡 | 18 | 1.4% |
계 | 17 | 1.3% |
화 | 17 | 1.3% |
산 | 16 | 1.2% |
방 | 16 | 1.2% |
성 | 16 | 1.2% |
상 | 16 | 1.2% |
Other values (167) | 704 |
Common
Value | Count | Frequency (%) |
2 | 97 | |
1 | 97 | |
3 | 43 | |
4 | 26 | 8.6% |
5 | 11 | 3.7% |
. | 9 | 3.0% |
6 | 7 | 2.3% |
7 | 6 | 2.0% |
8 | 3 | 1.0% |
0 | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1307 | |
ASCII | 301 | 18.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 426 | |
신 | 38 | 2.9% |
가 | 23 | 1.8% |
곡 | 18 | 1.4% |
계 | 17 | 1.3% |
화 | 17 | 1.3% |
산 | 16 | 1.2% |
방 | 16 | 1.2% |
성 | 16 | 1.2% |
상 | 16 | 1.2% |
Other values (167) | 704 |
ASCII
Value | Count | Frequency (%) |
2 | 97 | |
1 | 97 | |
3 | 43 | |
4 | 26 | 8.6% |
5 | 11 | 3.7% |
. | 9 | 3.0% |
6 | 7 | 2.3% |
7 | 6 | 2.0% |
8 | 3 | 1.0% |
0 | 1 | 0.3% |
H_SDNG_CD(통계청행정동코드) | H_DNG_CD(행자부행정동코드) | CT_NM(시군구명) | |
---|---|---|---|
H_SDNG_CD(통계청행정동코드) | 1.000 | 0.996 | 1.000 |
H_DNG_CD(행자부행정동코드) | 0.996 | 1.000 | 1.000 |
CT_NM(시군구명) | 1.000 | 1.000 | 1.000 |
H_SDNG_CD(통계청행정동코드) | H_DNG_CD(행자부행정동코드) | CT_NM(시군구명) | |
---|---|---|---|
H_SDNG_CD(통계청행정동코드) | 1.000 | 0.999 | 0.977 |
H_DNG_CD(행자부행정동코드) | 0.999 | 1.000 | 0.982 |
CT_NM(시군구명) | 0.977 | 0.982 | 1.000 |
H_SDNG_CD(통계청행정동코드) | H_DNG_CD(행자부행정동코드) | DO_NM(시도명) | CT_NM(시군구명) | H_DNG_NM(행정동명) | |
---|---|---|---|---|---|
0 | 1101053 | 11110530 | 서울 | 종로구 | 사직동 |
1 | 1101054 | 11110540 | 서울 | 종로구 | 삼청동 |
2 | 1101055 | 11110550 | 서울 | 종로구 | 부암동 |
3 | 1101056 | 11110560 | 서울 | 종로구 | 평창동 |
4 | 1101057 | 11110570 | 서울 | 종로구 | 무악동 |
5 | 1101058 | 11110580 | 서울 | 종로구 | 교남동 |
6 | 1101060 | 11110600 | 서울 | 종로구 | 가회동 |
7 | 1101061 | 11110615 | 서울 | 종로구 | 종로1.2.3.4가동 |
8 | 1101063 | 11110630 | 서울 | 종로구 | 종로5.6가동 |
9 | 1101064 | 11110640 | 서울 | 종로구 | 이화동 |
H_SDNG_CD(통계청행정동코드) | H_DNG_CD(행자부행정동코드) | DO_NM(시도명) | CT_NM(시군구명) | H_DNG_NM(행정동명) | |
---|---|---|---|---|---|
414 | 1125061 | 11740600 | 서울 | 강동구 | 천호1동 |
415 | 1125063 | 11740620 | 서울 | 강동구 | 천호3동 |
416 | 1125065 | 11740640 | 서울 | 강동구 | 성내1동 |
417 | 1125066 | 11740650 | 서울 | 강동구 | 성내2동 |
418 | 1125067 | 11740660 | 서울 | 강동구 | 성내3동 |
419 | 1125070 | 11740690 | 서울 | 강동구 | 둔촌1동 |
420 | 1125071 | 11740700 | 서울 | 강동구 | 둔촌2동 |
421 | 1125072 | 11740570 | 서울 | 강동구 | 암사1동 |
422 | 1125073 | 11740610 | 서울 | 강동구 | 천호2동 |
423 | 1125074 | 11740685 | 서울 | 강동구 | 길동 |