Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 25.0 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Text | 1 |
Categorical | 2 |
Dataset
Description | Sample data |
---|---|
Author | KT |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=73 |
Reproduction
Analysis started | 2023-12-10 15:02:25.897263 |
---|---|
Analysis finished | 2023-12-10 15:02:28.614886 |
Duration | 2.72 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
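The reported duration can be cross-checked against the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and do not come from the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 15:02:25.897263")
finished = datetime.fromisoformat("2023-12-10 15:02:28.614886")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```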
기준년월(STD_YM): reference year-month
Real number (ℝ)
Distinct | 34 |
---|---|
Distinct (%) | 6.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 201705.75 |
Minimum | 201601 |
---|---|
Maximum | 201810 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 201601 |
---|---|
5-th percentile | 201602 |
Q1 | 201610 |
median | 201707 |
Q3 | 201802 |
95-th percentile | 201808.05 |
Maximum | 201810 |
Range | 209 |
Interquartile range (IQR) | 192 |
Descriptive statistics
Standard deviation | 78.249309 |
---|---|
Coefficient of variation (CV) | 0.00038793792 |
Kurtosis | -1.3798766 |
Mean | 201705.75 |
Median Absolute Deviation (MAD) | 96 |
Skewness | -0.0011407472 |
Sum | 1.0085287 × 10⁸ |
Variance | 6122.9544 |
Monotonicity | Not monotonic |
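STD_YM holds a year-month encoded as the integer YYYYMM, but it is profiled as a real number, so Range (209) and IQR (192) are differences of encoded integers rather than month counts. The sketch below converts the encoding to a running month index; `month_index` and `months_between` are hypothetical helper names, not part of the report or of ydata-profiling.

```python
def month_index(yyyymm: int) -> int:
    """Map an integer such as 201601 to a running month count."""
    year, month = divmod(yyyymm, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM value: {yyyymm}")
    return year * 12 + (month - 1)


def months_between(start: int, end: int) -> int:
    """Number of calendar months from start to end (exclusive of start)."""
    return month_index(end) - month_index(start)


first, last = 201601, 201810  # reported minimum and maximum

print(last - first)                      # 209, the reported Range
print(months_between(first, last) + 1)   # 34 calendar months, inclusive
```

The inclusive month count of 34 agrees with the reported number of distinct values.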
Value | Count | Frequency (%) |
201807 | 23 | 4.6% |
201805 | 19 | 3.8% |
201706 | 19 | 3.8% |
201712 | 18 | 3.6% |
201710 | 18 | 3.6% |
201711 | 18 | 3.6% |
201803 | 18 | 3.6% |
201708 | 18 | 3.6% |
201602 | 17 | 3.4% |
201709 | 17 | 3.4% |
Other values (24) | 315 | 63.0% |
Value | Count | Frequency (%) |
201601 | 11 | 2.2% |
201602 | 17 | 3.4% |
201603 | 10 | 2.0% |
201604 | 11 | 2.2% |
201605 | 8 | 1.6% |
201606 | 15 | 3.0% |
201607 | 11 | 2.2% |
201608 | 17 | 3.4% |
201609 | 13 | 2.6% |
201610 | 16 | 3.2% |
Value | Count | Frequency (%) |
201810 | 11 | 2.2% |
201809 | 14 | 2.8% |
201808 | 14 | 2.8% |
201807 | 23 | 4.6% |
201806 | 14 | 2.8% |
201805 | 19 | 3.8% |
201804 | 11 | 2.2% |
201803 | 18 | 3.6% |
201802 | 13 | 2.6% |
201801 | 16 | 3.2% |
행정동_코드(ADMI_CD): administrative dong code
Real number (ℝ)
Distinct | 295 |
---|---|
Distinct (%) | 59.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11440928 |
Minimum | 11110515 |
---|---|
Maximum | 11740700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 11110515 |
---|---|
5-th percentile | 11140578 |
Q1 | 11290590 |
median | 11440700 |
Q3 | 11620575 |
95-th percentile | 11740520 |
Maximum | 11740700 |
Range | 630185 |
Interquartile range (IQR) | 329985 |
Descriptive statistics
Standard deviation | 192411.45 |
---|---|
Coefficient of variation (CV) | 0.016817819 |
Kurtosis | -1.2544035 |
Mean | 11440928 |
Median Absolute Deviation (MAD) | 150125 |
Skewness | -0.036346148 |
Sum | 5.7204638 × 10⁹ |
Variance | 3.7022167 × 10¹⁰ |
Monotonicity | Not monotonic |
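ADMI_CD is an identifier that happens to be numeric, so statistics such as the mean or variance describe the code values rather than any measured quantity. The sketch below splits a code into parts instead. The 2 + 3 + 3 digit layout (city or province, district, dong) is an assumption about the coding scheme and is not stated in the report; `DongCode` and `split_admi_cd` are hypothetical names.

```python
from typing import NamedTuple


class DongCode(NamedTuple):
    sido: str     # assumed: 2-digit city/province prefix
    sigungu: str  # assumed: 3-digit district part
    dong: str     # assumed: 3-digit dong part


def split_admi_cd(code: int) -> DongCode:
    """Split an 8-digit ADMI_CD value into its assumed components."""
    text = str(code)
    if len(text) != 8 or not text.isdigit():
        raise ValueError(f"expected an 8-digit code, got {code!r}")
    return DongCode(text[:2], text[2:5], text[5:])


# Reported minimum and maximum of the column.
for code in (11110515, 11740700):
    print(code, split_admi_cd(code))
```

Grouping rows by the first five digits would then give one group per assumed district.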
Value | Count | Frequency (%) |
11320522 | 5 | 1.0% |
11620715 | 5 | 1.0% |
11620735 | 5 | 1.0% |
11500603 | 4 | 0.8% |
11290575 | 4 | 0.8% |
11305535 | 4 | 0.8% |
11290630 | 4 | 0.8% |
11530510 | 4 | 0.8% |
11590530 | 4 | 0.8% |
11710720 | 4 | 0.8% |
Other values (285) | 457 | 91.4% |
Value | Count | Frequency (%) |
11110515 | 2 | 0.4% |
11110530 | 1 | 0.2% |
11110550 | 2 | 0.4% |
11110570 | 1 | 0.2% |
11110580 | 1 | 0.2% |
11110600 | 3 | 0.6% |
11110615 | 1 | 0.2% |
11110650 | 1 | 0.2% |
11110680 | 3 | 0.6% |
11110690 | 1 | 0.2% |
Value | Count | Frequency (%) |
11740700 | 1 | 0.2% |
11740690 | 3 | 0.6% |
11740685 | 1 | 0.2% |
11740660 | 1 | 0.2% |
11740650 | 1 | 0.2% |
11740640 | 2 | 0.4% |
11740620 | 2 | 0.4% |
11740610 | 2 | 0.4% |
11740600 | 2 | 0.4% |
11740590 | 1 | 0.2% |
행정동_이름(ADMI_NM): administrative dong name
Text
Distinct | 298 |
---|---|
Distinct (%) | 59.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
방배4동 | 6 | 1.2% |
신사동 | 5 | 1.0% |
창신1동 | 4 | 0.8% |
삼각산동 | 4 | 0.8% |
용산2가동 | 4 | 0.8% |
방학1동 | 4 | 0.8% |
방이2동 | 4 | 0.8% |
목3동 | 4 | 0.8% |
마장동 | 4 | 0.8% |
창신3동 | 4 | 0.8% |
Other values (288) | 457 | 91.4% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 501 | 25.9% |
1 | 121 | 6.3% |
2 | 107 | 5.5% |
3 | 52 | 2.7% |
신 | 48 | 2.5% |
4 | 41 | 2.1% |
가 | 33 | 1.7% |
방 | 26 | 1.3% |
계 | 25 | 1.3% |
창 | 22 | 1.1% |
Other values (160) | 957 | 49.5% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1553 | 80.3% |
Decimal Number | 359 | 18.6% |
Other Punctuation | 21 | 1.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 501 | 32.3% |
신 | 48 | 3.1% |
가 | 33 | 2.1% |
방 | 26 | 1.7% |
계 | 25 | 1.6% |
창 | 22 | 1.4% |
곡 | 20 | 1.3% |
산 | 19 | 1.2% |
원 | 19 | 1.2% |
정 | 18 | 1.2% |
Other values (149) | 822 | 52.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 121 | 33.7% |
2 | 107 | 29.8% |
3 | 52 | 14.5% |
4 | 41 | 11.4% |
6 | 11 | 3.1% |
5 | 10 | 2.8% |
8 | 7 | 1.9% |
7 | 7 | 1.9% |
0 | 2 | 0.6% |
9 | 1 | 0.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 21 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1553 | 80.3% |
Common | 380 | 19.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 501 | 32.3% |
신 | 48 | 3.1% |
가 | 33 | 2.1% |
방 | 26 | 1.7% |
계 | 25 | 1.6% |
창 | 22 | 1.4% |
곡 | 20 | 1.3% |
산 | 19 | 1.2% |
원 | 19 | 1.2% |
정 | 18 | 1.2% |
Other values (149) | 822 | 52.9% |
Common
Value | Count | Frequency (%) |
1 | 121 | 31.8% |
2 | 107 | 28.2% |
3 | 52 | 13.7% |
4 | 41 | 10.8% |
. | 21 | 5.5% |
6 | 11 | 2.9% |
5 | 10 | 2.6% |
8 | 7 | 1.8% |
7 | 7 | 1.8% |
0 | 2 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1553 | 80.3% |
ASCII | 380 | 19.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 501 | 32.3% |
신 | 48 | 3.1% |
가 | 33 | 2.1% |
방 | 26 | 1.7% |
계 | 25 | 1.6% |
창 | 22 | 1.4% |
곡 | 20 | 1.3% |
산 | 19 | 1.2% |
원 | 19 | 1.2% |
정 | 18 | 1.2% |
Other values (149) | 822 | 52.9% |
ASCII
Value | Count | Frequency (%) |
1 | 121 | 31.8% |
2 | 107 | 28.2% |
3 | 52 | 13.7% |
4 | 41 | 10.8% |
. | 21 | 5.5% |
6 | 11 | 2.9% |
5 | 10 | 2.6% |
8 | 7 | 1.8% |
7 | 7 | 1.8% |
0 | 2 | 0.5% |
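The category names in the character tables above match Unicode general categories: Other Letter is `Lo`, Decimal Number is `Nd`, and Other Punctuation is `Po`. The sketch below reproduces that kind of count with Python's `unicodedata` module for four names taken from the Common Values table. The script guess based on the character name is a rough approximation chosen for this example and is not necessarily how the report derives its script and block tables.

```python
import unicodedata
from collections import Counter

# Four dong names copied from the Common Values table above.
names = ["방배4동", "창신1동", "용산2가동", "목3동"]

category_counts = Counter()
script_counts = Counter()
for name in names:
    for ch in name:
        # Two-letter Unicode general category, e.g. "Lo" or "Nd".
        category_counts[unicodedata.category(ch)] += 1
        # Approximation: treat anything whose Unicode name starts with
        # "HANGUL" as Hangul and everything else as Common.
        is_hangul = unicodedata.name(ch, "").startswith("HANGUL")
        script_counts["Hangul" if is_hangul else "Common"] += 1

print(dict(category_counts))
print(dict(script_counts))
```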
성별(GENDER): gender
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
F | 254 |
---|---|
M | 246 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | F |
---|---|
2nd row | M |
3rd row | M |
4th row | F |
5th row | F |
Common Values
Value | Count | Frequency (%) |
F | 254 | 50.8% |
M | 246 | 49.2% |
Plots not shown: Length, Common Values (Plot)
Value | Count | Frequency (%) |
f | 254 | |
m | 246 |
연령대(AGE): age band
Categorical
Distinct | 15 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
70_ABOVE | 43 |
---|---|
3034 | 39 |
6569 | 37 |
5054 | 36 |
4549 | 35 |
Other values (10) | 310 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.344 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 6569 |
---|---|
2nd row | 4549 |
3rd row | 70_ABOVE |
4th row | 1519 |
5th row | 3539 |
Common Values
Value | Count | Frequency (%) |
70_ABOVE | 43 | 8.6% |
3034 | 39 | 7.8% |
6569 | 37 | 7.4% |
5054 | 36 | 7.2% |
4549 | 35 | 7.0% |
3539 | 34 | 6.8% |
5559 | 34 | 6.8% |
1519 | 33 | 6.6% |
2024 | 33 | 6.6% |
4044 | 32 | 6.4% |
Other values (5) | 144 | 28.8% |
Plot not shown: Length
Value | Count | Frequency (%) |
70_above | 43 | 8.6% |
3034 | 39 | 7.8% |
6569 | 37 | 7.4% |
5054 | 36 | 7.2% |
4549 | 35 | 7.0% |
3539 | 34 | 6.8% |
5559 | 34 | 6.8% |
1519 | 33 | 6.6% |
2024 | 33 | 6.6% |
4044 | 32 | 6.4% |
Other values (5) | 144 | 28.8% |
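The AGE values appear to be band codes: four digits read as two two-digit bounds (for example 3034 for ages 30 to 34), plus the open-ended 70_ABOVE. That reading is an assumption based on the values shown and is not documented in the report. Under that assumption, a hypothetical `parse_age_band` helper could turn the codes into numeric bounds for sorting or grouping.

```python
from typing import Optional, Tuple


def parse_age_band(code: str) -> Tuple[int, Optional[int]]:
    """Return (lower, upper) ages; upper is None for an open-ended band."""
    if code.upper().endswith("_ABOVE"):
        return int(code.split("_")[0]), None
    if len(code) == 4 and code.isdigit():
        return int(code[:2]), int(code[2:])
    raise ValueError(f"unrecognised age band: {code!r}")


# Codes taken from the tables and sample rows in this report.
for code in ("0509", "3034", "6569", "70_ABOVE"):
    print(code, parse_age_band(code))
```

Keeping the column as text until it is parsed preserves leading zeros such as the one in 0509.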
유동인구_합계(POP_CNT): total floating population
Real number (ℝ)
Distinct | 498 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 101146.48 |
Minimum | 152 |
---|---|
Maximum | 1567717 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 152 |
---|---|
5-th percentile | 652.5 |
Q1 | 30647 |
median | 67736.5 |
Q3 | 123126 |
95-th percentile | 348897.35 |
Maximum | 1567717 |
Range | 1567565 |
Interquartile range (IQR) | 92479 |
Descriptive statistics
Standard deviation | 133721.25 |
---|---|
Coefficient of variation (CV) | 1.3220554 |
Kurtosis | 36.551542 |
Mean | 101146.48 |
Median Absolute Deviation (MAD) | 44308.5 |
Skewness | 4.6911567 |
Sum | 50573242 |
Variance | 1.7881374 × 10¹⁰ |
Monotonicity | Not monotonic |
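Several of the POP_CNT figures above are derived from others: Range = Maximum − Minimum, IQR = Q3 − Q1, CV = standard deviation / mean, and Variance = standard deviation squared. The sketch below recomputes them from the reported (rounded) values. The upper fence uses Tukey's 1.5 × IQR rule, a common outlier heuristic that is introduced here for illustration and does not appear in the report.

```python
import math

# Values copied from the POP_CNT tables above (already rounded).
mean, std = 101146.48, 133721.25
minimum, maximum = 152, 1567717
q1, q3 = 30647, 123126

value_range = maximum - minimum
iqr = q3 - q1
cv = std / mean
variance = std ** 2

# Tukey's rule: values above Q3 + 1.5 * IQR are flagged as high outliers.
upper_fence = q3 + 1.5 * iqr

print(f"range={value_range}, iqr={iqr}")
print(f"cv={cv:.7f}, variance={variance:.7e}")
print(f"upper fence={upper_fence}")
```

The reported 95-th percentile (348897.35) lies above this fence, which fits the strong right skew (4.69) and high kurtosis (36.55) reported for the column.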
Value | Count | Frequency (%) |
358 | 2 | 0.4% |
419 | 2 | 0.4% |
132396 | 1 | 0.2% |
24772 | 1 | 0.2% |
4145 | 1 | 0.2% |
8256 | 1 | 0.2% |
114887 | 1 | 0.2% |
155233 | 1 | 0.2% |
13858 | 1 | 0.2% |
40993 | 1 | 0.2% |
Other values (488) | 488 | 97.6% |
Value | Count | Frequency (%) |
152 | 1 | 0.2% |
168 | 1 | 0.2% |
262 | 1 | 0.2% |
267 | 1 | 0.2% |
274 | 1 | 0.2% |
311 | 1 | 0.2% |
336 | 1 | 0.2% |
341 | 1 | 0.2% |
348 | 1 | 0.2% |
357 | 1 | 0.2% |
Value | Count | Frequency (%) |
1567717 | 1 | 0.2% |
1078273 | 1 | 0.2% |
697996 | 1 | 0.2% |
693524 | 1 | 0.2% |
577623 | 1 | 0.2% |
551626 | 1 | 0.2% |
549657 | 1 | 0.2% |
539870 | 1 | 0.2% |
538774 | 1 | 0.2% |
513366 | 1 | 0.2% |
Variable | 기준년월(STD_YM) | 행정동_코드(ADMI_CD) | 성별(GENDER) | 연령대(AGE) | 유동인구_합계(POP_CNT) |
---|---|---|---|---|---|
기준년월(STD_YM) | 1.000 | 0.000 | 0.049 | 0.000 | 0.084 |
행정동_코드(ADMI_CD) | 0.000 | 1.000 | 0.045 | 0.208 | 0.000 |
성별(GENDER) | 0.049 | 0.045 | 1.000 | 0.000 | 0.000 |
연령대(AGE) | 0.000 | 0.208 | 0.000 | 1.000 | 0.080 |
유동인구_합계(POP_CNT) | 0.084 | 0.000 | 0.000 | 0.080 | 1.000 |
Variable | 연령대(AGE) | 성별(GENDER) |
---|---|---|
연령대(AGE) | 1.000 | 0.000 |
성별(GENDER) | 0.000 | 1.000 |
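The matrix above covers only the two categorical variables. A common association measure for such pairs is Cramér's V; whether this particular table was computed with it is an assumption, since the extracted text carries no caption. The sketch below is a plain, uncorrected Cramér's V in standard-library Python. Libraries may apply a bias correction that is omitted here, and the toy data is made up for illustration.

```python
import math
from collections import Counter
from typing import Hashable, Sequence


def cramers_v(xs: Sequence[Hashable], ys: Sequence[Hashable]) -> float:
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts, y_counts = Counter(xs), Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(x_counts), len(y_counts)) - 1
    if k == 0:
        return 0.0  # V is undefined with a single category; report 0 here
    return math.sqrt(chi2 / (n * k))


# Made-up toy data: gender is independent of the age band here.
age = ["3034", "3034", "6569", "6569"]
gender = ["F", "M", "F", "M"]
print(cramers_v(age, gender))
```

A value near 0, as in the reported 0.000 for AGE against GENDER, indicates no detectable association between the two variables.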
Variable | 기준년월(STD_YM) | 행정동_코드(ADMI_CD) | 유동인구_합계(POP_CNT) | 성별(GENDER) | 연령대(AGE) |
---|---|---|---|---|---|
기준년월(STD_YM) | 1.000 | -0.008 | 0.049 | 0.047 | 0.000 |
행정동_코드(ADMI_CD) | -0.008 | 1.000 | -0.030 | 0.034 | 0.075 |
유동인구_합계(POP_CNT) | 0.049 | -0.030 | 1.000 | 0.000 | 0.031 |
성별(GENDER) | 0.047 | 0.034 | 0.000 | 1.000 | 0.000 |
연령대(AGE) | 0.000 | 0.075 | 0.031 | 0.000 | 1.000 |
Index | 기준년월(STD_YM) | 행정동_코드(ADMI_CD) | 행정동_이름(ADMI_NM) | 성별(GENDER) | 연령대(AGE) | 유동인구_합계(POP_CNT) |
---|---|---|---|---|---|---|
0 | 201704 | 11500535 | 대림2동 | F | 6569 | 132396 |
1 | 201803 | 11590660 | 서초4동 | M | 4549 | 333412 |
2 | 201803 | 11410710 | 쌍문4동 | M | 70_ABOVE | 84393 |
3 | 201603 | 11410520 | 논현1동 | F | 1519 | 674 |
4 | 201611 | 11740600 | 행당2동 | F | 3539 | 44638 |
5 | 201702 | 11620765 | 구로3동 | F | 3034 | 3711 |
6 | 201611 | 11200535 | 목3동 | M | 70_ABOVE | 94212 |
7 | 201608 | 11590670 | 양재2동 | M | 6064 | 17771 |
8 | 201709 | 11410640 | 홍제2동 | M | 2529 | 107057 |
9 | 201806 | 11740660 | 논현1동 | F | 1014 | 116294 |
Index | 기준년월(STD_YM) | 행정동_코드(ADMI_CD) | 행정동_이름(ADMI_NM) | 성별(GENDER) | 연령대(AGE) | 유동인구_합계(POP_CNT) |
---|---|---|---|---|---|---|
490 | 201611 | 11230730 | 약수동 | M | 2024 | 182182 |
491 | 201802 | 11320660 | 신대방2동 | M | 6064 | 115907 |
492 | 201709 | 11620685 | 망원1동 | F | 1014 | 129568 |
493 | 201711 | 11620735 | 상계9동 | M | 0509 | 262 |
494 | 201612 | 11260610 | 일원2동 | M | 3034 | 233274 |
495 | 201706 | 11215760 | 일원1동 | F | 3034 | 182240 |
496 | 201611 | 11110650 | 반포3동 | M | 1014 | 76940 |
497 | 201706 | 11470590 | 삼각산동 | M | 2024 | 121412 |
498 | 201706 | 11410700 | 방배4동 | F | 4549 | 89843 |
499 | 201608 | 11305645 | 대치4동 | M | 5054 | 126741 |
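The sample rows above can be read back into typed records. The sketch below parses one pipe-delimited row as it appears in this text; `SampleRow` and `parse_row` are hypothetical names. AGE is kept as a string on purpose so that codes such as 0509 keep their leading zero.

```python
from dataclasses import dataclass


@dataclass
class SampleRow:
    index: int
    std_ym: int
    admi_cd: int
    admi_nm: str
    gender: str
    age: str      # kept as text to preserve codes like "0509"
    pop_cnt: int


def parse_row(line: str) -> SampleRow:
    """Parse one 'index | STD_YM | ... | POP_CNT |' row from the sample tables."""
    cells = [cell.strip() for cell in line.strip().rstrip("|").split("|")]
    if len(cells) != 7:
        raise ValueError(f"expected 7 cells, got {len(cells)}: {line!r}")
    idx, std_ym, admi_cd, admi_nm, gender, age, pop_cnt = cells
    return SampleRow(int(idx), int(std_ym), int(admi_cd),
                     admi_nm, gender, age, int(pop_cnt))


row = parse_row("493 | 201711 | 11620735 | 상계9동 | M | 0509 | 262 |")
print(row)
```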