Dataset statistics
Number of variables | 21 |
---|---|
Number of observations | 10000 |
Missing cells | 21154 |
Missing cells (%) | 10.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.8 MiB |
Average record size in memory | 186.0 B |
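The missing-cell percentage and the memory figures above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the five columns for which this report lists a missing count (전화번호, 행정처분 시작일, 행정처분 종료일, 건물 부번, 법정동명) are the only columns with missing values; the variable names are made up for the example.

```python
# Figures copied from this report; the variable names are illustrative.
n_variables = 21
n_observations = 10_000

# Per-column missing counts as listed in the alerts and variable sections.
# Assumption: no other column contributes missing cells.
missing_by_column = {
    "전화번호": 965,
    "행정처분 시작일": 9989,
    "행정처분 종료일": 9989,
    "건물 부번": 203,
    "법정동명": 8,
}

missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Total size estimated from the average record size (bytes -> MiB).
avg_record_bytes = 186.0
total_mib = avg_record_bytes * n_observations / 1024**2

print(missing_cells)
print(f"{missing_pct:.1f}%")
print(f"{total_mib:.1f} MiB")
```

Under that assumption the per-column counts add up to the reported number of missing cells, and the rounded percentage and total size agree with the table.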
Variable types
Numeric | 9 |
---|---|
Categorical | 4 |
Text | 6 |
DateTime | 2 |
Dataset
Description | 시스템등록번호,시군구코드,법정동코드,자치구명,법정동명,지번구분,본번,부번,주소,중개업등록번호,중개업자명,사업자상호,전화번호,상태구분,행정처분 시작일,행정처분 종료일,조회 개수,도로명코드,건물,건물 본번,건물 부번 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15550/S/1/datasetView.do |
지번구분 is highly imbalanced (99.1%) | Imbalance |
상태구분 is highly imbalanced (97.9%) | Imbalance |
건물 is highly imbalanced (90.9%) | Imbalance |
전화번호 has 965 (9.7%) missing values | Missing |
행정처분 시작일 has 9989 (99.9%) missing values | Missing |
행정처분 종료일 has 9989 (99.9%) missing values | Missing |
건물 부번 has 203 (2.0%) missing values | Missing |
시스템등록번호 is highly skewed (γ₁ = 55.54901036) | Skewed |
시군구코드 is highly skewed (γ₁ = 74.2236707) | Skewed |
부번 has 2625 (26.2%) zeros | Zeros |
건물 부번 has 8912 (89.1%) zeros | Zeros |
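The imbalance percentages in the alerts can be reproduced from the class counts shown later in this report for 지번구분 and 상태구분. The sketch below uses a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. Whether the profiling tool computes the score in exactly this way is an assumption; the function name is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts copied from the "Common Values" tables in this report.
jibun = imbalance_score([9988, 10, 2])        # 지번구분
status = imbalance_score([9960, 28, 11, 1])   # 상태구분

print(f"{jibun:.1%}", f"{status:.1%}")
```

A score near 100% means almost every row falls into one class, which is the case for both columns here.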
Reproduction
Analysis started | 2024-05-10 23:11:54.215721 |
---|---|
Analysis finished | 2024-05-10 23:11:59.351314 |
Duration | 5.14 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
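The duration follows from the two timestamps above. A minimal sketch with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamp layout used in the table above.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-05-10 23:11:54.215721", FMT)
finished = datetime.strptime("2024-05-10 23:11:59.351314", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```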
시스템등록번호
Real number (ℝ)
SKEWED
Distinct | 9999 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1479639 × 10¹⁴ |
Minimum | 1.1110198 × 10¹⁴ |
---|---|
Maximum | 4.5113202 × 10¹⁴ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110198 × 10¹⁴ |
---|---|
5-th percentile | 1.1170201 × 10¹⁴ |
Q1 | 1.1305201 × 10¹⁴ |
median | 1.1500202 × 10¹⁴ |
Q3 | 1.1650202 × 10¹⁴ |
95-th percentile | 1.1710202 × 10¹⁴ |
Maximum | 4.5113202 × 10¹⁴ |
Range | 3.4003004 × 10¹⁴ |
Interquartile range (IQR) | 3.450013 × 10¹² |
Descriptive statistics
Standard deviation | 4.9023901 × 10¹² |
---|---|
Coefficient of variation (CV) | 0.04270509 |
Kurtosis | 3645.7391 |
Mean | 1.1479639 × 10¹⁴ |
Median Absolute Deviation (MAD) | 1.799992 × 10¹² |
Skewness | 55.54901 |
Sum | 1.1479639 × 10¹⁸ |
Variance | 2.4033429 × 10²⁵ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
116802010000230 | 2 | < 0.1% |
116802022000682 | 1 | < 0.1% |
112002020000019 | 1 | < 0.1% |
115902002000254 | 1 | < 0.1% |
111702021000122 | 1 | < 0.1% |
113052019000025 | 1 | < 0.1% |
114102021000004 | 1 | < 0.1% |
116502014000147 | 1 | < 0.1% |
113052024000021 | 1 | < 0.1% |
113802009000313 | 1 | < 0.1% |
Other values (9989) | 9989 | 99.9% |
Minimum 10 values
Value | Count | Frequency (%) |
111101984000002 | 1 | < 0.1% |
111101984000045 | 1 | < 0.1% |
111101984000154 | 1 | < 0.1% |
111101984000162 | 1 | < 0.1% |
111101986000137 | 1 | < 0.1% |
111101987000048 | 1 | < 0.1% |
111101988000020 | 1 | < 0.1% |
111101988000038 | 1 | < 0.1% |
111101988000152 | 1 | < 0.1% |
111101989000192 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
451132023000073 | 1 | < 0.1% |
416302003000278 | 1 | < 0.1% |
117402024000112 | 1 | < 0.1% |
117402024000110 | 1 | < 0.1% |
117402024000109 | 1 | < 0.1% |
117402024000108 | 1 | < 0.1% |
117402024000107 | 1 | < 0.1% |
117402024000105 | 1 | < 0.1% |
117402024000102 | 1 | < 0.1% |
117402024000100 | 1 | < 0.1% |
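The two largest values begin with 45113 and 41630, while every other value listed begins with 111 to 117. Those two rows are what make this identifier column look heavily skewed when it is treated as a number. The sketch below splits a value into a 5-digit prefix, a 4-digit year and a 6-digit serial. That 5+4+6 layout is an assumption inferred from the sample values, not something the report states, and the function name is hypothetical.

```python
def split_registration_id(value):
    """Split a 15-digit ID into (prefix, year, serial), assuming a 5+4+6 layout."""
    digits = f"{int(value):015d}"
    if len(digits) != 15:
        raise ValueError(f"expected at most 15 digits, got {value!r}")
    return digits[:5], int(digits[5:9]), int(digits[9:])


# One common value and the two largest values from the tables above.
samples = [116802022000682, 451132023000073, 416302003000278]
parts = [split_registration_id(v) for v in samples]

for value, (prefix, year, serial) in zip(samples, parts):
    print(value, prefix, year, serial)
```

If the layout holds, storing the column as a string and analysing the prefix separately is likely to be more informative than numeric statistics on the whole value.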
시군구코드
Real number (ℝ)
SKEWED
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11477.04 |
Minimum | 11110 |
---|---|
Maximum | 52113 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11170 |
Q1 | 11305 |
median | 11500 |
Q3 | 11650 |
95-th percentile | 11710 |
Maximum | 52113 |
Range | 41003 |
Interquartile range (IQR) | 345 |
Descriptive statistics
Standard deviation | 448.80203 |
---|---|
Coefficient of variation (CV) | 0.039104336 |
Kurtosis | 6722.592 |
Mean | 11477.04 |
Median Absolute Deviation (MAD) | 180 |
Skewness | 74.223671 |
Sum | 1.147704 × 10⁸ |
Variance | 201423.27 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
11680 | 1153 | 11.5% |
11650 | 736 | 7.4% |
11710 | 640 | 6.4% |
11500 | 501 | 5.0% |
11740 | 493 | 4.9% |
11440 | 488 | 4.9% |
11380 | 444 | 4.4% |
11560 | 439 | 4.4% |
11215 | 374 | 3.7% |
11470 | 365 | 3.6% |
Other values (16) | 4367 | 43.7% |
Minimum 10 values
Value | Count | Frequency (%) |
11110 | 201 | 2.0% |
11140 | 240 | 2.4% |
11170 | 346 | 3.5% |
11200 | 323 | 3.2% |
11215 | 374 | 3.7% |
11230 | 359 | 3.6% |
11260 | 298 | 3.0% |
11290 | 308 | 3.1% |
11305 | 243 | 2.4% |
11320 | 199 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
52113 | 1 | < 0.1% |
11740 | 493 | 4.9% |
11710 | 640 | 6.4% |
11680 | 1153 | 11.5% |
11650 | 736 | 7.4% |
11620 | 361 | 3.6% |
11590 | 336 | 3.4% |
11560 | 439 | 4.4% |
11545 | 277 | 2.8% |
11530 | 305 | 3.0% |
법정동코드
Real number (ℝ)
Distinct | 390 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1472964 × 10⁹ |
Minimum | 1.1110101 × 10⁹ |
---|---|
Maximum | 1.174011 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110101 × 10⁹ |
---|---|
5-th percentile | 1.1170109 × 10⁹ |
Q1 | 1.1305101 × 10⁹ |
median | 1.1500103 × 10⁹ |
Q3 | 1.1650108 × 10⁹ |
95-th percentile | 1.1710114 × 10⁹ |
Maximum | 1.174011 × 10⁹ |
Range | 63000900 |
Interquartile range (IQR) | 34500700 |
Descriptive statistics
Standard deviation | 19051063 |
---|---|
Coefficient of variation (CV) | 0.01660518 |
Kurtosis | -1.2473972 |
Mean | 1.1472964 × 10⁹ |
Median Absolute Deviation (MAD) | 17999800 |
Skewness | -0.28214162 |
Sum | 1.1472964 × 10¹³ |
Variance | 3.6294301 × 10¹⁴ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1168010100 | 332 | 3.3% |
1165010800 | 275 | 2.8% |
1162010200 | 183 | 1.8% |
1162010100 | 172 | 1.7% |
1150010300 | 164 | 1.6% |
1147010100 | 163 | 1.6% |
1135010500 | 155 | 1.6% |
1168010600 | 154 | 1.5% |
1165010100 | 148 | 1.5% |
1153010200 | 140 | 1.4% |
Other values (380) | 8114 | 81.1% |
Minimum 10 values
Value | Count | Frequency (%) |
1111010100 | 2 | < 0.1% |
1111010200 | 1 | < 0.1% |
1111010400 | 1 | < 0.1% |
1111010500 | 1 | < 0.1% |
1111010600 | 2 | < 0.1% |
1111010800 | 3 | < 0.1% |
1111011000 | 2 | < 0.1% |
1111011100 | 4 | < 0.1% |
1111011400 | 2 | < 0.1% |
1111011500 | 2 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
1174011000 | 11 | 0.1% |
1174010900 | 93 | 0.9% |
1174010800 | 123 | 1.2% |
1174010700 | 38 | 0.4% |
1174010600 | 39 | 0.4% |
1174010500 | 70 | 0.7% |
1174010300 | 28 | 0.3% |
1174010200 | 56 | 0.6% |
1174010100 | 37 | 0.4% |
1171011400 | 29 | 0.3% |
자치구명
Categorical
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
강남구 | 1152 |
---|---|
서초구 | 734 |
송파구 | 639 |
강서구 | 501 |
강동구 | 493 |
Other values (21) | 6481 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0823 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강남구 |
---|---|
2nd row | 중구 |
3rd row | 동작구 |
4th row | 서초구 |
5th row | 영등포구 |
Common Values
Value | Count | Frequency (%) |
강남구 | 1152 | 11.5% |
서초구 | 734 | 7.3% |
송파구 | 639 | 6.4% |
강서구 | 501 | 5.0% |
강동구 | 493 | 4.9% |
마포구 | 488 | 4.9% |
은평구 | 444 | 4.4% |
영등포구 | 439 | 4.4% |
광진구 | 374 | 3.7% |
양천구 | 365 | 3.6% |
Other values (16) | 4371 | 43.7% |
법정동명
Text
Distinct | 393 |
---|---|
Distinct (%) | 3.9% |
Missing | 8 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
역삼동 | 331 | 3.3% |
서초동 | 275 | 2.8% |
신림동 | 183 | 1.8% |
봉천동 | 171 | 1.7% |
화곡동 | 163 | 1.6% |
신정동 | 162 | 1.6% |
상계동 | 155 | 1.6% |
대치동 | 154 | 1.5% |
방배동 | 150 | 1.5% |
구로동 | 138 | 1.4% |
Other values (383) | 8110 | 81.2% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 9860 | 31.1% |
가 | 1032 | 3.3% |
신 | 931 | 2.9% |
삼 | 513 | 1.6% |
곡 | 477 | 1.5% |
산 | 448 | 1.4% |
성 | 426 | 1.3% |
정 | 390 | 1.2% |
방 | 382 | 1.2% |
역 | 368 | 1.2% |
Other values (185) | 16902 | 53.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30987 | 97.7% |
Decimal Number | 742 | 2.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9860 | 31.8% |
가 | 1032 | 3.3% |
신 | 931 | 3.0% |
삼 | 513 | 1.7% |
곡 | 477 | 1.5% |
산 | 448 | 1.4% |
성 | 426 | 1.4% |
정 | 390 | 1.3% |
방 | 382 | 1.2% |
역 | 368 | 1.2% |
Other values (177) | 16160 | 52.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 208 | 28.0% |
2 | 199 | 26.8% |
3 | 122 | 16.4% |
5 | 73 | 9.8% |
4 | 72 | 9.7% |
6 | 39 | 5.3% |
7 | 19 | 2.6% |
8 | 10 | 1.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30987 | 97.7% |
Common | 742 | 2.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9860 | 31.8% |
가 | 1032 | 3.3% |
신 | 931 | 3.0% |
삼 | 513 | 1.7% |
곡 | 477 | 1.5% |
산 | 448 | 1.4% |
성 | 426 | 1.4% |
정 | 390 | 1.3% |
방 | 382 | 1.2% |
역 | 368 | 1.2% |
Other values (177) | 16160 | 52.2% |
Common
Value | Count | Frequency (%) |
1 | 208 | 28.0% |
2 | 199 | 26.8% |
3 | 122 | 16.4% |
5 | 73 | 9.8% |
4 | 72 | 9.7% |
6 | 39 | 5.3% |
7 | 19 | 2.6% |
8 | 10 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30987 | 97.7% |
ASCII | 742 | 2.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 9860 | 31.8% |
가 | 1032 | 3.3% |
신 | 931 | 3.0% |
삼 | 513 | 1.7% |
곡 | 477 | 1.5% |
산 | 448 | 1.4% |
성 | 426 | 1.4% |
정 | 390 | 1.3% |
방 | 382 | 1.2% |
역 | 368 | 1.2% |
Other values (177) | 16160 | 52.2% |
ASCII
Value | Count | Frequency (%) |
1 | 208 | 28.0% |
2 | 199 | 26.8% |
3 | 122 | 16.4% |
5 | 73 | 9.8% |
4 | 72 | 9.7% |
6 | 39 | 5.3% |
7 | 19 | 2.6% |
8 | 10 | 1.3% |
지번구분
Categorical
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | 9988 |
---|---|
<NA> | 10 |
2 | 2 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.003 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 9988 | 99.9% |
<NA> | 10 | 0.1% |
2 | 2 | < 0.1% |
본번
Real number (ℝ)
Distinct | 1404 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 469.6713 |
Minimum | 0 |
---|---|
Maximum | 4958 |
Zeros | 25 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 18 |
Q1 | 125 |
median | 341 |
Q3 | 693 |
95-th percentile | 1332.25 |
Maximum | 4958 |
Range | 4958 |
Interquartile range (IQR) | 568 |
Descriptive statistics
Standard deviation | 470.23622 |
---|---|
Coefficient of variation (CV) | 1.0012028 |
Kurtosis | 18.060775 |
Mean | 469.6713 |
Median Absolute Deviation (MAD) | 260.5 |
Skewness | 2.8307866 |
Sum | 4696713 |
Variance | 221122.1 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1 | 57 | 0.6% |
20 | 44 | 0.4% |
825 | 37 | 0.4% |
19 | 36 | 0.4% |
98 | 36 | 0.4% |
18 | 36 | 0.4% |
50 | 35 | 0.4% |
2 | 35 | 0.4% |
27 | 34 | 0.3% |
10 | 34 | 0.3% |
Other values (1394) | 9616 | 96.2% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 25 | 0.2% |
1 | 57 | 0.6% |
2 | 35 | 0.4% |
3 | 28 | 0.3% |
4 | 25 | 0.2% |
5 | 31 | 0.3% |
6 | 11 | 0.1% |
7 | 25 | 0.2% |
8 | 27 | 0.3% |
9 | 32 | 0.3% |
Maximum 10 values
Value | Count | Frequency (%) |
4958 | 1 | < 0.1% |
4955 | 1 | < 0.1% |
4950 | 8 | < 0.1% |
4945 | 1 | < 0.1% |
4937 | 1 | < 0.1% |
4934 | 1 | < 0.1% |
4921 | 1 | < 0.1% |
4780 | 2 | < 0.1% |
4765 | 1 | < 0.1% |
4759 | 1 | < 0.1% |
부번
Real number (ℝ)
ZEROS
Distinct | 350 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24.4291 |
Minimum | 0 |
---|---|
Maximum | 2181 |
Zeros | 2625 |
Zeros (%) | 26.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 5 |
Q3 | 20 |
95-th percentile | 95 |
Maximum | 2181 |
Range | 2181 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 80.122008 |
---|---|
Coefficient of variation (CV) | 3.2797773 |
Kurtosis | 210.08632 |
Mean | 24.4291 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 11.774607 |
Sum | 244291 |
Variance | 6419.5361 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 2625 | 26.2% |
1 | 898 | 9.0% |
2 | 485 | 4.9% |
3 | 419 | 4.2% |
4 | 340 | 3.4% |
5 | 317 | 3.2% |
6 | 290 | 2.9% |
8 | 237 | 2.4% |
7 | 227 | 2.3% |
9 | 191 | 1.9% |
Other values (340) | 3971 | 39.7% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 2625 | 26.2% |
1 | 898 | 9.0% |
2 | 485 | 4.9% |
3 | 419 | 4.2% |
4 | 340 | 3.4% |
5 | 317 | 3.2% |
6 | 290 | 2.9% |
7 | 227 | 2.3% |
8 | 237 | 2.4% |
9 | 191 | 1.9% |
Maximum 10 values
Value | Count | Frequency (%) |
2181 | 1 | < 0.1% |
2150 | 1 | < 0.1% |
1738 | 1 | < 0.1% |
1539 | 1 | < 0.1% |
1503 | 1 | < 0.1% |
1483 | 1 | < 0.1% |
1406 | 1 | < 0.1% |
1312 | 1 | < 0.1% |
1268 | 1 | < 0.1% |
1130 | 1 | < 0.1% |
주소
Text
Distinct | 9825 |
---|---|
Distinct (%) | 98.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 71 |
---|---|
Median length | 55 |
Mean length | 32.4284 |
Min length | 16 |
Characters and Unicode
Total characters | 324284 |
---|---|
Distinct characters | 613 |
Distinct categories | 15 |
Distinct scripts | 4 |
Distinct blocks | 6 |
Unique
Unique | 9684 |
---|---|
Unique (%) | 96.8% |
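The reported mean length of a text column equals its total character count divided by the number of non-missing values. The sketch below checks this for the four text columns in this report that list a total character count. The dictionary layout is illustrative only.

```python
n_rows = 10_000

# name: (total characters, non-missing values, reported mean length),
# all copied from the Length and Characters tables in this report.
columns = {
    "주소": (324284, n_rows, 32.4284),
    "중개업등록번호": (147064, n_rows, 14.7064),
    "사업자상호": (118848, n_rows, 11.8848),
    "전화번호": (107176, n_rows - 965, 11.862313),  # 965 values are missing
}

mean_lengths = {name: chars / n for name, (chars, n, _) in columns.items()}

for name, (_, _, reported) in columns.items():
    print(name, round(mean_lengths[name], 6), reported)
```

For 전화번호 the denominator is the 9,035 non-missing values, not the full 10,000 rows.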
Sample
1st row | 서울특별시 강남구 압구정로29길 71 1층 105호(압구정동, 점포4동) |
---|---|
2nd row | 서울특별시 중구 난계로11길 8 1층 |
3rd row | 서울특별시 동작구 노량진로 252 (본동) |
4th row | 서울특별시 서초구 효령로 429 , 111호(서초동, 강남 삼부르네상스시티) |
5th row | 서울특별시 영등포구 가마산로 466 104호 |
Value | Count | Frequency (%) |
서울특별시 | 10002 | 16.8% |
1층 | 1697 | 2.8% |
강남구 | 1152 | 1.9% |
(blank) | 887 | 1.5% |
서초구 | 731 | 1.2% |
상가동 | 644 | 1.1% |
송파구 | 639 | 1.1% |
강서구 | 504 | 0.8% |
강동구 | 496 | 0.8% |
마포구 | 486 | 0.8% |
Other values (11862) | 42450 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 50214 | 15.5% |
1 | 18279 | 5.6% |
동 | 12945 | 4.0% |
서 | 12587 | 3.9% |
구 | 10754 | 3.3% |
로 | 10519 | 3.2% |
시 | 10481 | 3.2% |
울 | 10077 | 3.1% |
특 | 10007 | 3.1% |
별 | 10005 | 3.1% |
Other values (603) | 168416 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 191348 | |
Decimal Number | 56995 | 17.6% |
Space Separator | 50214 | 15.5% |
Close Punctuation | 9049 | 2.8% |
Open Punctuation | 9047 | 2.8% |
Other Punctuation | 4740 | 1.5% |
Dash Punctuation | 1442 | 0.4% |
Uppercase Letter | 1248 | 0.4% |
Lowercase Letter | 120 | < 0.1% |
Other Symbol | 54 | < 0.1% |
Other values (5) | 27 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 12945 | 6.8% |
서 | 12587 | 6.6% |
구 | 10754 | 5.6% |
로 | 10519 | 5.5% |
시 | 10481 | 5.5% |
울 | 10077 | 5.3% |
특 | 10007 | 5.2% |
별 | 10005 | 5.2% |
호 | 5969 | 3.1% |
길 | 5497 | 2.9% |
Other values (526) | 92507 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 400 | |
A | 194 | |
C | 87 | 7.0% |
S | 77 | 6.2% |
D | 58 | 4.6% |
K | 55 | 4.4% |
M | 42 | 3.4% |
L | 36 | 2.9% |
R | 35 | 2.8% |
O | 34 | 2.7% |
Other values (16) | 230 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 44 | |
r | 8 | 6.7% |
t | 7 | 5.8% |
o | 7 | 5.8% |
i | 7 | 5.8% |
n | 6 | 5.0% |
a | 5 | 4.2% |
w | 4 | 3.3% |
k | 4 | 3.3% |
b | 4 | 3.3% |
Other values (10) | 24 |
Decimal Number
Value | Count | Frequency (%) |
1 | 18279 | |
2 | 7251 | 12.7% |
0 | 6548 | 11.5% |
3 | 5540 | 9.7% |
4 | 4212 | 7.4% |
5 | 3786 | 6.6% |
6 | 3214 | 5.6% |
7 | 3009 | 5.3% |
8 | 2658 | 4.7% |
9 | 2498 | 4.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 4648 | |
@ | 52 | 1.1% |
. | 16 | 0.3% |
/ | 7 | 0.1% |
& | 5 | 0.1% |
; | 5 | 0.1% |
? | 4 | 0.1% |
? | 3 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 8752 | |
] | 297 | 3.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 8748 | |
[ | 299 | 3.3% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 8 | |
Ⅱ | 4 |
Space Separator
Value | Count | Frequency (%) |
(space) | 50214 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1442 |
Other Symbol
Value | Count | Frequency (%) |
㈕ | 54 |
Math Symbol
Value | Count | Frequency (%) |
~ | 10 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Other Number
Value | Count | Frequency (%) |
⑴ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 191399 | |
Common | 131502 | |
Latin | 1380 | 0.4% |
Han | 3 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 12945 | 6.8% |
서 | 12587 | 6.6% |
구 | 10754 | 5.6% |
로 | 10519 | 5.5% |
시 | 10481 | 5.5% |
울 | 10077 | 5.3% |
특 | 10007 | 5.2% |
별 | 10005 | 5.2% |
호 | 5969 | 3.1% |
길 | 5497 | 2.9% |
Other values (524) | 92558 |
Latin
Value | Count | Frequency (%) |
B | 400 | |
A | 194 | |
C | 87 | 6.3% |
S | 77 | 5.6% |
D | 58 | 4.2% |
K | 55 | 4.0% |
e | 44 | 3.2% |
M | 42 | 3.0% |
L | 36 | 2.6% |
R | 35 | 2.5% |
Other values (38) | 352 |
Common
Value | Count | Frequency (%) |
(space) | 50214 | 38.2% |
1 | 18279 | 13.9% |
) | 8752 | 6.7% |
( | 8748 | 6.7% |
2 | 7251 | 5.5% |
0 | 6548 | 5.0% |
3 | 5540 | 4.2% |
, | 4648 | 3.5% |
4 | 4212 | 3.2% |
5 | 3786 | 2.9% |
Other values (18) | 13524 | 10.3% |
Han
Value | Count | Frequency (%) |
利 | 1 | |
景 | 1 | |
家 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 191345 | |
ASCII | 132866 | |
None | 57 | < 0.1% |
Number Forms | 12 | < 0.1% |
CJK | 3 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 50214 | 37.8% |
1 | 18279 | 13.8% |
) | 8752 | 6.6% |
( | 8748 | 6.6% |
2 | 7251 | 5.5% |
0 | 6548 | 4.9% |
3 | 5540 | 4.2% |
, | 4648 | 3.5% |
4 | 4212 | 3.2% |
5 | 3786 | 2.8% |
Other values (62) | 14888 | 11.2% |
Hangul
Value | Count | Frequency (%) |
동 | 12945 | 6.8% |
서 | 12587 | 6.6% |
구 | 10754 | 5.6% |
로 | 10519 | 5.5% |
시 | 10481 | 5.5% |
울 | 10077 | 5.3% |
특 | 10007 | 5.2% |
별 | 10005 | 5.2% |
호 | 5969 | 3.1% |
길 | 5497 | 2.9% |
Other values (523) | 92504 |
None
Value | Count | Frequency (%) |
㈕ | 54 | |
? | 3 | 5.3% |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 8 | |
Ⅱ | 4 |
Enclosed Alphanum
Value | Count | Frequency (%) |
⑴ | 1 |
CJK
Value | Count | Frequency (%) |
利 | 1 | |
景 | 1 | |
家 | 1 |
중개업등록번호
Text
Distinct | 9999 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 20 |
---|---|
Median length | 16 |
Mean length | 14.7064 |
Min length | 4 |
Characters and Unicode
Total characters | 147064 |
---|---|
Distinct characters | 21 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 9998 |
---|---|
Unique (%) | > 99.9% |
Sample
1st row | 11680-2022-00518 |
---|---|
2nd row | 공92220000-1731 |
3rd row | 나-92460000-177 |
4th row | 11650-2023-00259 |
5th row | 11560-2021-00017 |
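The sample rows show at least two registration-number styles: a hyphenated 5-4-5 digit form such as 11680-2022-00518, and older-looking forms that start with a Hangul character. The sketch below is one hypothetical way to tag each value by shape before further cleaning. The pattern and label names are assumptions, not categories defined by the data provider.

```python
import re

# Assumed shape of the newer style: 5 digits, 4 digits, 5 digits.
MODERN = re.compile(r"\d{5}-\d{4}-\d{5}")
# Any value that starts with a Hangul syllable.
HANGUL_PREFIX = re.compile(r"[가-힣]")


def registration_format(value):
    """Return a coarse, illustrative label for a registration number string."""
    value = value.strip()
    if MODERN.fullmatch(value):
        return "district-year-serial"
    if HANGUL_PREFIX.match(value):
        return "hangul-prefixed"
    return "other"


# Values taken from the sample and frequency tables in this section.
samples = ["11680-2022-00518", "공92220000-1731", "나-92460000-177", "9250-143"]
labels = [registration_format(s) for s in samples]
print(list(zip(samples, labels)))
```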
Value | Count | Frequency (%) |
9250-143 | 2 | < 0.1% |
9251-8798 | 1 | < 0.1% |
92380000-3994 | 1 | < 0.1% |
11350-2023-00049 | 1 | < 0.1% |
11590-2019-00088 | 1 | < 0.1% |
92460000-1369 | 1 | < 0.1% |
11170-2021-00122 | 1 | < 0.1% |
11305-2019-00027 | 1 | < 0.1% |
11410-2021-00004 | 1 | < 0.1% |
11305-2024-00021 | 1 | < 0.1% |
Other values (9989) | 9989 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 44025 | |
1 | 23630 | |
2 | 21231 | |
- | 16646 | 11.3% |
3 | 6907 | 4.7% |
9 | 6802 | 4.6% |
5 | 6546 | 4.5% |
4 | 6532 | 4.4% |
6 | 5216 | 3.5% |
8 | 4367 | 3.0% |
Other values (11) | 5162 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 129623 | |
Dash Punctuation | 16646 | 11.3% |
Other Letter | 794 | 0.5% |
Control | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 44025 | |
1 | 23630 | |
2 | 21231 | |
3 | 6907 | 5.3% |
9 | 6802 | 5.2% |
5 | 6546 | 5.1% |
4 | 6532 | 5.0% |
6 | 5216 | 4.0% |
8 | 4367 | 3.4% |
7 | 4367 | 3.4% |
Other Letter
Value | Count | Frequency (%) |
가 | 290 | |
공 | 176 | |
다 | 150 | |
인 | 113 | 14.2% |
나 | 58 | 7.3% |
법 | 4 | 0.5% |
예 | 1 | 0.1% |
산 | 1 | 0.1% |
주 | 1 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16646 |
Control
Value | Count | Frequency (%) |
(control character) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 146270 | |
Hangul | 794 | 0.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 44025 | |
1 | 23630 | |
2 | 21231 | |
- | 16646 | 11.4% |
3 | 6907 | 4.7% |
9 | 6802 | 4.7% |
5 | 6546 | 4.5% |
4 | 6532 | 4.5% |
6 | 5216 | 3.6% |
8 | 4367 | 3.0% |
Other values (2) | 4368 | 3.0% |
Hangul
Value | Count | Frequency (%) |
가 | 290 | |
공 | 176 | |
다 | 150 | |
인 | 113 | 14.2% |
나 | 58 | 7.3% |
법 | 4 | 0.5% |
예 | 1 | 0.1% |
산 | 1 | 0.1% |
주 | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 146270 | |
Hangul | 794 | 0.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 44025 | |
1 | 23630 | |
2 | 21231 | |
- | 16646 | 11.4% |
3 | 6907 | 4.7% |
9 | 6802 | 4.7% |
5 | 6546 | 4.5% |
4 | 6532 | 4.5% |
6 | 5216 | 3.6% |
8 | 4367 | 3.0% |
Other values (2) | 4368 | 3.0% |
Hangul
Value | Count | Frequency (%) |
가 | 290 | |
공 | 176 | |
다 | 150 | |
인 | 113 | 14.2% |
나 | 58 | 7.3% |
법 | 4 | 0.5% |
예 | 1 | 0.1% |
산 | 1 | 0.1% |
주 | 1 | 0.1% |
중개업자명
Text
Distinct | 8306 |
---|---|
Distinct (%) | 83.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
김정희 | 14 | 0.1% |
김정숙 | 12 | 0.1% |
김영숙 | 11 | 0.1% |
이정희 | 10 | 0.1% |
김현숙 | 10 | 0.1% |
김미희 | 9 | 0.1% |
김민정 | 9 | 0.1% |
김선희 | 9 | 0.1% |
김미숙 | 9 | 0.1% |
이경숙 | 8 | 0.1% |
Other values (8300) | 9903 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 2092 | 7.0% |
이 | 1514 | 5.1% |
정 | 1222 | 4.1% |
영 | 987 | 3.3% |
박 | 812 | 2.7% |
희 | 740 | 2.5% |
경 | 633 | 2.1% |
현 | 593 | 2.0% |
숙 | 571 | 1.9% |
성 | 502 | 1.7% |
Other values (314) | 20274 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29901 | |
Uppercase Letter | 33 | 0.1% |
Space Separator | 4 | < 0.1% |
Open Punctuation | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 2092 | 7.0% |
이 | 1514 | 5.1% |
정 | 1222 | 4.1% |
영 | 987 | 3.3% |
박 | 812 | 2.7% |
희 | 740 | 2.5% |
경 | 633 | 2.1% |
현 | 593 | 2.0% |
숙 | 571 | 1.9% |
성 | 502 | 1.7% |
Other values (295) | 20235 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 6 | |
I | 5 | |
H | 4 | |
A | 3 | |
G | 2 | 6.1% |
J | 2 | 6.1% |
O | 2 | 6.1% |
D | 1 | 3.0% |
U | 1 | 3.0% |
C | 1 | 3.0% |
Other values (6) | 6 |
Space Separator
Value | Count | Frequency (%) |
(space) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29901 | |
Latin | 33 | 0.1% |
Common | 6 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 2092 | 7.0% |
이 | 1514 | 5.1% |
정 | 1222 | 4.1% |
영 | 987 | 3.3% |
박 | 812 | 2.7% |
희 | 740 | 2.5% |
경 | 633 | 2.1% |
현 | 593 | 2.0% |
숙 | 571 | 1.9% |
성 | 502 | 1.7% |
Other values (295) | 20235 |
Latin
Value | Count | Frequency (%) |
N | 6 | |
I | 5 | |
H | 4 | |
A | 3 | |
G | 2 | 6.1% |
J | 2 | 6.1% |
O | 2 | 6.1% |
D | 1 | 3.0% |
U | 1 | 3.0% |
C | 1 | 3.0% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
(space) | 4 | 66.7% |
( | 1 | 16.7% |
) | 1 | 16.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29901 | |
ASCII | 39 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 2092 | 7.0% |
이 | 1514 | 5.1% |
정 | 1222 | 4.1% |
영 | 987 | 3.3% |
박 | 812 | 2.7% |
희 | 740 | 2.5% |
경 | 633 | 2.1% |
현 | 593 | 2.0% |
숙 | 571 | 1.9% |
성 | 502 | 1.7% |
Other values (295) | 20235 |
ASCII
Value | Count | Frequency (%) |
N | 6 | |
I | 5 | |
(space) | 4 | 10.3% |
H | 4 | |
A | 3 | 7.7% |
G | 2 | 5.1% |
J | 2 | 5.1% |
O | 2 | 5.1% |
D | 1 | 2.6% |
U | 1 | 2.6% |
Other values (9) | 9 |
사업자상호
Text
Distinct | 6143 |
---|---|
Distinct (%) | 61.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 37 |
---|---|
Median length | 34 |
Mean length | 11.8848 |
Min length | 5 |
Characters and Unicode
Total characters | 118848 |
---|---|
Distinct characters | 757 |
Distinct categories | 12 |
Distinct scripts | 4 |
Distinct blocks | 5 |
Unique
Unique | 5238 |
---|---|
Unique (%) | 52.4% |
Sample
1st row | 압구정서울공인중개사사무소 |
---|---|
2nd row | 필공인중개사사무소 |
3rd row | 중앙부동산중개인사무소 |
4th row | 강남삼부공인중개사사무소 |
5th row | 강산공인중개사사무소 |
Value | Count | Frequency (%) |
공인중개사사무소 | 119 | 1.2% |
현대공인중개사사무소 | 100 | 1.0% |
삼성공인중개사사무소 | 85 | 0.8% |
미래공인중개사사무소 | 67 | 0.7% |
우리공인중개사사무소 | 62 | 0.6% |
주식회사 | 54 | 0.5% |
하나공인중개사사무소 | 52 | 0.5% |
행운공인중개사사무소 | 46 | 0.4% |
중앙공인중개사사무소 | 43 | 0.4% |
태양공인중개사사무소 | 42 | 0.4% |
Other values (6160) | 9577 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 18719 | |
중 | 10101 | 8.5% |
개 | 10060 | 8.5% |
인 | 9508 | 8.0% |
소 | 9390 | 7.9% |
무 | 9331 | 7.9% |
공 | 8937 | 7.5% |
동 | 2947 | 2.5% |
산 | 2807 | 2.4% |
부 | 2746 | 2.3% |
Other values (747) | 34302 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 113332 | |
Decimal Number | 2736 | 2.3% |
Uppercase Letter | 878 | 0.7% |
Close Punctuation | 489 | 0.4% |
Open Punctuation | 487 | 0.4% |
Dash Punctuation | 442 | 0.4% |
Space Separator | 252 | 0.2% |
Lowercase Letter | 177 | 0.1% |
Other Punctuation | 45 | < 0.1% |
Letter Number | 5 | < 0.1% |
Other values (2) | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 18719 | |
중 | 10101 | 8.9% |
개 | 10060 | 8.9% |
인 | 9508 | 8.4% |
소 | 9390 | 8.3% |
무 | 9331 | 8.2% |
공 | 8937 | 7.9% |
동 | 2947 | 2.6% |
산 | 2807 | 2.5% |
부 | 2746 | 2.4% |
Other values (670) | 28786 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 147 | |
S | 94 | 10.7% |
O | 77 | 8.8% |
A | 52 | 5.9% |
M | 48 | 5.5% |
L | 43 | 4.9% |
C | 39 | 4.4% |
B | 36 | 4.1% |
D | 36 | 4.1% |
E | 33 | 3.8% |
Other values (16) | 273 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 61 | |
h | 15 | 8.5% |
o | 11 | 6.2% |
i | 10 | 5.6% |
s | 9 | 5.1% |
a | 9 | 5.1% |
n | 8 | 4.5% |
p | 7 | 4.0% |
t | 7 | 4.0% |
l | 6 | 3.4% |
Other values (11) | 34 |
Decimal Number
Value | Count | Frequency (%) |
0 | 588 | |
2 | 336 | |
1 | 332 | |
4 | 266 | |
9 | 262 | |
3 | 240 | |
8 | 230 | 8.4% |
7 | 196 | 7.2% |
5 | 155 | 5.7% |
6 | 131 | 4.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 25 | |
& | 9 | 20.0% |
? | 2 | 4.4% |
/ | 2 | 4.4% |
, | 2 | 4.4% |
? | 1 | 2.2% |
' | 1 | 2.2% |
; | 1 | 2.2% |
% | 1 | 2.2% |
@ | 1 | 2.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 488 | |
] | 1 | 0.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 486 | |
[ | 1 | 0.2% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 3 | |
Ⅰ | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 442 |
Space Separator
Value | Count | Frequency (%) |
(space) | 252 |
Math Symbol
Value | Count | Frequency (%) |
+ | 3 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 113328 | |
Common | 4454 | 3.7% |
Latin | 1060 | 0.9% |
Han | 6 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 18719 | |
중 | 10101 | 8.9% |
개 | 10060 | 8.9% |
인 | 9508 | 8.4% |
소 | 9390 | 8.3% |
무 | 9331 | 8.2% |
공 | 8937 | 7.9% |
동 | 2947 | 2.6% |
산 | 2807 | 2.5% |
부 | 2746 | 2.4% |
Other values (667) | 28782 |
Latin
Value | Count | Frequency (%) |
K | 147 | 13.9% |
S | 94 | 8.9% |
O | 77 | 7.3% |
e | 61 | 5.8% |
A | 52 | 4.9% |
M | 48 | 4.5% |
L | 43 | 4.1% |
C | 39 | 3.7% |
B | 36 | 3.4% |
D | 36 | 3.4% |
Other values (39) | 427 |
Common
Value | Count | Frequency (%) |
0 | 588 | |
) | 488 | |
( | 486 | |
- | 442 | |
2 | 336 | |
1 | 332 | |
4 | 266 | 6.0% |
9 | 262 | 5.9% |
(space) | 252 | 5.7% |
3 | 240 | 5.4% |
Other values (17) | 762 |
Han
Value | Count | Frequency (%) |
秀 | 3 | |
美 | 1 | 16.7% |
善 | 1 | 16.7% |
正 | 1 | 16.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 113326 | |
ASCII | 5508 | 4.6% |
CJK | 6 | < 0.1% |
Number Forms | 5 | < 0.1% |
None | 3 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 18719 | |
중 | 10101 | 8.9% |
개 | 10060 | 8.9% |
인 | 9508 | 8.4% |
소 | 9390 | 8.3% |
무 | 9331 | 8.2% |
공 | 8937 | 7.9% |
동 | 2947 | 2.6% |
산 | 2807 | 2.5% |
부 | 2746 | 2.4% |
Other values (666) | 28780 |
ASCII
Value | Count | Frequency (%) |
0 | 588 | 10.7% |
) | 488 | 8.9% |
( | 486 | 8.8% |
- | 442 | 8.0% |
2 | 336 | 6.1% |
1 | 332 | 6.0% |
4 | 266 | 4.8% |
9 | 262 | 4.8% |
(space) | 252 | 4.6% |
3 | 240 | 4.4% |
Other values (63) | 1816 |
CJK
Value | Count | Frequency (%) |
秀 | 3 | |
美 | 1 | 16.7% |
善 | 1 | 16.7% |
正 | 1 | 16.7% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 3 | |
Ⅰ | 2 |
None
Value | Count | Frequency (%) |
㈜ | 2 | |
? | 1 |
전화번호
Text
MISSING
Distinct | 8461 |
---|---|
Distinct (%) | 93.6% |
Missing | 965 |
Missing (%) | 9.7% |
Memory size | 156.2 KiB |
Length
Max length | 480 |
---|---|
Median length | 466 |
Mean length | 11.862313 |
Min length | 1 |
Characters and Unicode
Total characters | 107176 |
---|---|
Distinct characters | 52 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 8424 |
---|---|
Unique (%) | 93.2% |
Sample
1st row | 02-546-6262 |
---|---|
2nd row | 02-2238-3113 |
3rd row | 02-813-9254 |
4th row | 02-575-2070 |
5th row | 408-6700 |
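The sample rows mix numbers with and without an area code (02-546-6262 versus 408-6700), and the frequency table also contains a number such as 1588-4802. The sketch below shows one hypothetical normalization. It assumes that a bare local number belongs to the Seoul area code 02, and that an eight-digit number starting with 1 is a nationwide number that takes no area code. Both rules are assumptions and would need checking against the real data.

```python
import re

# Assumed shapes; neither pattern is specified by the data provider.
NATIONWIDE = re.compile(r"1\d{3}-\d{4}")   # e.g. 1588-4802
LOCAL_ONLY = re.compile(r"\d{3,4}-\d{4}")  # e.g. 408-6700


def normalize_phone(raw, area_code="02"):
    """Prefix bare local numbers with an assumed area code; leave the rest unchanged."""
    number = raw.strip()
    if NATIONWIDE.fullmatch(number):
        return number
    if LOCAL_ONLY.fullmatch(number):
        return f"{area_code}-{number}"
    return number


# Values taken from the sample and frequency tables in this section.
samples = ["02-546-6262", "408-6700", "1588-4802"]
normalized = [normalize_phone(s) for s in samples]
print(normalized)
```

Entries holding several numbers or free text, which the long maximum length suggests exist, are returned unchanged by this sketch.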
Value | Count | Frequency (%) |
(blank) | 685 | 6.6% |
02 | 5 | < 0.1% |
추가 | 4 | < 0.1% |
1588-4802 | 3 | < 0.1% |
02-766-4700 | 2 | < 0.1% |
02-909-9300 | 2 | < 0.1% |
02-445-0002 | 2 | < 0.1% |
02-943-4800 | 2 | < 0.1% |
02-3437-4000 | 2 | < 0.1% |
354-6699 | 2 | < 0.1% |
Other values (9644) | 9688 |
Most occurring characters
Value | Count | Frequency (%) |
- | 16889 | |
0 | 16565 | |
2 | 14083 | |
4 | 8363 | |
5 | 7530 | |
8 | 7507 | |
9 | 7359 | |
3 | 6927 | |
6 | 6100 | 5.7% |
7 | 5965 | 5.6% |
Other values (42) | 9888 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 86057 | |
Dash Punctuation | 16889 | 15.8% |
Other Punctuation | 1908 | 1.8% |
Space Separator | 1784 | 1.7% |
Other Letter | 237 | 0.2% |
Close Punctuation | 126 | 0.1% |
Open Punctuation | 103 | 0.1% |
Math Symbol | 49 | < 0.1% |
Lowercase Letter | 23 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
추 | 64 | |
가 | 64 | |
경 | 12 | 5.1% |
변 | 12 | 5.1% |
서 | 10 | 4.2% |
의 | 10 | 4.2% |
동 | 10 | 4.2% |
외 | 5 | 2.1% |
전 | 4 | 1.7% |
록 | 4 | 1.7% |
Other values (19) | 42 |
Decimal Number
Value | Count | Frequency (%) |
0 | 16565 | |
2 | 14083 | |
4 | 8363 | |
5 | 7530 | |
8 | 7507 | |
9 | 7359 | |
3 | 6927 | |
6 | 6100 | 7.1% |
7 | 5965 | 6.9% |
1 | 5658 | 6.6% |
Lowercase Letter
Value | Count | Frequency (%) |
r | 10 | |
q | 10 | |
f | 1 | 4.3% |
a | 1 | 4.3% |
x | 1 | 4.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1675 | |
. | 208 | 10.9% |
/ | 25 | 1.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16889 |
Space Separator
Value | Count | Frequency (%) |
(space) | 1784 |
Close Punctuation
Value | Count | Frequency (%) |
) | 126 |
Open Punctuation
Value | Count | Frequency (%) |
( | 103 |
Math Symbol
Value | Count | Frequency (%) |
~ | 49 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 106916 | |
Hangul | 237 | 0.2% |
Latin | 23 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
추 | 64 | |
가 | 64 | |
경 | 12 | 5.1% |
변 | 12 | 5.1% |
서 | 10 | 4.2% |
의 | 10 | 4.2% |
동 | 10 | 4.2% |
외 | 5 | 2.1% |
전 | 4 | 1.7% |
록 | 4 | 1.7% |
Other values (19) | 42 |
Common
Value | Count | Frequency (%) |
- | 16889 | |
0 | 16565 | |
2 | 14083 | |
4 | 8363 | |
5 | 7530 | |
8 | 7507 | |
9 | 7359 | |
3 | 6927 | |
6 | 6100 | 5.7% |
7 | 5965 | 5.6% |
Other values (8) | 9628 |
Latin
Value | Count | Frequency (%) |
r | 10 | |
q | 10 | |
f | 1 | 4.3% |
a | 1 | 4.3% |
x | 1 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 106939 | 99.8% |
Hangul | 237 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 16889 | 15.8% |
0 | 16565 | 15.5% |
2 | 14083 | 13.2% |
4 | 8363 | 7.8% |
5 | 7530 | 7.0% |
8 | 7507 | 7.0% |
9 | 7359 | 6.9% |
3 | 6927 | 6.5% |
6 | 6100 | 5.7% |
7 | 5965 | 5.6% |
Other values (13) | 9651 | 9.0% |
Hangul
Value | Count | Frequency (%) |
추 | 64 | 27.0% |
가 | 64 | 27.0% |
경 | 12 | 5.1% |
변 | 12 | 5.1% |
서 | 10 | 4.2% |
의 | 10 | 4.2% |
동 | 10 | 4.2% |
외 | 5 | 2.1% |
전 | 4 | 1.7% |
록 | 4 | 1.7% |
Other values (19) | 42 | 17.7% |
상태구분
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
영업중 | 9960 |
---|---|
휴업 | 28 |
업무정지 | 11 |
휴업연장 | 1 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.9984 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 영업중 |
---|---|
2nd row | 영업중 |
3rd row | 영업중 |
4th row | 영업중 |
5th row | 영업중 |
Common Values
Value | Count | Frequency (%) |
영업중 | 9960 | 99.6% |
휴업 | 28 | 0.3% |
업무정지 | 11 | 0.1% |
휴업연장 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
영업중 | 9960 | 99.6% |
휴업 | 28 | 0.3% |
업무정지 | 11 | 0.1% |
휴업연장 | 1 | < 0.1% |
행정처분 시작일
Date
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 90.9% |
Missing | 9989 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Minimum | 2024-02-28 00:00:00 |
---|---|
Maximum | 2024-05-10 00:00:00 |
행정처분 종료일
Date
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 81.8% |
Missing | 9989 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Minimum | 2024-05-17 00:00:00 |
---|---|
Maximum | 2024-10-21 00:00:00 |
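The Distinct (%) values for 행정처분 시작일 and 행정처분 종료일 look high next to a 99.9% missing rate. They are consistent with the percentage being taken over non-missing rows only. The sketch below reproduces that arithmetic under this assumption; `distinct_pct` is a hypothetical helper written for illustration, not a ydata-profiling function.

```python
def distinct_pct(n_distinct: int, n_rows: int, n_missing: int) -> float:
    """Share of distinct values among non-missing rows, in percent.

    Assumption: the denominator is the non-missing count, not the row count.
    """
    n_present = n_rows - n_missing
    if n_present <= 0:
        raise ValueError("no non-missing rows")
    return 100.0 * n_distinct / n_present


# Counts copied from the two date sections above.
start_pct = distinct_pct(n_distinct=10, n_rows=10000, n_missing=9989)
end_pct = distinct_pct(n_distinct=9, n_rows=10000, n_missing=9989)

print(f"{start_pct:.1f}% {end_pct:.1f}%")  # prints "90.9% 81.8%"
```

With only 11 non-missing rows in each column, these percentages say little about the column as a whole.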
조회 개수
Real number (ℝ)
Distinct | 2146 |
---|---|
Distinct (%) | 21.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 649.4873 |
Minimum | 1 |
---|---|
Maximum | 3055 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 54 |
Q1 | 261.75 |
median | 520 |
Q3 | 838 |
95-th percentile | 1753 |
Maximum | 3055 |
Range | 3054 |
Interquartile range (IQR) | 576.25 |
Descriptive statistics
Standard deviation | 561.10296 |
---|---|
Coefficient of variation (CV) | 0.86391675 |
Kurtosis | 3.6741131 |
Mean | 649.4873 |
Median Absolute Deviation (MAD) | 283 |
Skewness | 1.7597474 |
Sum | 6494873 |
Variance | 314836.53 |
Monotonicity | Not monotonic |
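Several of the 조회 개수 figures above can be derived from the others. The following sketch recomputes them from the reported values using the textbook definitions; it is a consistency check on the rounded numbers in this report, not the profiler's own code.

```python
# Values copied from the 조회 개수 tables above (already rounded by the report).
mean, std = 649.4873, 561.10296
q1, q3 = 261.75, 838.0
minimum, maximum = 1, 3055

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV={cv:.6f} variance={variance:.2f} IQR={iqr} range={value_range}")
```

The recomputed values agree with the reported CV (0.86391675), variance (314836.53), IQR (576.25) and range (3054) up to rounding.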
Value | Count | Frequency (%) |
504 | 17 | 0.2% |
368 | 16 | 0.2% |
591 | 16 | 0.2% |
177 | 15 | 0.1% |
283 | 15 | 0.1% |
549 | 15 | 0.1% |
56 | 15 | 0.1% |
229 | 15 | 0.1% |
574 | 15 | 0.1% |
154 | 15 | 0.1% |
Other values (2136) | 9846 | 98.5% |
Value | Count | Frequency (%) |
1 | 11 | 0.1% |
2 | 12 | 0.1% |
3 | 9 | 0.1% |
4 | 8 | 0.1% |
5 | 9 | 0.1% |
6 | 11 | 0.1% |
7 | 8 | 0.1% |
8 | 10 | 0.1% |
9 | 7 | 0.1% |
10 | 12 | 0.1% |
Value | Count | Frequency (%) |
3055 | 1 | < 0.1% |
3051 | 1 | < 0.1% |
3049 | 1 | < 0.1% |
3045 | 1 | < 0.1% |
3044 | 1 | < 0.1% |
3043 | 1 | < 0.1% |
3040 | 1 | < 0.1% |
3039 | 1 | < 0.1% |
3035 | 1 | < 0.1% |
3034 | 1 | < 0.1% |
도로명코드
Real number (ℝ)
Distinct | 3389 |
---|---|
Distinct (%) | 33.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1473213 × 1011 |
Minimum | 1.111021 × 1011 |
---|---|
Maximum | 1.1740486 × 1011 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.111021 × 1011 |
---|---|
5-th percentile | 1.1170301 × 1011 |
Q1 | 1.1305301 × 1011 |
median | 1.1500316 × 1011 |
Q3 | 1.1650416 × 1011 |
95-th percentile | 1.1710485 × 1011 |
Maximum | 1.1740486 × 1011 |
Range | 6.302758 × 109 |
Interquartile range (IQR) | 3.4511583 × 109 |
Descriptive statistics
Standard deviation | 1.9051638 × 109 |
---|---|
Coefficient of variation (CV) | 0.016605321 |
Kurtosis | -1.2472515 |
Mean | 1.1473213 × 1011 |
Median Absolute Deviation (MAD) | 1.799028 × 109 |
Skewness | -0.28220608 |
Sum | 1.1473213 × 1015 |
Variance | 3.6296492 × 1018 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
116803122006 | 70 | 0.7% |
117103123023 | 51 | 0.5% |
116503121017 | 51 | 0.5% |
114703114003 | 50 | 0.5% |
116803122005 | 49 | 0.5% |
113503000001 | 48 | 0.5% |
115003115001 | 47 | 0.5% |
115453117001 | 45 | 0.4% |
115002005007 | 45 | 0.4% |
116803122010 | 44 | 0.4% |
Other values (3379) | 9500 | 95.0% |
Value | Count | Frequency (%) |
111102100001 | 2 | < 0.1% |
111102100002 | 2 | < 0.1% |
111103000008 | 4 | < 0.1% |
111103005003 | 1 | < 0.1% |
111103005004 | 2 | < 0.1% |
111103005006 | 4 | < 0.1% |
111103005007 | 5 | 0.1% |
111103005008 | 1 | < 0.1% |
111103100002 | 7 | 0.1% |
111103100003 | 2 | < 0.1% |
Value | Count | Frequency (%) |
117404858048 | 1 | < 0.1% |
117404858046 | 1 | < 0.1% |
117404172446 | 1 | < 0.1% |
117404172435 | 3 | < 0.1% |
117404172431 | 2 | < 0.1% |
117404172430 | 1 | < 0.1% |
117404172429 | 1 | < 0.1% |
117404172428 | 2 | < 0.1% |
117404172426 | 1 | < 0.1% |
117404172425 | 1 | < 0.1% |
건물
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 9745 |
---|---|
<NA> | 236 |
(blank) | 18 |
1 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0708 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9745 | 97.5% |
<NA> | 236 | 2.4% |
(blank) | 18 | 0.2% |
1 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9745 | 97.5% |
<NA> | 236 | 2.4% |
(blank) | 18 | 0.2% |
1 | 1 | < 0.1% |
건물 본번
Real number (ℝ)
Distinct | 792 |
---|---|
Distinct (%) | 7.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 145.5884 |
Minimum | 1 |
---|---|
Maximum | 2936 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6 |
Q1 | 19 |
median | 48 |
Q3 | 158.25 |
95-th percentile | 537 |
Maximum | 2936 |
Range | 2935 |
Interquartile range (IQR) | 139.25 |
Descriptive statistics
Standard deviation | 278.13098 |
---|---|
Coefficient of variation (CV) | 1.9103924 |
Kurtosis | 35.239548 |
Mean | 145.5884 |
Median Absolute Deviation (MAD) | 38 |
Skewness | 5.0693189 |
Sum | 1455884 |
Variance | 77356.842 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 201 | 2.0% |
7 | 183 | 1.8% |
8 | 180 | 1.8% |
10 | 173 | 1.7% |
11 | 162 | 1.6% |
16 | 160 | 1.6% |
17 | 149 | 1.5% |
15 | 147 | 1.5% |
9 | 147 | 1.5% |
14 | 144 | 1.4% |
Other values (782) | 8354 | 83.5% |
Value | Count | Frequency (%) |
1 | 56 | 0.6% |
2 | 79 | 0.8% |
3 | 104 | 1.0% |
4 | 90 | 0.9% |
5 | 140 | 1.4% |
6 | 201 | 2.0% |
7 | 183 | 1.8% |
8 | 180 | 1.8% |
9 | 147 | 1.5% |
10 | 173 | 1.7% |
Value | Count | Frequency (%) |
2936 | 1 | < 0.1% |
2921 | 5 | 0.1% |
2917 | 8 | 0.1% |
2912 | 6 | 0.1% |
2806 | 1 | < 0.1% |
2803 | 3 | < 0.1% |
2737 | 1 | < 0.1% |
2728 | 1 | < 0.1% |
2615 | 1 | < 0.1% |
2340 | 2 | < 0.1% |
건물 부번
Real number (ℝ)
MISSING
  ZEROS
 
Distinct | 45 |
---|---|
Distinct (%) | 0.5% |
Missing | 203 |
Missing (%) | 2.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.60763499 |
Minimum | 0 |
---|---|
Maximum | 91 |
Zeros | 8912 |
Zeros (%) | 89.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 91 |
Range | 91 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 3.279229 |
---|---|
Coefficient of variation (CV) | 5.3967087 |
Kurtosis | 135.36582 |
Mean | 0.60763499 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 9.5217613 |
Sum | 5953 |
Variance | 10.753343 |
Monotonicity | Not monotonic |
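The 건물 부번 figures mix two denominators: the reported mean matches Sum divided by the non-missing count, while Zeros (%) matches the zero count divided by all rows. The sketch below checks both readings against the numbers above; that the profiler computes them this way is an inference from those numbers, not something the report states.

```python
# Values copied from the 건물 부번 tables above.
n_rows, n_missing = 10000, 203
total, n_zeros = 5953, 8912
std = 3.279229

n_present = n_rows - n_missing        # rows with a value
mean = total / n_present              # mean over non-missing rows only
zeros_pct = 100.0 * n_zeros / n_rows  # zeros as a share of all rows
cv = std / mean                       # coefficient of variation

print(f"n_present={n_present} mean={mean:.8f} zeros={zeros_pct:.1f}% CV={cv:.5f}")
```

Dividing the sum by all 10000 rows would give 0.5953 instead, which does not match the reported mean of 0.60763499.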
Value | Count | Frequency (%) |
0 | 8912 | 89.1% |
1 | 342 | 3.4% |
2 | 83 | 0.8% |
3 | 45 | 0.4% |
5 | 37 | 0.4% |
6 | 36 | 0.4% |
4 | 35 | 0.4% |
7 | 34 | 0.3% |
8 | 32 | 0.3% |
10 | 26 | 0.3% |
Other values (35) | 215 | 2.1% |
(Missing) | 203 | 2.0% |
Value | Count | Frequency (%) |
0 | 8912 | 89.1% |
1 | 342 | 3.4% |
2 | 83 | 0.8% |
3 | 45 | 0.4% |
4 | 35 | 0.4% |
5 | 37 | 0.4% |
6 | 36 | 0.4% |
7 | 34 | 0.3% |
8 | 32 | 0.3% |
9 | 21 | 0.2% |
Value | Count | Frequency (%) |
91 | 1 | < 0.1% |
55 | 3 | < 0.1% |
53 | 1 | < 0.1% |
48 | 1 | < 0.1% |
47 | 1 | < 0.1% |
43 | 2 | < 0.1% |
42 | 1 | < 0.1% |
40 | 1 | < 0.1% |
38 | 1 | < 0.1% |
37 | 2 | < 0.1% |
(index) | 시스템등록번호 | 시군구코드 | 법정동코드 | 자치구명 | 법정동명 | 지번구분 | 본번 | 부번 | 주소 | 중개업등록번호 | 중개업자명 | 사업자상호 | 전화번호 | 상태구분 | 행정처분 시작일 | 행정처분 종료일 | 조회 개수 | 도로명코드 | 건물 | 건물 본번 | 건물 부번 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
21756 | 116802022000682 | 11680 | 1168011000 | 강남구 | 압구정동 | 1 | 369 | 1 | 서울특별시 강남구 압구정로29길 71 1층 105호(압구정동, 점포4동) | 11680-2022-00518 | 정지수 | 압구정서울공인중개사사무소 | 02-546-6262 | 영업중 | <NA> | <NA> | 1611 | 116804166514 | 0 | 71 | 0 |
2665 | 111402008000052 | 11140 | 1114016500 | 중구 | 황학동 | 1 | 811 | 0 | 서울특별시 중구 난계로11길 8 1층 | 공92220000-1731 | 김은숙 | 필공인중개사사무소 | 02-2238-3113 | 영업중 | <NA> | <NA> | 478 | 111404103003 | 0 | 8 | 0 |
11178 | 115901984000021 | 11590 | 1159010400 | 동작구 | 본동 | 1 | 48 | 0 | 서울특별시 동작구 노량진로 252 (본동) | 나-92460000-177 | 송우용 | 중앙부동산중개인사무소 | 02-813-9254 | 영업중 | <NA> | <NA> | 843 | 115903119011 | 0 | 252 | 0 |
17117 | 116502023000263 | 11650 | 1165010800 | 서초구 | 서초동 | 1 | 1339 | 4 | 서울특별시 서초구 효령로 429 , 111호(서초동, 강남 삼부르네상스시티) | 11650-2023-00259 | 지정원 | 강남삼부공인중개사사무소 | <NA> | 영업중 | <NA> | <NA> | 1031 | 116503121021 | 0 | 429 | 0 |
9981 | 115602021000019 | 11560 | 1156013200 | 영등포구 | 신길동 | 1 | 261 | 12 | 서울특별시 영등포구 가마산로 466 104호 | 11560-2021-00017 | 양해숙 | 강산공인중개사사무소 | <NA> | 영업중 | <NA> | <NA> | 369 | 115603000023 | 0 | 466 | 0 |
1711 | 116802019000407 | 11680 | 1168010300 | 강남구 | 개포동 | 1 | 158 | 0 | 서울특별시 강남구 선릉로 28 1층[101호](개포동, 일영빌딩) | 11680-2019-00344 | 심언우 | 황금공인중개사사무소 | 02-575-2070 | 영업중 | <NA> | <NA> | 827 | 116803122006 | 0 | 28 | 0 |
1220 | 117102004000211 | 11710 | 1171010700 | 송파구 | 가락동 | 1 | 98 | 0 | 서울특별시 송파구 송파대로30길 16 109호(가락동) | 9253-4490 | 송기출 | 재성공인중개사사무소 | 408-6700 | 영업중 | <NA> | <NA> | 1267 | 117104169334 | 0 | 16 | 0 |
9608 | 117402019000214 | 11740 | 1174010300 | 강동구 | 상일동 | 1 | 490 | 0 | 서울특별시 강동구 상일로 74 상가2동 102호 (상일동, 고덕리엔파크3단지) | 11740-2019-00214 | 권혁추 | 리엔한강(441-6789)공인중개사사무소 | 02-441-6789 | 영업중 | <NA> | <NA> | 271 | 117403124004 | 0 | 74 | 0 |
17804 | 117402019000241 | 11740 | 1174010900 | 강동구 | 천호동 | 1 | 328 | 1 | 서울특별시 강동구 천중로 6 제1층 제108호(천호동) | 11740-2019-00241 | 유한승 | 갤럭시공인중개사사무소 | 02-471-7107 | 영업중 | <NA> | <NA> | 280 | 117403124009 | 0 | 6 | 0 |
11198 | 115602019000244 | 11560 | 1156011000 | 영등포구 | 여의도동 | 1 | 41 | 2 | 서울특별시 영등포구 여의대방로 417 108호(여의도동) | 11560-2019-00243 | 정우영 | 고바우공인중개사사무소 | 02-782-2459 | 영업중 | <NA> | <NA> | 272 | 115603118028 | 0 | 417 | 0 |
(index) | 시스템등록번호 | 시군구코드 | 법정동코드 | 자치구명 | 법정동명 | 지번구분 | 본번 | 부번 | 주소 | 중개업등록번호 | 중개업자명 | 사업자상호 | 전화번호 | 상태구분 | 행정처분 시작일 | 행정처분 종료일 | 조회 개수 | 도로명코드 | 건물 | 건물 본번 | 건물 부번 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
8638 | 116802019000156 | 11680 | 1168010400 | 강남구 | 청담동 | 1 | 134 | 20 | 서울특별시 강남구 학동로101길 26 1층 128호(청담동, 청담삼익상가) | 11680-2019-00138 | 김학돈 | 부자촌공인중개사사무소 | 02-517-4252 | 영업중 | <NA> | <NA> | 768 | 116804166770 | 0 | 26 | 0 |
20643 | 116802023000504 | 11680 | 1168010800 | 강남구 | 논현동 | 1 | 191 | 15 | 서울특별시 강남구 논현로 615 1층, 2층(논현동) | 11680-2023-00376 | 김주환 | 빌딩온부동산중개주식회사 | 02-2088-5477, 02-2088-1042, 6141, 8206, 8605, 8317, 1779, 0735, 8631, 8134, 8471, 3951, 1043, 1486, 1966, 0680, 5424, 2059, 8603, 5477 | 영업중 | <NA> | <NA> | 1864 | 116803121022 | 0 | 615 | 0 |
516 | 111702023000031 | 11170 | 1117012900 | 용산구 | 이촌동 | 1 | 300 | 10 | 서울특별시 용산구 이촌로 290 제10호 (이촌동, 점보상가) | 11170-2023-00028 | 홍경헌 | 라인부동산공인중개사사무소 | 02-790-4911 | 영업중 | <NA> | <NA> | 548 | 111703102008 | 0 | 290 | 0 |
13574 | 115302011000251 | 11530 | 1153011000 | 구로구 | 온수동 | 1 | 9 | 9 | 서울특별시 구로구 부일로 875 (온수동) | 92420000-3839 | 이안순 | 광개토공인중개사사무소 | 2060-1114 | 영업중 | <NA> | <NA> | 639 | 115303000019 | 0 | 875 | 0 |
9046 | 111102014000045 | 11110 | 1111012000 | 종로구 | 신문로1가 | 1 | 163 | 0 | 서울특별시 종로구 새문안로 92 광화문오피시아빌딩 제404-2호(신문로1가) | 92200000-2715 | 정현애 | LB공인중개사사무소 | 02-720-4020 | 영업중 | <NA> | <NA> | 496 | 111103005004 | 0 | 92 | 0 |
566 | 111402020000068 | 11140 | 1114016200 | 중구 | 신당동 | 1 | 300 | 20 | 서울특별시 중구 다산로33길 2 1층 (신당동) | 11140-2020-00057 | 김만호 | 온나라공인중개사사무소 | 02-2252-8945 | 영업중 | <NA> | <NA> | 155 | 111404103050 | 0 | 2 | 0 |
21492 | 116802002000156 | 11680 | 1168010100 | 강남구 | 역삼동 | 1 | 740 | 0 | 서울특별시 강남구 테헤란로20길 25 1층(역삼동) | 9250-4299 | 정재영 | 동명공인중개사사무소 | 556-5365 | 영업중 | <NA> | <NA> | 2640 | 116804166723 | 0 | 25 | 0 |
22571 | 113802024000016 | 11380 | 1138010600 | 은평구 | 대조동 | 1 | 222 | 2 | 서울특별시 은평구 연서로20길 4 ,1층 | 11380-2024-00016 | 유정우 | 동신공인중개사사무소 | <NA> | 영업중 | <NA> | <NA> | 720 | 113804133151 | 0 | 4 | 0 |
14199 | 115302007000248 | 11530 | 1153010800 | 구로구 | 오류동 | 1 | 31 | 280 | 서울특별시 구로구 경인로 233 지하1층 B104호(오류동, 구로예미지어반코어) | 92420000-2793 | 이다경 | 예미지공인중개사사무소 | 02-2614-9999 | 영업중 | <NA> | <NA> | 540 | 115303000028 | 0 | 233 | 0 |
20766 | 116802023000470 | 11680 | 1168010100 | 강남구 | 역삼동 | 1 | 755 | 0 | 서울특별시 강남구 역삼로 310 1층 95호(역삼동, 한솔필리아) | 11680-2023-00354 | 송광동 | KD부동산중개 | <NA> | 영업중 | <NA> | <NA> | 1849 | 116803122008 | 0 | 310 | 0 |