Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 8802 |
Missing cells | 8838 |
Missing cells (%) | 8.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 859.7 KiB |
Average record size in memory | 100.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 3 |
Categorical | 4 |
DateTime | 1 |
Dataset
Description | 업소일련번호,업소명,동명,주소,면적(㎡),전화번호,업종,품목코드,품목,가격(원),점검일자,구명 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-1169/S/1/datasetView.do |
동명 is highly overall correlated with 구명 | High correlation |
구명 is highly overall correlated with 동명 | High correlation |
품목 is highly overall correlated with 면적(㎡) and 2 other fields | High correlation |
업종 is highly overall correlated with 품목 | High correlation |
업소일련번호 is highly overall correlated with 면적(㎡) | High correlation |
면적(㎡) is highly overall correlated with 업소일련번호 and 1 other fields | High correlation |
품목코드 is highly overall correlated with 품목 | High correlation |
면적(㎡) has 7390 (84.0%) missing values | Missing |
전화번호 has 386 (4.4%) missing values | Missing |
가격(원) has 1062 (12.1%) missing values | Missing |
면적(㎡) has 481 (5.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 01:36:10.910549 |
---|---|
Analysis finished | 2024-05-11 01:36:31.017666 |
Duration | 20.11 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
업소일련번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 4086 |
---|---|
Distinct (%) | 46.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.4671362 × 1012 |
Minimum | 1.4188906 × 1012 |
---|---|
Maximum | 1.7127945 × 1012 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 77.5 KiB |
Quantile statistics
Minimum | 1.4188906 × 1012 |
---|---|
5-th percentile | 1.4188906 × 1012 |
Q1 | 1.4188907 × 1012 |
median | 1.4188907 × 1012 |
Q3 | 1.5020826 × 1012 |
95-th percentile | 1.6541446 × 1012 |
Maximum | 1.7127945 × 1012 |
Range | 2.9390388 × 1011 |
Interquartile range (IQR) | 8.3191939 × 1010 |
Descriptive statistics
Standard deviation | 8.502314 × 1010 |
---|---|
Coefficient of variation (CV) | 0.057951769 |
Kurtosis | 0.62087466 |
Mean | 1.4671362 × 1012 |
Median Absolute Deviation (MAD) | 64561.5 |
Skewness | 1.4774569 |
Sum | 1.2913733 × 1016 |
Variance | 7.2289343 × 1021 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1418890676147 | 17 | 0.2% |
1418890656533 | 16 | 0.2% |
1418890620381 | 16 | 0.2% |
1418890650962 | 16 | 0.2% |
1418890709186 | 15 | 0.2% |
1418890651601 | 14 | 0.2% |
1536900840375 | 14 | 0.2% |
1418890650090 | 14 | 0.2% |
1505380539885 | 12 | 0.1% |
1502082484104 | 12 | 0.1% |
Other values (4076) | 8656 |
Value | Count | Frequency (%) |
1418890614412 | 2 | |
1418890614413 | 2 | |
1418890614418 | 2 | |
1418890614422 | 1 | |
1418890614427 | 2 | |
1418890614428 | 2 | |
1418890614429 | 2 | |
1418890614431 | 2 | |
1418890614432 | 2 | |
1418890614496 | 1 |
Value | Count | Frequency (%) |
1712794493809 | 1 | < 0.1% |
1712283973383 | 7 | |
1712283274955 | 1 | < 0.1% |
1712282495324 | 1 | < 0.1% |
1712282427985 | 2 | < 0.1% |
1712276899269 | 1 | < 0.1% |
1712212831795 | 2 | < 0.1% |
1712210638109 | 3 | |
1712203128433 | 1 | < 0.1% |
1712194584196 | 1 | < 0.1% |
업소명
Text
Distinct | 3650 |
---|---|
Distinct (%) | 41.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
Value | Count | Frequency (%) |
김밥천국 | 88 | 0.9% |
김밥나라 | 40 | 0.4% |
김밥 | 37 | 0.4% |
크린토피아 | 36 | 0.4% |
헤어 | 34 | 0.4% |
hair | 24 | 0.2% |
미용실 | 24 | 0.2% |
피자스쿨 | 22 | 0.2% |
멸치국수 | 22 | 0.2% |
메가커피 | 20 | 0.2% |
Other values (3841) | 9333 |
Most occurring characters
Value | Count | Frequency (%) |
어 | 1254 | 2.6% |
헤 | 1207 | 2.5% |
이 | 902 | 1.9% |
미 | 882 | 1.8% |
881 | 1.8% | |
리 | 861 | 1.8% |
세 | 670 | 1.4% |
밥 | 618 | 1.3% |
탁 | 613 | 1.3% |
김 | 589 | 1.2% |
Other values (805) | 39407 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 43150 | |
Uppercase Letter | 1067 | 2.2% |
Lowercase Letter | 1048 | 2.2% |
Space Separator | 881 | 1.8% |
Decimal Number | 718 | 1.5% |
Other Punctuation | 708 | 1.5% |
Open Punctuation | 151 | 0.3% |
Close Punctuation | 151 | 0.3% |
Dash Punctuation | 8 | < 0.1% |
Other Symbol | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
어 | 1254 | 2.9% |
헤 | 1207 | 2.8% |
이 | 902 | 2.1% |
미 | 882 | 2.0% |
리 | 861 | 2.0% |
세 | 670 | 1.6% |
밥 | 618 | 1.4% |
탁 | 613 | 1.4% |
김 | 589 | 1.4% |
용 | 574 | 1.3% |
Other values (729) | 34980 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 110 | 10.3% |
B | 80 | 7.5% |
O | 64 | 6.0% |
E | 61 | 5.7% |
S | 58 | 5.4% |
H | 58 | 5.4% |
A | 58 | 5.4% |
M | 56 | 5.2% |
P | 54 | 5.1% |
K | 52 | 4.9% |
Other values (16) | 416 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 188 | |
p | 142 | |
m | 136 | |
e | 100 | |
o | 64 | 6.1% |
r | 46 | 4.4% |
i | 43 | 4.1% |
s | 40 | 3.8% |
y | 34 | 3.2% |
f | 34 | 3.2% |
Other values (14) | 221 |
Other Punctuation
Value | Count | Frequency (%) |
; | 267 | |
& | 215 | |
# | 154 | |
. | 40 | 5.6% |
, | 20 | 2.8% |
? | 3 | 0.4% |
! | 3 | 0.4% |
/ | 3 | 0.4% |
& | 2 | 0.3% |
% | 1 | 0.1% |
Decimal Number
Value | Count | Frequency (%) |
4 | 175 | |
1 | 150 | |
0 | 132 | |
2 | 98 | |
5 | 41 | 5.7% |
9 | 41 | 5.7% |
3 | 30 | 4.2% |
8 | 21 | 2.9% |
6 | 16 | 2.2% |
7 | 14 | 1.9% |
Space Separator
Value | Count | Frequency (%) |
881 |
Open Punctuation
Value | Count | Frequency (%) |
( | 151 |
Close Punctuation
Value | Count | Frequency (%) |
) | 151 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8 |
Other Symbol
Value | Count | Frequency (%) |
° | 1 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 43140 | |
Common | 2619 | 5.5% |
Latin | 2115 | 4.4% |
Han | 10 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
어 | 1254 | 2.9% |
헤 | 1207 | 2.8% |
이 | 902 | 2.1% |
미 | 882 | 2.0% |
리 | 861 | 2.0% |
세 | 670 | 1.6% |
밥 | 618 | 1.4% |
탁 | 613 | 1.4% |
김 | 589 | 1.4% |
용 | 574 | 1.3% |
Other values (724) | 34970 |
Latin
Value | Count | Frequency (%) |
a | 188 | 8.9% |
p | 142 | 6.7% |
m | 136 | 6.4% |
C | 110 | 5.2% |
e | 100 | 4.7% |
B | 80 | 3.8% |
o | 64 | 3.0% |
O | 64 | 3.0% |
E | 61 | 2.9% |
S | 58 | 2.7% |
Other values (40) | 1112 |
Common
Value | Count | Frequency (%) |
881 | ||
; | 267 | 10.2% |
& | 215 | 8.2% |
4 | 175 | 6.7% |
# | 154 | 5.9% |
( | 151 | 5.8% |
) | 151 | 5.8% |
1 | 150 | 5.7% |
0 | 132 | 5.0% |
2 | 98 | 3.7% |
Other values (16) | 245 | 9.4% |
Han
Value | Count | Frequency (%) |
日 | 2 | |
月 | 2 | |
李 | 2 | |
家 | 2 | |
美 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 43140 | |
ASCII | 4731 | 9.9% |
CJK | 8 | < 0.1% |
None | 3 | < 0.1% |
CJK Compat Ideographs | 2 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
어 | 1254 | 2.9% |
헤 | 1207 | 2.8% |
이 | 902 | 2.1% |
미 | 882 | 2.0% |
리 | 861 | 2.0% |
세 | 670 | 1.6% |
밥 | 618 | 1.4% |
탁 | 613 | 1.4% |
김 | 589 | 1.4% |
용 | 574 | 1.3% |
Other values (724) | 34970 |
ASCII
Value | Count | Frequency (%) |
881 | 18.6% | |
; | 267 | 5.6% |
& | 215 | 4.5% |
a | 188 | 4.0% |
4 | 175 | 3.7% |
# | 154 | 3.3% |
( | 151 | 3.2% |
) | 151 | 3.2% |
1 | 150 | 3.2% |
p | 142 | 3.0% |
Other values (64) | 2257 |
None
Value | Count | Frequency (%) |
& | 2 | |
° | 1 |
CJK
Value | Count | Frequency (%) |
日 | 2 | |
月 | 2 | |
家 | 2 | |
美 | 2 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 2 |
동명
Categorical
HIGH CORRELATION
 
Distinct | 46 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
미아동 | |
---|---|
수유동 | |
봉천동 | |
번동 | |
신림동 | |
Other values (41) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.8581004 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 중화동 |
---|---|
2nd row | 중화동 |
3rd row | 풍납동 |
4th row | 풍납동 |
5th row | 풍납동 |
Common Values
Value | Count | Frequency (%) |
미아동 | 1251 | |
수유동 | 1021 | 11.6% |
봉천동 | 819 | 9.3% |
번동 | 649 | 7.4% |
신림동 | 644 | 7.3% |
면목동 | 531 | 6.0% |
창동 | 394 | 4.5% |
묵동 | 325 | 3.7% |
쌍문동 | 275 | 3.1% |
방학동 | 246 | 2.8% |
Other values (36) | 2647 |
Length
Value | Count | Frequency (%) |
미아동 | 1251 | |
수유동 | 1021 | 11.6% |
봉천동 | 819 | 9.3% |
번동 | 649 | 7.4% |
신림동 | 644 | 7.3% |
면목동 | 531 | 6.0% |
창동 | 394 | 4.5% |
묵동 | 325 | 3.7% |
쌍문동 | 275 | 3.1% |
방학동 | 246 | 2.8% |
Other values (36) | 2647 |
주소
Text
Distinct | 3424 |
---|---|
Distinct (%) | 38.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
Length
Max length | 55 |
---|---|
Median length | 51 |
Mean length | 26.273347 |
Min length | 16 |
Characters and Unicode
Total characters | 231258 |
---|---|
Distinct characters | 439 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 1082 ? |
---|---|
Unique (%) | 12.3% |
Sample
1st row | 서울특별시 중랑구 동일로 752 (중화동, 중화한신아파트) |
---|---|
2nd row | 서울특별시 중랑구 동일로 752 (중화동, 중화한신아파트) |
3rd row | 서울특별시 송파구 올림픽로47길 15 (풍납동) |
4th row | 서울특별시 송파구 풍성로16길 8-1 (풍납동) |
5th row | 서울특별시 송파구 풍성로14길 7 (풍납동) |
Value | Count | Frequency (%) |
서울특별시 | 8799 | 18.9% |
강북구 | 3059 | 6.6% |
관악구 | 1463 | 3.1% |
마포구 | 1403 | 3.0% |
미아동 | 1243 | 2.7% |
도봉구 | 1091 | 2.3% |
수유동 | 1010 | 2.2% |
중랑구 | 886 | 1.9% |
봉천동 | 731 | 1.6% |
번동 | 645 | 1.4% |
Other values (2425) | 26166 |
Most occurring characters
Value | Count | Frequency (%) |
37725 | 16.3% | |
동 | 9342 | 4.0% |
서 | 8987 | 3.9% |
시 | 8936 | 3.9% |
구 | 8856 | 3.8% |
울 | 8832 | 3.8% |
( | 8832 | 3.8% |
) | 8825 | 3.8% |
특 | 8799 | 3.8% |
별 | 8799 | 3.8% |
Other values (429) | 113325 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 140183 | |
Space Separator | 37725 | 16.3% |
Decimal Number | 31228 | 13.5% |
Open Punctuation | 8836 | 3.8% |
Close Punctuation | 8829 | 3.8% |
Other Punctuation | 3236 | 1.4% |
Dash Punctuation | 779 | 0.3% |
Lowercase Letter | 223 | 0.1% |
Uppercase Letter | 215 | 0.1% |
Letter Number | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9342 | 6.7% |
서 | 8987 | 6.4% |
시 | 8936 | 6.4% |
구 | 8856 | 6.3% |
울 | 8832 | 6.3% |
특 | 8799 | 6.3% |
별 | 8799 | 6.3% |
로 | 8322 | 5.9% |
길 | 4929 | 3.5% |
봉 | 3638 | 2.6% |
Other values (368) | 60743 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 35 | |
C | 23 | |
K | 21 | |
I | 21 | |
A | 20 | |
T | 17 | |
G | 16 | |
S | 13 | 6.0% |
D | 12 | 5.6% |
M | 12 | 5.6% |
Other values (9) | 25 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 35 | |
t | 33 | |
a | 32 | |
m | 30 | |
p | 30 | |
l | 15 | |
n | 13 | 5.8% |
r | 13 | 5.8% |
g | 9 | 4.0% |
d | 5 | 2.2% |
Other values (4) | 8 | 3.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 6536 | |
2 | 4487 | |
3 | 3711 | |
4 | 3250 | |
5 | 2801 | |
7 | 2238 | 7.2% |
6 | 2191 | 7.0% |
0 | 2107 | 6.7% |
9 | 2018 | 6.5% |
8 | 1889 | 6.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2197 | |
; | 346 | 10.7% |
& | 316 | 9.8% |
# | 298 | 9.2% |
. | 45 | 1.4% |
: | 29 | 0.9% |
? | 2 | 0.1% |
@ | 2 | 0.1% |
/ | 1 | < 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 8832 | |
[ | 3 | < 0.1% |
{ | 1 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 8825 | |
] | 3 | < 0.1% |
} | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
37725 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 779 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 140183 | |
Common | 90633 | |
Latin | 442 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9342 | 6.7% |
서 | 8987 | 6.4% |
시 | 8936 | 6.4% |
구 | 8856 | 6.3% |
울 | 8832 | 6.3% |
특 | 8799 | 6.3% |
별 | 8799 | 6.3% |
로 | 8322 | 5.9% |
길 | 4929 | 3.5% |
봉 | 3638 | 2.6% |
Other values (368) | 60743 |
Latin
Value | Count | Frequency (%) |
B | 35 | 7.9% |
e | 35 | 7.9% |
t | 33 | 7.5% |
a | 32 | 7.2% |
m | 30 | 6.8% |
p | 30 | 6.8% |
C | 23 | 5.2% |
K | 21 | 4.8% |
I | 21 | 4.8% |
A | 20 | 4.5% |
Other values (24) | 162 |
Common
Value | Count | Frequency (%) |
37725 | ||
( | 8832 | 9.7% |
) | 8825 | 9.7% |
1 | 6536 | 7.2% |
2 | 4487 | 5.0% |
3 | 3711 | 4.1% |
4 | 3250 | 3.6% |
5 | 2801 | 3.1% |
7 | 2238 | 2.5% |
, | 2197 | 2.4% |
Other values (17) | 10031 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 140183 | |
ASCII | 91069 | |
Number Forms | 4 | < 0.1% |
None | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
37725 | ||
( | 8832 | 9.7% |
) | 8825 | 9.7% |
1 | 6536 | 7.2% |
2 | 4487 | 4.9% |
3 | 3711 | 4.1% |
4 | 3250 | 3.6% |
5 | 2801 | 3.1% |
7 | 2238 | 2.5% |
, | 2197 | 2.4% |
Other values (49) | 10467 | 11.5% |
Hangul
Value | Count | Frequency (%) |
동 | 9342 | 6.7% |
서 | 8987 | 6.4% |
시 | 8936 | 6.4% |
구 | 8856 | 6.3% |
울 | 8832 | 6.3% |
특 | 8799 | 6.3% |
별 | 8799 | 6.3% |
로 | 8322 | 5.9% |
길 | 4929 | 3.5% |
봉 | 3638 | 2.6% |
Other values (368) | 60743 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 4 |
None
Value | Count | Frequency (%) |
? | 2 |
면적(㎡)
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 158 |
---|---|
Distinct (%) | 11.2% |
Missing | 7390 |
Missing (%) | 84.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.246565 |
Minimum | 0 |
---|---|
Maximum | 3788 |
Zeros | 481 |
Zeros (%) | 5.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 77.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 20 |
Q3 | 33 |
95-th percentile | 114 |
Maximum | 3788 |
Range | 3788 |
Interquartile range (IQR) | 33 |
Descriptive statistics
Standard deviation | 163.244 |
---|---|
Coefficient of variation (CV) | 4.1594468 |
Kurtosis | 400.58916 |
Mean | 39.246565 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 18.382483 |
Sum | 55416.15 |
Variance | 26648.604 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 481 | 5.5% |
30.0 | 119 | 1.4% |
20.0 | 60 | 0.7% |
33.0 | 46 | 0.5% |
15.0 | 45 | 0.5% |
26.0 | 32 | 0.4% |
18.0 | 28 | 0.3% |
24.0 | 26 | 0.3% |
21.0 | 25 | 0.3% |
40.0 | 23 | 0.3% |
Other values (148) | 527 | 6.0% |
(Missing) | 7390 |
Value | Count | Frequency (%) |
0.0 | 481 | |
3.0 | 2 | < 0.1% |
4.0 | 3 | < 0.1% |
5.0 | 2 | < 0.1% |
6.0 | 3 | < 0.1% |
7.0 | 2 | < 0.1% |
8.0 | 5 | 0.1% |
9.0 | 3 | < 0.1% |
10.0 | 23 | 0.3% |
11.0 | 9 | 0.1% |
Value | Count | Frequency (%) |
3788.0 | 2 | |
1296.0 | 1 | |
1268.0 | 1 | |
1057.0 | 1 | |
931.0 | 1 | |
900.0 | 2 | |
495.0 | 1 | |
452.0 | 1 | |
336.6 | 1 | |
330.0 | 1 |
전화번호
Text
MISSING
 
Distinct | 3729 |
---|---|
Distinct (%) | 44.3% |
Missing | 386 |
Missing (%) | 4.4% |
Memory size | 68.9 KiB |
Length
Max length | 14 |
---|---|
Median length | 11 |
Mean length | 11.182391 |
Min length | 2 |
Characters and Unicode
Total characters | 94111 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1390 ? |
---|---|
Unique (%) | 16.5% |
Sample
1st row | 02-482-9064 |
---|---|
2nd row | 02-488-8375 |
3rd row | 02-475-3158 |
4th row | 02-482-6568 |
5th row | 02-482-6568 |
Value | Count | Frequency (%) |
02-0000-0000 | 143 | 1.7% |
02-988-6039 | 17 | 0.2% |
02-988-4005 | 16 | 0.2% |
02-900-9760 | 16 | 0.2% |
02-990-1911 | 16 | 0.2% |
02-987-7066 | 16 | 0.2% |
02-985-6048 | 15 | 0.2% |
02-989-5905 | 14 | 0.2% |
02-981-1708 | 14 | 0.2% |
02-989-3392 | 14 | 0.2% |
Other values (3719) | 8135 |
Most occurring characters
Value | Count | Frequency (%) |
- | 16820 | |
0 | 16116 | |
2 | 13041 | |
9 | 9465 | |
8 | 7515 | |
3 | 6292 | 6.7% |
7 | 5835 | 6.2% |
5 | 5146 | 5.5% |
4 | 5062 | 5.4% |
1 | 4733 | 5.0% |
Other values (3) | 4086 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 77287 | |
Dash Punctuation | 16820 | 17.9% |
Other Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 16116 | |
2 | 13041 | |
9 | 9465 | |
8 | 7515 | |
3 | 6292 | 8.1% |
7 | 5835 | 7.5% |
5 | 5146 | 6.7% |
4 | 5062 | 6.5% |
1 | 4733 | 6.1% |
6 | 4082 | 5.3% |
Other Letter
Value | Count | Frequency (%) |
없 | 2 | |
음 | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16820 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 94107 | |
Hangul | 4 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 16820 | |
0 | 16116 | |
2 | 13041 | |
9 | 9465 | |
8 | 7515 | |
3 | 6292 | 6.7% |
7 | 5835 | 6.2% |
5 | 5146 | 5.5% |
4 | 5062 | 5.4% |
1 | 4733 | 5.0% |
Hangul
Value | Count | Frequency (%) |
없 | 2 | |
음 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 94107 | |
Hangul | 4 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 16820 | |
0 | 16116 | |
2 | 13041 | |
9 | 9465 | |
8 | 7515 | |
3 | 6292 | 6.7% |
7 | 5835 | 6.2% |
5 | 5146 | 5.5% |
4 | 5062 | 5.4% |
1 | 4733 | 5.0% |
Hangul
Value | Count | Frequency (%) |
없 | 2 | |
음 | 2 |
업종
Categorical
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
한식 | |
---|---|
미용업 | |
기타서비스 | |
다방업 | |
세탁업 | |
Other values (9) |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 2.9401272 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 미용업 |
---|---|
2nd row | 미용업 |
3rd row | 이용업 |
4th row | 세탁업 |
5th row | 이용업 |
Common Values
Value | Count | Frequency (%) |
한식 | 2860 | |
미용업 | 2136 | |
기타서비스 | 977 | 11.1% |
다방업 | 619 | 7.0% |
세탁업 | 524 | 6.0% |
기타음식업 | 489 | 5.6% |
중식 | 483 | 5.5% |
이용업 | 186 | 2.1% |
경양식 | 176 | 2.0% |
숙박업 | 165 | 1.9% |
Other values (4) | 187 | 2.1% |
Length
Value | Count | Frequency (%) |
한식 | 2860 | |
미용업 | 2136 | |
기타서비스 | 977 | 11.1% |
다방업 | 619 | 7.0% |
세탁업 | 524 | 6.0% |
기타음식업 | 489 | 5.6% |
중식 | 483 | 5.5% |
이용업 | 186 | 2.1% |
경양식 | 176 | 2.0% |
숙박업 | 165 | 1.9% |
Other values (4) | 187 | 2.1% |
품목코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 48 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.4188938 × 1012 |
Minimum | 1.4186228 × 1012 |
---|---|
Maximum | 1.5325825 × 1012 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 77.5 KiB |
Quantile statistics
Minimum | 1.4186228 × 1012 |
---|---|
5-th percentile | 1.4186228 × 1012 |
Q1 | 1.4186228 × 1012 |
median | 1.4186228 × 1012 |
Q3 | 1.4186228 × 1012 |
95-th percentile | 1.4186228 × 1012 |
Maximum | 1.5325825 × 1012 |
Range | 1.139597 × 1011 |
Interquartile range (IQR) | 19 |
Descriptive statistics
Standard deviation | 4.1450172 × 109 |
---|---|
Coefficient of variation (CV) | 0.0029213018 |
Kurtosis | 287.61612 |
Mean | 1.4188938 × 1012 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 16.244856 |
Sum | 1.2489104 × 1016 |
Variance | 1.7181168 × 1019 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1418622761403 | 1072 | 12.2% |
1418622761402 | 1064 | 12.1% |
1418622761392 | 487 | 5.5% |
1418622761397 | 427 | 4.9% |
1418622761384 | 403 | 4.6% |
1418622761381 | 394 | 4.5% |
1418622761373 | 389 | 4.4% |
1418622761411 | 381 | 4.3% |
1418622761385 | 356 | 4.0% |
1418622761372 | 351 | 4.0% |
Other values (38) | 3478 |
Value | Count | Frequency (%) |
1418622761370 | 121 | 1.4% |
1418622761371 | 79 | 0.9% |
1418622761372 | 351 | |
1418622761373 | 389 | |
1418622761374 | 181 | |
1418622761375 | 163 | |
1418622761376 | 162 | |
1418622761378 | 29 | 0.3% |
1418622761379 | 57 | 0.6% |
1418622761381 | 394 |
Value | Count | Frequency (%) |
1532582458411 | 2 | < 0.1% |
1476951899274 | 37 | 0.4% |
1418622761427 | 1 | < 0.1% |
1418622761426 | 39 | 0.4% |
1418622761425 | 12 | 0.1% |
1418622761423 | 2 | < 0.1% |
1418622761422 | 105 | |
1418622761421 | 28 | 0.3% |
1418622761420 | 176 | |
1418622761419 | 158 |
품목
Categorical
HIGH CORRELATION
 
Distinct | 47 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
미용료 (커트) | |
---|---|
미용료 (파마) | |
양복 세탁료 | 487 |
의복수선료 | 431 |
커피(외식) | 427 |
Other values (42) |
Length
Max length | 10 |
---|---|
Median length | 8 |
Mean length | 5.7359691 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 미용료 (파마) |
---|---|
2nd row | 미용료 (커트) |
3rd row | 이용료(커트) |
4th row | 양복 세탁료 |
5th row | 이용료(커트) |
Common Values
Value | Count | Frequency (%) |
미용료 (커트) | 1072 | 12.2% |
미용료 (파마) | 1064 | 12.1% |
양복 세탁료 | 487 | 5.5% |
의복수선료 | 431 | 4.9% |
커피(외식) | 427 | 4.9% |
김치찌개 백반 | 403 | 4.6% |
삼겹살 | 389 | 4.4% |
냉면(물) | 381 | 4.3% |
치킨 | 356 | 4.0% |
된장찌개 백반 | 351 | 4.0% |
Other values (37) | 3441 |
Length
Value | Count | Frequency (%) |
미용료 | 2136 | 16.3% |
커트 | 1072 | 8.2% |
파마 | 1064 | 8.1% |
백반 | 754 | 5.8% |
양복 | 487 | 3.7% |
세탁료 | 487 | 3.7% |
의복수선료 | 431 | 3.3% |
커피(외식 | 427 | 3.3% |
이용료 | 412 | 3.1% |
김치찌개 | 403 | 3.1% |
Other values (47) | 5432 |
가격(원)
Real number (ℝ)
MISSING
 
Distinct | 220 |
---|---|
Distinct (%) | 2.8% |
Missing | 1062 |
Missing (%) | 12.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16132.522 |
Minimum | 0 |
---|---|
Maximum | 250000 |
Zeros | 5 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 77.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3000 |
Q1 | 7000 |
median | 10000 |
Q3 | 18000 |
95-th percentile | 45000 |
Maximum | 250000 |
Range | 250000 |
Interquartile range (IQR) | 11000 |
Descriptive statistics
Standard deviation | 19379.433 |
---|---|
Coefficient of variation (CV) | 1.2012649 |
Kurtosis | 31.329789 |
Mean | 16132.522 |
Median Absolute Deviation (MAD) | 5000 |
Skewness | 4.6004239 |
Sum | 1.2486572 × 108 |
Variance | 3.7556241 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8000 | 789 | 9.0% |
7000 | 594 | 6.7% |
15000 | 426 | 4.8% |
10000 | 401 | 4.6% |
9000 | 385 | 4.4% |
4000 | 365 | 4.1% |
30000 | 335 | 3.8% |
6000 | 281 | 3.2% |
12000 | 277 | 3.1% |
20000 | 257 | 2.9% |
Other values (210) | 3630 | |
(Missing) | 1062 | 12.1% |
Value | Count | Frequency (%) |
0 | 5 | 0.1% |
300 | 4 | < 0.1% |
350 | 1 | < 0.1% |
400 | 3 | < 0.1% |
450 | 1 | < 0.1% |
500 | 24 | |
573 | 1 | < 0.1% |
600 | 5 | 0.1% |
610 | 1 | < 0.1% |
615 | 1 | < 0.1% |
Value | Count | Frequency (%) |
250000 | 1 | < 0.1% |
230000 | 1 | < 0.1% |
225634 | 1 | < 0.1% |
222743 | 1 | < 0.1% |
200000 | 5 | |
199757 | 1 | < 0.1% |
192351 | 1 | < 0.1% |
190000 | 2 | < 0.1% |
184266 | 1 | < 0.1% |
180412 | 1 | < 0.1% |
점검일자
Date
Distinct | 2703 |
---|---|
Distinct (%) | 30.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
Minimum | 2024-04-11 00:00:00 |
---|---|
Maximum | 2024-05-10 14:00:24 |
구명
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 68.9 KiB |
강북구 | |
---|---|
관악구 | |
마포구 | |
도봉구 | |
중랑구 | |
Other values (2) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 중랑구 |
---|---|
2nd row | 중랑구 |
3rd row | 송파구 |
4th row | 송파구 |
5th row | 송파구 |
Common Values
Value | Count | Frequency (%) |
강북구 | 3059 | |
관악구 | 1463 | |
마포구 | 1403 | |
도봉구 | 1091 | 12.4% |
중랑구 | 886 | 10.1% |
노원구 | 483 | 5.5% |
송파구 | 417 | 4.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
강북구 | 3059 | |
관악구 | 1463 | |
마포구 | 1403 | |
도봉구 | 1091 | 12.4% |
중랑구 | 886 | 10.1% |
노원구 | 483 | 5.5% |
송파구 | 417 | 4.7% |
업소일련번호 | 동명 | 면적(㎡) | 업종 | 품목코드 | 품목 | 가격(원) | 구명 | |
---|---|---|---|---|---|---|---|---|
업소일련번호 | 1.000 | 0.665 | 0.000 | 0.258 | 0.151 | 0.322 | 0.131 | 0.537 |
동명 | 0.665 | 1.000 | 0.000 | 0.358 | 0.120 | 0.372 | 0.519 | 1.000 |
면적(㎡) | 0.000 | 0.000 | 1.000 | 0.417 | 0.000 | 0.890 | 0.324 | 0.000 |
업종 | 0.258 | 0.358 | 0.417 | 1.000 | 0.391 | 1.000 | 0.436 | 0.333 |
품목코드 | 0.151 | 0.120 | 0.000 | 0.391 | 1.000 | 0.904 | 0.000 | 0.068 |
품목 | 0.322 | 0.372 | 0.890 | 1.000 | 0.904 | 1.000 | 0.765 | 0.383 |
가격(원) | 0.131 | 0.519 | 0.324 | 0.436 | 0.000 | 0.765 | 1.000 | 0.140 |
구명 | 0.537 | 1.000 | 0.000 | 0.333 | 0.068 | 0.383 | 0.140 | 1.000 |
동명 | 구명 | 품목 | 업종 | |
---|---|---|---|---|
동명 | 1.000 | 0.998 | 0.073 | 0.112 |
구명 | 0.998 | 1.000 | 0.164 | 0.129 |
품목 | 0.073 | 0.164 | 1.000 | 0.994 |
업종 | 0.112 | 0.129 | 0.994 | 1.000 |
업소일련번호 | 면적(㎡) | 품목코드 | 가격(원) | 동명 | 업종 | 품목 | 구명 | |
---|---|---|---|---|---|---|---|---|
업소일련번호 | 1.000 | -0.666 | 0.043 | -0.015 | 0.295 | 0.107 | 0.116 | 0.308 |
면적(㎡) | -0.666 | 1.000 | -0.061 | 0.077 | 0.000 | 0.241 | 0.609 | 0.000 |
품목코드 | 0.043 | -0.061 | 1.000 | 0.089 | 0.058 | 0.237 | 0.732 | 0.045 |
가격(원) | -0.015 | 0.077 | 0.089 | 1.000 | 0.205 | 0.192 | 0.383 | 0.071 |
동명 | 0.295 | 0.000 | 0.058 | 0.205 | 1.000 | 0.112 | 0.073 | 0.998 |
업종 | 0.107 | 0.241 | 0.237 | 0.192 | 0.112 | 1.000 | 0.994 | 0.129 |
품목 | 0.116 | 0.609 | 0.732 | 0.383 | 0.073 | 0.994 | 1.000 | 0.164 |
구명 | 0.308 | 0.000 | 0.045 | 0.071 | 0.998 | 0.129 | 0.164 | 1.000 |
업소일련번호 | 업소명 | 동명 | 주소 | 면적(㎡) | 전화번호 | 업종 | 품목코드 | 품목 | 가격(원) | 점검일자 | 구명 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1617926393629 | 이수오미용실 | 중화동 | 서울특별시 중랑구 동일로 752 (중화동, 중화한신아파트) | <NA> | <NA> | 미용업 | 1418622761402 | 미용료 (파마) | 45000 | 2024-05-10 14:00:24.0 | 중랑구 |
1 | 1617926393629 | 이수오미용실 | 중화동 | 서울특별시 중랑구 동일로 752 (중화동, 중화한신아파트) | <NA> | <NA> | 미용업 | 1418622761403 | 미용료 (커트) | 14000 | 2024-05-10 14:00:24.0 | 중랑구 |
2 | 1418890619560 | 남성이발관 | 풍납동 | 서울특별시 송파구 올림픽로47길 15 (풍납동) | <NA> | 02-482-9064 | 이용업 | 1418622761401 | 이용료(커트) | 10000 | 2024-05-10 09:35:05.0 | 송파구 |
3 | 1418890732538 | 현대세탁 | 풍납동 | 서울특별시 송파구 풍성로16길 8-1 (풍납동) | <NA> | 02-488-8375 | 세탁업 | 1418622761392 | 양복 세탁료 | 10000 | 2024-05-10 09:25:53.0 | 송파구 |
4 | 1418890759507 | 명진이발관 | 풍납동 | 서울특별시 송파구 풍성로14길 7 (풍납동) | <NA> | 02-475-3158 | 이용업 | 1418622761401 | 이용료(커트) | 10000 | 2024-05-10 09:25:00.0 | 송파구 |
5 | 1418890674401 | 크로바건강랜드 | 풍납동 | 서울특별시 송파구 풍성로 52 (풍납동, 대아아파트) | <NA> | 02-482-6568 | 목욕업 | 1418622761404 | 목욕료 (성인) | 10000 | 2024-05-10 09:23:09.0 | 송파구 |
6 | 1418890674401 | 크로바건강랜드 | 풍납동 | 서울특별시 송파구 풍성로 52 (풍납동, 대아아파트) | <NA> | 02-482-6568 | 목욕업 | 1418622761425 | 찜질방이용료 | 12000 | 2024-05-10 09:23:09.0 | 송파구 |
7 | 1418890655689 | 영헤어라인 | 풍납동 | 서울특별시 송파구 풍성로 38-1 (풍납동) | <NA> | 02-483-8319 | 미용업 | 1418622761403 | 미용료 (커트) | 15000 | 2024-05-10 09:22:39.0 | 송파구 |
8 | 1418890655689 | 영헤어라인 | 풍납동 | 서울특별시 송파구 풍성로 38-1 (풍납동) | <NA> | 02-483-8319 | 미용업 | 1418622761402 | 미용료 (파마) | 40000 | 2024-05-10 09:22:39.0 | 송파구 |
9 | 1418890728876 | 진성원 | 풍납동 | 서울특별시 송파구 풍성로 34 (풍납동) | <NA> | 02-484-6463 | 중식 | 1418622761419 | 짬뽕 | 8000 | 2024-05-10 09:21:53.0 | 송파구 |
업소일련번호 | 업소명 | 동명 | 주소 | 면적(㎡) | 전화번호 | 업종 | 품목코드 | 품목 | 가격(원) | 점검일자 | 구명 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
8792 | 1418890635289 | 일광빨래방세탁 | 방이동 | 서울특별시 송파구 오금로11길 30-10 (방이동) | <NA> | 02-415-0169 | 세탁업 | 1418622761392 | 양복 세탁료 | <NA> | 2024-04-11 09:54:22.0 | 송파구 |
8793 | 1418890674862 | 유엔아이 | 방이동 | 서울특별시 송파구 오금로11길 29-19 (방이동) | <NA> | 02-420-1981 | 숙박업 | 1418622761406 | 숙박료 (여관) | <NA> | 2024-04-11 09:53:55.0 | 송파구 |
8794 | 1418890730230 | 호텔트라움 | 방이동 | 서울특별시 송파구 오금로11길 21-19 (방이동) | <NA> | 02-423-1170 | 숙박업 | 1418622761406 | 숙박료 (여관) | <NA> | 2024-04-11 09:49:17.0 | 송파구 |
8795 | 1418890728299 | 버디호프 | 방이동 | 서울특별시 송파구 오금로11길 16 (방이동) | <NA> | 02-416-3849 | 기타음식업 | 1418622761385 | 치킨 | <NA> | 2024-04-11 09:48:35.0 | 송파구 |
8796 | 1418890749808 | 지상헤어갤러리 | 성산동 | 서울특별시 마포구 성미산로 92 (성산동, 의집빌딩) | <NA> | 02-325-7338 | 미용업 | 1418622761403 | 미용료 (커트) | 17000 | 2024-04-11 00:00:00.0 | 마포구 |
8797 | 1418890661036 | 백청사 | 성산동 | 서울특별시 마포구 성미산로10길 15 (성산동) | 15.0 | 02-336-0763 | 세탁업 | 1418622761392 | 양복 세탁료 | 10000 | 2024-04-11 00:00:00.0 | 마포구 |
8798 | 1418890615804 | 동일이발관(착한가격업소) | 성산동 | 서울특별시 마포구 성미산로1길 30 (성산동) | <NA> | 02-333-0771 | 이용업 | 1418622761401 | 이용료(커트) | 12000 | 2024-04-11 00:00:00.0 | 마포구 |
8799 | 1418890749808 | 지상헤어갤러리 | 성산동 | 서울특별시 마포구 성미산로 92 (성산동, 의집빌딩) | <NA> | 02-325-7338 | 미용업 | 1418622761402 | 미용료 (파마) | 40000 | 2024-04-11 00:00:00.0 | 마포구 |
8800 | 1418890661036 | 백청사 | 성산동 | 서울특별시 마포구 성미산로10길 15 (성산동) | 15.0 | 02-336-0763 | 기타서비스 | 1418622761381 | 의복수선료 | 5000 | 2024-04-11 00:00:00.0 | 마포구 |
8801 | 1418890657803 | 우리노래방 | 성산동 | 서울특별시 마포구 성미산로 85 (성산동) | 99.0 | 02-322-8379 | 기타서비스 | 1418622761391 | 노래방 이용료 | 25000 | 2024-04-11 00:00:00.0 | 마포구 |