Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 4056 |
Missing cells | 251 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 360.6 KiB |
Average record size in memory | 91.0 B |
Variable types
Numeric | 3 |
---|---|
Text | 5 |
Categorical | 3 |
Dataset
Description | 2021-01-05 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901053101142000 |
reference_date is highly overall correlated with skey and 2 other fields | High correlation |
gugun is highly overall correlated with skey and 4 other fields | High correlation |
skey is highly overall correlated with gugun and 1 other fields | High correlation |
lat is highly overall correlated with gugun | High correlation |
instt_code is highly overall correlated with gugun and 1 other fields | High correlation |
last_load_dttm is highly overall correlated with gugun | High correlation |
tel has 74 (1.8%) missing values | Missing |
skey has unique values | Unique |
Reproduction
Analysis started | 2024-04-16 04:51:12.162830 |
---|---|
Analysis finished | 2024-04-16 04:51:14.376945 |
Duration | 2.21 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
skey
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 4056 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50404.425 |
Minimum | 46404 |
---|---|
Maximum | 53739 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 35.8 KiB |
Quantile statistics
Minimum | 46404 |
---|---|
5-th percentile | 46606.75 |
Q1 | 48681.75 |
median | 49695.5 |
Q3 | 52725.25 |
95-th percentile | 53536.25 |
Maximum | 53739 |
Range | 7335 |
Interquartile range (IQR) | 4043.5 |
Descriptive statistics
Standard deviation | 2366.9695 |
---|---|
Coefficient of variation (CV) | 0.046959558 |
Kurtosis | -1.4163378 |
Mean | 50404.425 |
Median Absolute Deviation (MAD) | 2466 |
Skewness | -0.13487745 |
Sum | 2.0444035 × 108 |
Variance | 5602544.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
52546 | 1 | < 0.1% |
48730 | 1 | < 0.1% |
48717 | 1 | < 0.1% |
48718 | 1 | < 0.1% |
48719 | 1 | < 0.1% |
48720 | 1 | < 0.1% |
48721 | 1 | < 0.1% |
48722 | 1 | < 0.1% |
48723 | 1 | < 0.1% |
48724 | 1 | < 0.1% |
Other values (4046) | 4046 |
Value | Count | Frequency (%) |
46404 | 1 | |
46405 | 1 | |
46406 | 1 | |
46407 | 1 | |
46408 | 1 | |
46409 | 1 | |
46410 | 1 | |
46411 | 1 | |
46412 | 1 | |
46413 | 1 |
Value | Count | Frequency (%) |
53739 | 1 | |
53738 | 1 | |
53737 | 1 | |
53736 | 1 | |
53735 | 1 | |
53734 | 1 | |
53733 | 1 | |
53732 | 1 | |
53731 | 1 | |
53730 | 1 |
business_nm
Text
Distinct | 3352 |
---|---|
Distinct (%) | 83.4% |
Missing | 35 |
Missing (%) | 0.9% |
Memory size | 31.8 KiB |
Length
Max length | 28 |
---|---|
Median length | 24 |
Mean length | 7.41905 |
Min length | 2 |
Characters and Unicode
Total characters | 29832 |
---|---|
Distinct characters | 519 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 2892 ? |
---|---|
Unique (%) | 71.9% |
Sample
1st row | 주식회사위드현대 |
---|---|
2nd row | 주식회사유니온디엔시 |
3rd row | 주식회사유승 |
4th row | 주식회사자인이씨엘 |
5th row | 주식회사제이원이엔씨 |
Value | Count | Frequency (%) |
티엘엔지니어링건축사사무소(주 | 12 | 0.3% |
현대설비 | 8 | 0.2% |
주)남경엔지니어링토건 | 7 | 0.2% |
금풍건설이엔씨(주 | 7 | 0.2% |
주)중앙기건 | 6 | 0.1% |
주)우상건축디자인 | 6 | 0.1% |
주)세광 | 6 | 0.1% |
에이티건설(주 | 6 | 0.1% |
강호건설(주 | 6 | 0.1% |
동림건업(주 | 6 | 0.1% |
Other values (3343) | 3954 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 3038 | 10.2% |
( | 2622 | 8.8% |
) | 2622 | 8.8% |
설 | 1350 | 4.5% |
건 | 1284 | 4.3% |
이 | 757 | 2.5% |
사 | 605 | 2.0% |
비 | 532 | 1.8% |
지 | 471 | 1.6% |
스 | 433 | 1.5% |
Other values (509) | 16118 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 24215 | |
Open Punctuation | 2622 | 8.8% |
Close Punctuation | 2622 | 8.8% |
Uppercase Letter | 214 | 0.7% |
Other Punctuation | 59 | 0.2% |
Other Symbol | 43 | 0.1% |
Lowercase Letter | 39 | 0.1% |
Decimal Number | 14 | < 0.1% |
Space Separator | 3 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 3038 | 12.5% |
설 | 1350 | 5.6% |
건 | 1284 | 5.3% |
이 | 757 | 3.1% |
사 | 605 | 2.5% |
비 | 532 | 2.2% |
지 | 471 | 1.9% |
스 | 433 | 1.8% |
엔 | 429 | 1.8% |
업 | 404 | 1.7% |
Other values (455) | 14912 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 39 | |
E | 33 | |
N | 29 | |
S | 21 | |
C | 14 | 6.5% |
K | 11 | 5.1% |
A | 11 | 5.1% |
T | 10 | 4.7% |
D | 7 | 3.3% |
R | 6 | 2.8% |
Other values (11) | 33 |
Lowercase Letter
Value | Count | Frequency (%) |
n | 5 | |
o | 5 | |
s | 4 | |
g | 3 | |
e | 3 | |
i | 3 | |
y | 3 | |
r | 3 | |
c | 2 | 5.1% |
t | 2 | 5.1% |
Other values (4) | 6 |
Other Punctuation
Value | Count | Frequency (%) |
. | 29 | |
, | 9 | 15.3% |
, | 9 | 15.3% |
& | 5 | 8.5% |
/ | 4 | 6.8% |
· | 2 | 3.4% |
& | 1 | 1.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | |
8 | 4 | |
2 | 2 | |
5 | 1 | 7.1% |
6 | 1 | 7.1% |
3 | 1 | 7.1% |
4 | 1 | 7.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2622 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2622 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 43 |
Space Separator
Value | Count | Frequency (%) |
3 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 24258 | |
Common | 5321 | 17.8% |
Latin | 253 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 3038 | 12.5% |
설 | 1350 | 5.6% |
건 | 1284 | 5.3% |
이 | 757 | 3.1% |
사 | 605 | 2.5% |
비 | 532 | 2.2% |
지 | 471 | 1.9% |
스 | 433 | 1.8% |
엔 | 429 | 1.8% |
업 | 404 | 1.7% |
Other values (456) | 14955 |
Latin
Value | Count | Frequency (%) |
G | 39 | |
E | 33 | |
N | 29 | |
S | 21 | 8.3% |
C | 14 | 5.5% |
K | 11 | 4.3% |
A | 11 | 4.3% |
T | 10 | 4.0% |
D | 7 | 2.8% |
R | 6 | 2.4% |
Other values (25) | 72 |
Common
Value | Count | Frequency (%) |
( | 2622 | |
) | 2622 | |
. | 29 | 0.5% |
, | 9 | 0.2% |
, | 9 | 0.2% |
& | 5 | 0.1% |
1 | 4 | 0.1% |
/ | 4 | 0.1% |
8 | 4 | 0.1% |
3 | 0.1% | |
Other values (8) | 10 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 24215 | |
ASCII | 5562 | 18.6% |
None | 55 | 0.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 3038 | 12.5% |
설 | 1350 | 5.6% |
건 | 1284 | 5.3% |
이 | 757 | 3.1% |
사 | 605 | 2.5% |
비 | 532 | 2.2% |
지 | 471 | 1.9% |
스 | 433 | 1.8% |
엔 | 429 | 1.8% |
업 | 404 | 1.7% |
Other values (455) | 14912 |
ASCII
Value | Count | Frequency (%) |
( | 2622 | |
) | 2622 | |
G | 39 | 0.7% |
E | 33 | 0.6% |
. | 29 | 0.5% |
N | 29 | 0.5% |
S | 21 | 0.4% |
C | 14 | 0.3% |
K | 11 | 0.2% |
A | 11 | 0.2% |
Other values (40) | 131 | 2.4% |
None
Value | Count | Frequency (%) |
㈜ | 43 | |
, | 9 | 16.4% |
· | 2 | 3.6% |
& | 1 | 1.8% |
type_of_business
Text
Distinct | 463 |
---|---|
Distinct (%) | 11.5% |
Missing | 35 |
Missing (%) | 0.9% |
Memory size | 31.8 KiB |
Length
Max length | 100 |
---|---|
Median length | 95 |
Mean length | 11.604825 |
Min length | 4 |
Characters and Unicode
Total characters | 46663 |
---|---|
Distinct characters | 86 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 3 ? |
Unique
Unique | 329 ? |
---|---|
Unique (%) | 8.2% |
Sample
1st row | 실내건축공사업 |
---|---|
2nd row | 토공사업 |
3rd row | 포장공사업 |
4th row | 토공사업 철근ㆍ콘크리트공사업 상ㆍ하수도설비공사업 |
5th row | 포장공사업 |
Value | Count | Frequency (%) |
제2종 | 706 | 11.4% |
가스시설시공업 | 624 | 10.1% |
난방시공업 | 512 | 8.3% |
기계설비공사업 | 448 | 7.2% |
실내건축공사업 | 418 | 6.7% |
시설물유지관리업 | 408 | 6.6% |
철근ㆍ콘크리트공사업 | 361 | 5.8% |
금속구조물ㆍ창호ㆍ온실공사업 | 347 | 5.6% |
상ㆍ하수도설비공사업 | 316 | 5.1% |
제3종 | 266 | 4.3% |
Other values (69) | 1796 |
Most occurring characters
Value | Count | Frequency (%) |
업 | 5104 | 10.9% |
공 | 4622 | 9.9% |
사 | 3483 | 7.5% |
2927 | 6.3% | |
시 | 2291 | 4.9% |
설 | 2118 | 4.5% |
ㆍ | 1889 | 4.0% |
물 | 1169 | 2.5% |
종 | 1138 | 2.4% |
제 | 1138 | 2.4% |
Other values (76) | 20784 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 42008 | |
Space Separator | 2927 | 6.3% |
Decimal Number | 1138 | 2.4% |
Other Punctuation | 356 | 0.8% |
Close Punctuation | 117 | 0.3% |
Open Punctuation | 117 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 5104 | 12.2% |
공 | 4622 | 11.0% |
사 | 3483 | 8.3% |
시 | 2291 | 5.5% |
설 | 2118 | 5.0% |
ㆍ | 1889 | 4.5% |
물 | 1169 | 2.8% |
종 | 1138 | 2.7% |
제 | 1138 | 2.7% |
비 | 981 | 2.3% |
Other values (68) | 18075 |
Decimal Number
Value | Count | Frequency (%) |
2 | 711 | |
3 | 277 | 24.3% |
1 | 150 | 13.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 354 | |
. | 2 | 0.6% |
Space Separator
Value | Count | Frequency (%) |
2927 |
Close Punctuation
Value | Count | Frequency (%) |
) | 117 |
Open Punctuation
Value | Count | Frequency (%) |
( | 117 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 42008 | |
Common | 4655 | 10.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 5104 | 12.2% |
공 | 4622 | 11.0% |
사 | 3483 | 8.3% |
시 | 2291 | 5.5% |
설 | 2118 | 5.0% |
ㆍ | 1889 | 4.5% |
물 | 1169 | 2.8% |
종 | 1138 | 2.7% |
제 | 1138 | 2.7% |
비 | 981 | 2.3% |
Other values (68) | 18075 |
Common
Value | Count | Frequency (%) |
2927 | ||
2 | 711 | 15.3% |
, | 354 | 7.6% |
3 | 277 | 6.0% |
1 | 150 | 3.2% |
) | 117 | 2.5% |
( | 117 | 2.5% |
. | 2 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 40119 | |
ASCII | 4655 | 10.0% |
Compat Jamo | 1889 | 4.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
업 | 5104 | 12.7% |
공 | 4622 | 11.5% |
사 | 3483 | 8.7% |
시 | 2291 | 5.7% |
설 | 2118 | 5.3% |
물 | 1169 | 2.9% |
종 | 1138 | 2.8% |
제 | 1138 | 2.8% |
비 | 981 | 2.4% |
조 | 931 | 2.3% |
Other values (67) | 17144 |
ASCII
Value | Count | Frequency (%) |
2927 | ||
2 | 711 | 15.3% |
, | 354 | 7.6% |
3 | 277 | 6.0% |
1 | 150 | 3.2% |
) | 117 | 2.5% |
( | 117 | 2.5% |
. | 2 | < 0.1% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1889 |
addr
Text
Distinct | 3328 |
---|---|
Distinct (%) | 82.8% |
Missing | 35 |
Missing (%) | 0.9% |
Memory size | 31.8 KiB |
Length
Max length | 59 |
---|---|
Median length | 49 |
Mean length | 29.720965 |
Min length | 12 |
Characters and Unicode
Total characters | 119508 |
---|---|
Distinct characters | 433 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 2850 ? |
---|---|
Unique (%) | 70.9% |
Sample
1st row | 부산광역시 강서구 대저로 259 (대저1동) |
---|---|
2nd row | 부산광역시 강서구 화전산업대로 272-5 ,301호,한솔드림센타 (녹산동) |
3rd row | 부산광역시 강서구 신호산단1로 215 ,409호,새미래오피스빌딩 (신호동) |
4th row | 부산광역시 강서구 유통단지1로 41, 131동 215,216호(대저2동, 부산티플렉스) |
5th row | 부산광역시 강서구 식만로 69 (죽림동) |
Value | Count | Frequency (%) |
부산광역시 | 3993 | 17.5% |
해운대구 | 482 | 2.1% |
동래구 | 415 | 1.8% |
연제구 | 395 | 1.7% |
금정구 | 367 | 1.6% |
강서구 | 336 | 1.5% |
사상구 | 335 | 1.5% |
부산진구 | 299 | 1.3% |
수영구 | 291 | 1.3% |
기장군 | 276 | 1.2% |
Other values (4157) | 15653 |
Most occurring characters
Value | Count | Frequency (%) |
18826 | 15.8% | |
산 | 5080 | 4.3% |
동 | 4855 | 4.1% |
부 | 4576 | 3.8% |
1 | 4432 | 3.7% |
광 | 4273 | 3.6% |
시 | 4163 | 3.5% |
역 | 4005 | 3.4% |
구 | 3959 | 3.3% |
로 | 3882 | 3.2% |
Other values (423) | 61457 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 71069 | |
Decimal Number | 20010 | 16.7% |
Space Separator | 18826 | 15.8% |
Close Punctuation | 3508 | 2.9% |
Open Punctuation | 3508 | 2.9% |
Other Punctuation | 1684 | 1.4% |
Dash Punctuation | 749 | 0.6% |
Uppercase Letter | 145 | 0.1% |
Lowercase Letter | 7 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 5080 | 7.1% |
동 | 4855 | 6.8% |
부 | 4576 | 6.4% |
광 | 4273 | 6.0% |
시 | 4163 | 5.9% |
역 | 4005 | 5.6% |
구 | 3959 | 5.6% |
로 | 3882 | 5.5% |
대 | 1833 | 2.6% |
길 | 1803 | 2.5% |
Other values (390) | 32640 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 45 | |
C | 23 | |
B | 23 | |
E | 21 | |
P | 20 | |
T | 3 | 2.1% |
S | 2 | 1.4% |
K | 2 | 1.4% |
D | 2 | 1.4% |
O | 2 | 1.4% |
Other values (2) | 2 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 4432 | |
2 | 3075 | |
3 | 2263 | |
0 | 2126 | |
4 | 1637 | 8.2% |
5 | 1510 | 7.5% |
6 | 1393 | 7.0% |
7 | 1247 | 6.2% |
9 | 1164 | 5.8% |
8 | 1163 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1424 | |
, | 225 | 13.4% |
. | 30 | 1.8% |
/ | 4 | 0.2% |
# | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
18826 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3508 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3508 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 749 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 7 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 71069 | |
Common | 48287 | |
Latin | 152 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 5080 | 7.1% |
동 | 4855 | 6.8% |
부 | 4576 | 6.4% |
광 | 4273 | 6.0% |
시 | 4163 | 5.9% |
역 | 4005 | 5.6% |
구 | 3959 | 5.6% |
로 | 3882 | 5.5% |
대 | 1833 | 2.6% |
길 | 1803 | 2.5% |
Other values (390) | 32640 |
Common
Value | Count | Frequency (%) |
18826 | ||
1 | 4432 | 9.2% |
) | 3508 | 7.3% |
( | 3508 | 7.3% |
2 | 3075 | 6.4% |
3 | 2263 | 4.7% |
0 | 2126 | 4.4% |
4 | 1637 | 3.4% |
5 | 1510 | 3.1% |
, | 1424 | 2.9% |
Other values (10) | 5978 | 12.4% |
Latin
Value | Count | Frequency (%) |
A | 45 | |
C | 23 | |
B | 23 | |
E | 21 | |
P | 20 | |
e | 7 | 4.6% |
T | 3 | 2.0% |
S | 2 | 1.3% |
K | 2 | 1.3% |
D | 2 | 1.3% |
Other values (3) | 4 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 71069 | |
ASCII | 48214 | |
None | 225 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
18826 | ||
1 | 4432 | 9.2% |
) | 3508 | 7.3% |
( | 3508 | 7.3% |
2 | 3075 | 6.4% |
3 | 2263 | 4.7% |
0 | 2126 | 4.4% |
4 | 1637 | 3.4% |
5 | 1510 | 3.1% |
, | 1424 | 3.0% |
Other values (22) | 5905 | 12.2% |
Hangul
Value | Count | Frequency (%) |
산 | 5080 | 7.1% |
동 | 4855 | 6.8% |
부 | 4576 | 6.4% |
광 | 4273 | 6.0% |
시 | 4163 | 5.9% |
역 | 4005 | 5.6% |
구 | 3959 | 5.6% |
로 | 3882 | 5.5% |
대 | 1833 | 2.6% |
길 | 1803 | 2.5% |
Other values (390) | 32640 |
None
Value | Count | Frequency (%) |
, | 225 |
tel
Text
MISSING
 
Distinct | 3266 |
---|---|
Distinct (%) | 82.0% |
Missing | 74 |
Missing (%) | 1.8% |
Memory size | 31.8 KiB |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 12.020342 |
Min length | 11 |
Characters and Unicode
Total characters | 47865 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2787 ? |
---|---|
Unique (%) | 70.0% |
Sample
1st row | 051-529-4282 |
---|---|
2nd row | 051-941-6991 |
3rd row | 051-941-1865 |
4th row | 051-625-9353 |
5th row | 051-972-2141 |
Value | Count | Frequency (%) |
051-000-0000 | 21 | 0.5% |
051-623-3999 | 14 | 0.4% |
00-000-0000 | 12 | 0.3% |
051-740-6114 | 7 | 0.2% |
051-751-4492 | 7 | 0.2% |
051-631-1687 | 6 | 0.2% |
051-954-5800 | 6 | 0.2% |
051-501-8555 | 6 | 0.2% |
051-412-8766 | 6 | 0.2% |
051-807-8085 | 6 | 0.2% |
Other values (3256) | 3891 |
Most occurring characters
Value | Count | Frequency (%) |
- | 7966 | |
0 | 7488 | |
5 | 7252 | |
1 | 6840 | |
7 | 3056 | 6.4% |
2 | 2978 | 6.2% |
3 | 2831 | 5.9% |
8 | 2590 | 5.4% |
6 | 2515 | 5.3% |
4 | 2505 | 5.2% |
Other values (2) | 1844 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39898 | |
Dash Punctuation | 7966 | 16.6% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 7488 | |
5 | 7252 | |
1 | 6840 | |
7 | 3056 | |
2 | 2978 | 7.5% |
3 | 2831 | 7.1% |
8 | 2590 | 6.5% |
6 | 2515 | 6.3% |
4 | 2505 | 6.3% |
9 | 1843 | 4.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7966 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47865 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 7966 | |
0 | 7488 | |
5 | 7252 | |
1 | 6840 | |
7 | 3056 | 6.4% |
2 | 2978 | 6.2% |
3 | 2831 | 5.9% |
8 | 2590 | 5.4% |
6 | 2515 | 5.3% |
4 | 2505 | 5.2% |
Other values (2) | 1844 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 47865 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 7966 | |
0 | 7488 | |
5 | 7252 | |
1 | 6840 | |
7 | 3056 | 6.4% |
2 | 2978 | 6.2% |
3 | 2831 | 5.9% |
8 | 2590 | 5.4% |
6 | 2515 | 5.3% |
4 | 2505 | 5.2% |
Other values (2) | 1844 | 3.9% |
lat
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 2334 |
---|---|
Distinct (%) | 58.1% |
Missing | 36 |
Missing (%) | 0.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.971893 |
Minimum | 35.023014 |
---|---|
Maximum | 129.117 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 35.8 KiB |
Quantile statistics
Minimum | 35.023014 |
---|---|
5-th percentile | 35.093776 |
Q1 | 35.15799 |
median | 35.183172 |
Q3 | 35.213822 |
95-th percentile | 129.067 |
Maximum | 129.117 |
Range | 94.093986 |
Interquartile range (IQR) | 0.0558312 |
Descriptive statistics
Standard deviation | 20.660855 |
---|---|
Coefficient of variation (CV) | 0.51688456 |
Kurtosis | 14.68302 |
Mean | 39.971893 |
Median Absolute Deviation (MAD) | 0.02794192 |
Skewness | 4.0835748 |
Sum | 160687.01 |
Variance | 426.87091 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
35.153 | 30 | 0.7% |
35.176159 | 27 | 0.7% |
35.159 | 27 | 0.7% |
35.1766409 | 26 | 0.6% |
35.1579903 | 25 | 0.6% |
35.175162 | 24 | 0.6% |
35.161 | 23 | 0.6% |
35.16 | 21 | 0.5% |
35.146 | 21 | 0.5% |
35.173 | 20 | 0.5% |
Other values (2324) | 3776 | |
(Missing) | 36 | 0.9% |
Value | Count | Frequency (%) |
35.02301379 | 1 | |
35.03007759 | 1 | |
35.05313597 | 1 | |
35.05390152 | 1 | |
35.05474698 | 1 | |
35.05533028 | 1 | |
35.05643765 | 1 | |
35.05871353 | 1 | |
35.05911104 | 1 | |
35.059544 | 1 |
Value | Count | Frequency (%) |
129.117 | 1 | < 0.1% |
129.116 | 1 | < 0.1% |
129.115 | 4 | |
129.114 | 6 | |
129.113 | 3 | |
129.112 | 3 | |
129.111 | 2 | < 0.1% |
129.11 | 4 | |
129.109 | 5 | |
129.108 | 2 | < 0.1% |
lng
Text
Distinct | 2332 |
---|---|
Distinct (%) | 58.0% |
Missing | 36 |
Missing (%) | 0.9% |
Memory size | 31.8 KiB |
Length
Max length | 19 |
---|---|
Median length | 14 |
Mean length | 10.208458 |
Min length | 3 |
Characters and Unicode
Total characters | 41038 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1734 ? |
---|---|
Unique (%) | 43.1% |
Sample
1st row | 128.98090382 |
---|---|
2nd row | 128.88685464 |
3rd row | 128.88159778 |
4th row | 128.95570195 |
5th row | 128.90263587 |
Value | Count | Frequency (%) |
128.985 | 37 | 0.9% |
129.111 | 27 | 0.7% |
129.1258491 | 27 | 0.7% |
128.981 | 26 | 0.6% |
129.1254056 | 26 | 0.6% |
129.1475719 | 25 | 0.6% |
129.1245070 | 24 | 0.6% |
129.112 | 24 | 0.6% |
129.109 | 23 | 0.6% |
128.989 | 23 | 0.6% |
Other values (2322) | 3758 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 7276 | |
2 | 6100 | |
9 | 5854 | |
. | 4014 | |
0 | 3570 | |
8 | 3403 | |
5 | 2319 | 5.7% |
7 | 2187 | 5.3% |
3 | 2135 | 5.2% |
4 | 2116 | 5.2% |
Other values (3) | 2064 | 5.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 37022 | |
Other Punctuation | 4015 | 9.8% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 7276 | |
2 | 6100 | |
9 | 5854 | |
0 | 3570 | |
8 | 3403 | |
5 | 2319 | 6.3% |
7 | 2187 | 5.9% |
3 | 2135 | 5.8% |
4 | 2116 | 5.7% |
6 | 2062 | 5.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 4014 | |
: | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 41038 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 7276 | |
2 | 6100 | |
9 | 5854 | |
. | 4014 | |
0 | 3570 | |
8 | 3403 | |
5 | 2319 | 5.7% |
7 | 2187 | 5.3% |
3 | 2135 | 5.2% |
4 | 2116 | 5.2% |
Other values (3) | 2064 | 5.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 41038 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 7276 | |
2 | 6100 | |
9 | 5854 | |
. | 4014 | |
0 | 3570 | |
8 | 3403 | |
5 | 2319 | 5.7% |
7 | 2187 | 5.3% |
3 | 2135 | 5.2% |
4 | 2116 | 5.2% |
Other values (3) | 2064 | 5.0% |
gugun
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.8 KiB |
부산광역시 해운대구 | |
---|---|
부산광역시 동래구 | |
부산광역시 연제구 | |
부산광역시 금정구 | |
부산광역시 강서구 | |
Other values (12) |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.0115878 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 강서구 |
---|---|
2nd row | 부산광역시 강서구 |
3rd row | 부산광역시 강서구 |
4th row | 부산광역시 강서구 |
5th row | 부산광역시 강서구 |
Common Values
Value | Count | Frequency (%) |
부산광역시 해운대구 | 482 | |
부산광역시 동래구 | 415 | |
부산광역시 연제구 | 395 | |
부산광역시 금정구 | 367 | |
부산광역시 강서구 | 336 | |
부산광역시 사상구 | 335 | |
부산광역시 부산진구 | 299 | |
부산광역시 수영구 | 291 | |
부산광역시 기장군 | 276 | |
부산광역시 남구 | 205 | 5.1% |
Other values (7) | 655 |
Length
Value | Count | Frequency (%) |
부산광역시 | 4021 | |
해운대구 | 482 | 6.0% |
동래구 | 415 | 5.1% |
연제구 | 395 | 4.9% |
금정구 | 367 | 4.5% |
강서구 | 336 | 4.2% |
사상구 | 335 | 4.1% |
부산진구 | 299 | 3.7% |
수영구 | 291 | 3.6% |
기장군 | 276 | 3.4% |
Other values (8) | 860 | 10.6% |
reference_date
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.8 KiB |
2020-07-31 | |
---|---|
2020-09-03 | |
2020-07-22 | |
2020-08-31 | |
2020-08-20 | |
Other values (4) |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9469921 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-08-20 |
---|---|
2nd row | 2020-08-20 |
3rd row | 2020-08-20 |
4th row | 2020-08-20 |
5th row | 2020-08-20 |
Common Values
Value | Count | Frequency (%) |
2020-07-31 | 2056 | |
2020-09-03 | 482 | 11.9% |
2020-07-22 | 395 | 9.7% |
2020-08-31 | 367 | 9.0% |
2020-08-20 | 336 | 8.3% |
2020-08-28 | 291 | 7.2% |
2020-09-08 | 89 | 2.2% |
<NA> | 35 | 0.9% |
20. 7. 31 | 5 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-07-31 | 2056 | |
2020-09-03 | 482 | 11.9% |
2020-07-22 | 395 | 9.7% |
2020-08-31 | 367 | 9.0% |
2020-08-20 | 336 | 8.3% |
2020-08-28 | 291 | 7.2% |
2020-09-08 | 89 | 2.2% |
na | 35 | 0.9% |
20 | 5 | 0.1% |
7 | 5 | 0.1% |
instt_code
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3339627.7 |
Minimum | 3250000 |
---|---|
Maximum | 3400000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 35.8 KiB |
Quantile statistics
Minimum | 3250000 |
---|---|
5-th percentile | 3270000 |
Q1 | 3300000 |
median | 3340000 |
Q3 | 3370000 |
95-th percentile | 3400000 |
Maximum | 3400000 |
Range | 150000 |
Interquartile range (IQR) | 70000 |
Descriptive statistics
Standard deviation | 39181.319 |
---|---|
Coefficient of variation (CV) | 0.011732242 |
Kurtosis | -0.9450734 |
Mean | 3339627.7 |
Median Absolute Deviation (MAD) | 30000 |
Skewness | -0.23832165 |
Sum | 1.354553 × 1010 |
Variance | 1.5351758 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3330000 | 482 | |
3300000 | 415 | |
3370000 | 395 | |
3350000 | 367 | |
3360000 | 336 | |
3390000 | 335 | |
3290000 | 299 | |
3380000 | 291 | |
3400000 | 276 | |
3310000 | 205 | 5.1% |
Other values (6) | 655 |
Value | Count | Frequency (%) |
3250000 | 41 | 1.0% |
3260000 | 92 | 2.3% |
3270000 | 89 | 2.2% |
3280000 | 93 | 2.3% |
3290000 | 299 | |
3300000 | 415 | |
3310000 | 205 | |
3320000 | 166 | 4.1% |
3330000 | 482 | |
3340000 | 174 | 4.3% |
Value | Count | Frequency (%) |
3400000 | 276 | |
3390000 | 335 | |
3380000 | 291 | |
3370000 | 395 | |
3360000 | 336 | |
3350000 | 367 | |
3340000 | 174 | 4.3% |
3330000 | 482 | |
3320000 | 166 | 4.1% |
3310000 | 205 |
last_load_dttm
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.8 KiB |
2021-01-05 11:10:24 | |
---|---|
2021-01-05 11:10:25 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01-05 11:10:24 |
---|---|
2nd row | 2021-01-05 11:10:24 |
3rd row | 2021-01-05 11:10:24 |
4th row | 2021-01-05 11:10:24 |
5th row | 2021-01-05 11:10:24 |
Common Values
Value | Count | Frequency (%) |
2021-01-05 11:10:24 | 2267 | |
2021-01-05 11:10:25 | 1789 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-01-05 | 4056 | |
11:10:24 | 2267 | |
11:10:25 | 1789 |
skey | lat | gugun | reference_date | instt_code | last_load_dttm | |
---|---|---|---|---|---|---|
skey | 1.000 | 0.528 | 0.977 | 0.865 | 0.930 | 0.478 |
lat | 0.528 | 1.000 | 1.000 | 0.297 | 0.694 | 0.367 |
gugun | 0.977 | 1.000 | 1.000 | 0.998 | 1.000 | 0.753 |
reference_date | 0.865 | 0.297 | 0.998 | 1.000 | 0.920 | 0.413 |
instt_code | 0.930 | 0.694 | 1.000 | 0.920 | 1.000 | 0.566 |
last_load_dttm | 0.478 | 0.367 | 0.753 | 0.413 | 0.566 | 1.000 |
reference_date | last_load_dttm | gugun | |
---|---|---|---|
reference_date | 1.000 | 0.310 | 0.929 |
last_load_dttm | 0.310 | 1.000 | 0.607 |
gugun | 0.929 | 0.607 | 1.000 |
skey | lat | instt_code | gugun | reference_date | last_load_dttm | |
---|---|---|---|---|---|---|
skey | 1.000 | -0.166 | 0.461 | 0.904 | 0.660 | 0.479 |
lat | -0.166 | 1.000 | 0.116 | 0.998 | 0.223 | 0.239 |
instt_code | 0.461 | 0.116 | 1.000 | 0.999 | 0.771 | 0.449 |
gugun | 0.904 | 0.998 | 0.999 | 1.000 | 0.929 | 0.607 |
reference_date | 0.660 | 0.223 | 0.771 | 0.929 | 1.000 | 0.310 |
last_load_dttm | 0.479 | 0.239 | 0.449 | 0.607 | 0.310 | 1.000 |
skey | business_nm | type_of_business | addr | tel | lat | lng | gugun | reference_date | instt_code | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 52546 | 주식회사위드현대 | 실내건축공사업 | 부산광역시 강서구 대저로 259 (대저1동) | 051-529-4282 | 35.213666 | 128.98090382 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
1 | 52547 | 주식회사유니온디엔시 | 토공사업 | 부산광역시 강서구 화전산업대로 272-5 ,301호,한솔드림센타 (녹산동) | 051-941-6991 | 35.11294 | 128.88685464 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
2 | 52548 | 주식회사유승 | 포장공사업 | 부산광역시 강서구 신호산단1로 215 ,409호,새미래오피스빌딩 (신호동) | 051-941-1865 | 35.086287 | 128.88159778 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
3 | 52549 | 주식회사자인이씨엘 | 토공사업 철근ㆍ콘크리트공사업 상ㆍ하수도설비공사업 | 부산광역시 강서구 유통단지1로 41, 131동 215,216호(대저2동, 부산티플렉스) | 051-625-9353 | 35.167293 | 128.95570195 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
4 | 52550 | 주식회사제이원이엔씨 | 포장공사업 | 부산광역시 강서구 식만로 69 (죽림동) | 051-972-2141 | 35.20181 | 128.90263587 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
5 | 52551 | 주식회사젠 | 수중공사업 | 부산광역시 강서구 가달2로 66 (생곡동) | 051-442-1235 | 35.138501 | 128.87479808 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
6 | 52552 | 주식회사창신기계산업 | 기계설비공사업 | 부산광역시 강서구 화전산단4로30번길 25-16 (화전동) | 051-941-1290 | 35.109764 | 128.88232265 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
7 | 52553 | 주식회사태창테크 | 비계ㆍ구조물해체공사업 | 부산광역시 강서구 경전철로 208-1 (대저1동) | 051-809-3538 | 35.197207 | 128.96413493 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
8 | 52554 | 주식회사템코 | 가스시설시공업 제1종 | 부산광역시 강서구 호계로79번길 68 (죽동동) | 051-971-8511 | 35.198657 | 128.89018084 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
9 | 52555 | 주식회사플래이메카 | 조경시설물설치공사업 | 부산광역시 강서구 도도본리길 54 (대저2동) | 051-831-9091 | 35.164792 | 128.92966835 | 부산광역시 강서구 | 2020-08-20 | 3360000 | 2021-01-05 11:10:24 |
skey | business_nm | type_of_business | addr | tel | lat | lng | gugun | reference_date | instt_code | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|
4046 | 49798 | 디자인폼 | 실내건축공사업 | 부산광역시 해운대구 APEC로 55, 267호 (우동,벡스코) | 051-744-3604 | 35.16906 | 129.1360148 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4047 | 49799 | 디자인하늘 | 실내건축공사업 | 부산광역시 해운대구 센텀중앙로 48, 1807호 (우동,에이스하이테크21) | 051-702-9418 | 35.173021 | 129.1298566 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4048 | 49800 | 부경전시디자인 | 실내건축공사업 | 부산광역시 해운대구 APEC로 55 ,3층 356호 (우동, 벡스코) | 070-8804-3926 | 35.16906 | 129.1360148 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4049 | 49801 | 수석건설(주) | 실내건축공사업 | 부산광역시 해운대구 해운대로161번길 17-1 (재송동,유진빌딩) | 051-526-5001 | 35.184031 | 129.1235863 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4050 | 49802 | 와이지케이디자인 | 실내건축공사업 | 부산광역시 해운대구 센텀동로 57 (우동) 8층 806-2호 | 051-621-0071 | 35.17399 | 129.1293980 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4051 | 49803 | 장산이엔지(주) | 실내건축공사업 | 부산광역시 해운대구 재송1로32번길 29 (재송동) | 051-781-9000 | 35.186392 | 129.1227185 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4052 | 49804 | 주식회사건축사사무소환인 | 실내건축공사업 | 부산광역시 해운대구 마린시티3로 1 530호 (우동,썬프라자) | 051-742-4384 | 35.15799 | 129.1475719 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4053 | 49805 | 보명공영산업(주) | 실내건축공사업 | 부산광역시 해운대구 좌동로 152 ,245호 (좌동, 신도시시장) | 051-907-4426 | 35.174866 | 129.1811165 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4054 | 49806 | 아이엠커뮤니케이션 | 실내건축공사업 | 부산광역시 해운대구 센텀중앙로 97 A동 205,206호(재송동,센텀스카이비즈) | 051-925-0141 | 35.175162 | 129.1245070 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |
4055 | 49807 | 주식회사김목수이야기 | 실내건축공사업 | 부산광역시 해운대구 센텀동로 99, 520호(재송동, 벽산이센텀클래스원) | 051-781-4068 | 35.176159 | 129.1258491 | 부산광역시 해운대구 | 2020-09-03 | 3330000 | 2021-01-05 11:10:25 |