Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 13647 |
Missing cells (%) | 10.5% |
Duplicate rows | 3 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 1.1 MiB |
Average record size in memory | 115.0 B |
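The percentage rows in this table follow from the counts. As a quick arithmetic check (a sketch that uses only the figures reported above; the variable names are illustrative), the missing-cell share is the number of missing cells divided by variables times observations:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 10_000
missing_cells = 13_647
duplicate_rows = 3

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.2f}%")
```

Both results agree with the table: about 10.5% missing cells and a duplicate share below 0.1%.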
Variable types
Categorical | 7 |
---|---|
Text | 5 |
Numeric | 1 |
Dataset
Description | 2021-01-06 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901062201148000 |
lastupdtdt has constant value "2021-01-02" | Constant |
Dataset has 3 (< 0.1%) duplicate rows | Duplicates |
ofcpssecodenm is highly overall correlated with ofcpssecode | High correlation |
ldcodenm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcode is highly overall correlated with brkrasortcodenm | High correlation |
ofcpssecode is highly overall correlated with ofcpssecodenm | High correlation |
last_load_dttm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcodenm is highly overall correlated with brkrasortcode | High correlation |
ldcode is highly overall correlated with ldcodenm and 1 other fields | High correlation |
bsnmcmpnm has 2721 (27.2%) missing values | Missing |
crqfcacqdt has 4152 (41.5%) missing values | Missing |
crqfcno has 4053 (40.5%) missing values | Missing |
jurirno has 2721 (27.2%) missing values | Missing |
Reproduction
Analysis started | 2024-04-16 10:25:55.305514 |
---|---|
Analysis finished | 2024-04-16 10:25:56.781231 |
Duration | 1.48 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
brkrasortcode
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | 6560 |
---|---|
4 | 3236 |
1 | 202 |
3 | 2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
2 | 6560 | 65.6% |
4 | 3236 | 32.4% |
1 | 202 | 2.0% |
3 | 2 | < 0.1% |
brkrasortcodenm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
공인중개사 | 6560 |
---|---|
중개보조원 | 3236 |
중개인 | 202 |
법인 | 2 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.959 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 중개보조원 |
---|---|
2nd row | 공인중개사 |
3rd row | 공인중개사 |
4th row | 공인중개사 |
5th row | 중개보조원 |
Common Values
Value | Count | Frequency (%) |
공인중개사 | 6560 | 65.6% |
중개보조원 | 3236 | 32.4% |
중개인 | 202 | 2.0% |
법인 | 2 | < 0.1% |
brkrnm
Text
Distinct | 7906 |
---|---|
Distinct (%) | 79.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
김정희 | 18 | 0.2% |
김영희 | 16 | 0.2% |
김경희 | 12 | 0.1% |
이정희 | 11 | 0.1% |
박정희 | 10 | 0.1% |
김현주 | 10 | 0.1% |
김은희 | 10 | 0.1% |
이경숙 | 10 | 0.1% |
김정숙 | 9 | 0.1% |
김미경 | 9 | 0.1% |
Other values (7901) | 9890 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 2201 | 7.3% |
이 | 1512 | 5.0% |
정 | 1315 | 4.4% |
영 | 994 | 3.3% |
박 | 889 | 2.9% |
희 | 702 | 2.3% |
경 | 623 | 2.1% |
현 | 557 | 1.8% |
숙 | 532 | 1.8% |
미 | 528 | 1.8% |
Other values (386) | 20310 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30047 | |
Open Punctuation | 43 | 0.1% |
Close Punctuation | 43 | 0.1% |
Uppercase Letter | 19 | 0.1% |
Space Separator | 6 | < 0.1% |
Lowercase Letter | 5 | < 0.1% |
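The category names in this table correspond to Unicode general categories: "Other Letter" is `Lo` (Hangul syllables and Han ideographs), "Open Punctuation" is `Ps`, and "Close Punctuation" is `Pe`. The sketch below shows one way such counts could be reproduced with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the two input names are taken from values that appear in this report.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two broker names that appear in the report's tables.
counts = category_counts(["임수경(任壽京)", "김정희"])
for category, count in counts.most_common():
    print(category, count)
```

Names that carry a Hanja spelling in parentheses would explain why the open and close punctuation counts are equal.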
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 2201 | 7.3% |
이 | 1512 | 5.0% |
정 | 1315 | 4.4% |
영 | 994 | 3.3% |
박 | 889 | 3.0% |
희 | 702 | 2.3% |
경 | 623 | 2.1% |
현 | 557 | 1.9% |
숙 | 532 | 1.8% |
미 | 528 | 1.8% |
Other values (367) | 20194 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 4 | |
I | 4 | |
A | 3 | |
B | 1 | 5.3% |
Z | 1 | 5.3% |
H | 1 | 5.3% |
W | 1 | 5.3% |
E | 1 | 5.3% |
Y | 1 | 5.3% |
T | 1 | 5.3% |
Lowercase Letter
Value | Count | Frequency (%) |
y | 1 | |
m | 1 | |
k | 1 | |
i | 1 | |
a | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 43 |
Close Punctuation
Value | Count | Frequency (%) |
) | 43 |
Space Separator
Value | Count | Frequency (%) |
6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29919 | |
Han | 128 | 0.4% |
Common | 92 | 0.3% |
Latin | 24 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 2201 | 7.4% |
이 | 1512 | 5.1% |
정 | 1315 | 4.4% |
영 | 994 | 3.3% |
박 | 889 | 3.0% |
희 | 702 | 2.3% |
경 | 623 | 2.1% |
현 | 557 | 1.9% |
숙 | 532 | 1.8% |
미 | 528 | 1.8% |
Other values (283) | 20066 |
Han
Value | Count | Frequency (%) |
金 | 12 | 9.4% |
李 | 11 | 8.6% |
崔 | 4 | 3.1% |
慶 | 4 | 3.1% |
惠 | 3 | 2.3% |
順 | 3 | 2.3% |
子 | 3 | 2.3% |
美 | 2 | 1.6% |
朴 | 2 | 1.6% |
鄭 | 2 | 1.6% |
Other values (74) | 82 |
Latin
Value | Count | Frequency (%) |
N | 4 | |
I | 4 | |
A | 3 | |
B | 1 | 4.2% |
Z | 1 | 4.2% |
H | 1 | 4.2% |
W | 1 | 4.2% |
y | 1 | 4.2% |
m | 1 | 4.2% |
E | 1 | 4.2% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
( | 43 | |
) | 43 | |
6 | 6.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29918 | |
CJK | 117 | 0.4% |
ASCII | 116 | 0.4% |
CJK Compat Ideographs | 11 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 2201 | 7.4% |
이 | 1512 | 5.1% |
정 | 1315 | 4.4% |
영 | 994 | 3.3% |
박 | 889 | 3.0% |
희 | 702 | 2.3% |
경 | 623 | 2.1% |
현 | 557 | 1.9% |
숙 | 532 | 1.8% |
미 | 528 | 1.8% |
Other values (282) | 20065 |
ASCII
Value | Count | Frequency (%) |
( | 43 | |
) | 43 | |
6 | 5.2% | |
N | 4 | 3.4% |
I | 4 | 3.4% |
A | 3 | 2.6% |
B | 1 | 0.9% |
Z | 1 | 0.9% |
H | 1 | 0.9% |
W | 1 | 0.9% |
Other values (9) | 9 | 7.8% |
CJK
Value | Count | Frequency (%) |
金 | 12 | 10.3% |
崔 | 4 | 3.4% |
慶 | 4 | 3.4% |
惠 | 3 | 2.6% |
順 | 3 | 2.6% |
子 | 3 | 2.6% |
美 | 2 | 1.7% |
朴 | 2 | 1.7% |
鄭 | 2 | 1.7% |
泰 | 2 | 1.7% |
Other values (73) | 80 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 11 |
Compat Jamo
Value | Count | Frequency (%) |
ㅂ | 1 |
bsnmcmpnm
Text
MISSING
 
Distinct | 3309 |
---|---|
Distinct (%) | 45.5% |
Missing | 2721 |
Missing (%) | 27.2% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 22 |
Mean length | 11.248798 |
Min length | 4 |
Characters and Unicode
Total characters | 81880 |
---|---|
Distinct characters | 580 |
Distinct categories | 12 |
Distinct scripts | 4 |
Distinct blocks | 6 |
Unique
Unique | 2029 |
---|---|
Unique (%) | 27.9% |
Sample
1st row | 정인공인중개사사무소 |
---|---|
2nd row | 골드공인중개사사무소 |
3rd row | 삼보공인중개사사무소 |
4th row | 유한회사맥비스타부동산중개법인 |
5th row | 신세계공인중개사사무소 |
Value | Count | Frequency (%) |
주식회사 | 90 | 1.2% |
공인중개사사무소 | 80 | 1.1% |
사무소 | 61 | 0.8% |
조은공인중개사사무소 | 51 | 0.7% |
주)부동산중개법인개벽 | 40 | 0.5% |
삼성공인중개사사무소 | 37 | 0.5% |
현대공인중개사사무소 | 37 | 0.5% |
삼오부동산중개법인 | 33 | 0.4% |
태양공인중개사사무소 | 31 | 0.4% |
주)온나라부동산중개법인 | 31 | 0.4% |
Other values (3302) | 7081 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 12599 | |
개 | 7326 | 8.9% |
중 | 7302 | 8.9% |
소 | 6583 | 8.0% |
무 | 6532 | 8.0% |
인 | 6232 | 7.6% |
공 | 5820 | 7.1% |
동 | 3029 | 3.7% |
부 | 2765 | 3.4% |
산 | 2746 | 3.4% |
Other values (570) | 20946 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 79922 | |
Uppercase Letter | 734 | 0.9% |
Space Separator | 377 | 0.5% |
Decimal Number | 297 | 0.4% |
Open Punctuation | 189 | 0.2% |
Close Punctuation | 189 | 0.2% |
Lowercase Letter | 142 | 0.2% |
Other Punctuation | 22 | < 0.1% |
Dash Punctuation | 4 | < 0.1% |
Letter Number | 2 | < 0.1% |
Other values (2) | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 12599 | |
개 | 7326 | 9.2% |
중 | 7302 | 9.1% |
소 | 6583 | 8.2% |
무 | 6532 | 8.2% |
인 | 6232 | 7.8% |
공 | 5820 | 7.3% |
동 | 3029 | 3.8% |
부 | 2765 | 3.5% |
산 | 2746 | 3.4% |
Other values (507) | 18988 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 128 | |
S | 91 | |
L | 61 | 8.3% |
T | 57 | 7.8% |
B | 46 | 6.3% |
H | 42 | 5.7% |
W | 42 | 5.7% |
O | 35 | 4.8% |
C | 33 | 4.5% |
E | 23 | 3.1% |
Other values (14) | 176 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 60 | |
h | 20 | 14.1% |
t | 13 | 9.2% |
c | 10 | 7.0% |
k | 9 | 6.3% |
s | 8 | 5.6% |
w | 5 | 3.5% |
o | 3 | 2.1% |
i | 3 | 2.1% |
l | 2 | 1.4% |
Other values (6) | 9 | 6.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 129 | |
8 | 43 | 14.5% |
2 | 32 | 10.8% |
4 | 28 | 9.4% |
3 | 25 | 8.4% |
9 | 16 | 5.4% |
5 | 9 | 3.0% |
0 | 8 | 2.7% |
6 | 6 | 2.0% |
7 | 1 | 0.3% |
Other Punctuation
Value | Count | Frequency (%) |
& | 12 | |
. | 5 | |
? | 2 | 9.1% |
! | 1 | 4.5% |
· | 1 | 4.5% |
, | 1 | 4.5% |
Space Separator
Value | Count | Frequency (%) |
377 |
Open Punctuation
Value | Count | Frequency (%) |
( | 189 |
Close Punctuation
Value | Count | Frequency (%) |
) | 189 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 2 |
Other Symbol
Value | Count | Frequency (%) |
ⓡ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 79913 | |
Common | 1080 | 1.3% |
Latin | 878 | 1.1% |
Han | 9 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 12599 | |
개 | 7326 | 9.2% |
중 | 7302 | 9.1% |
소 | 6583 | 8.2% |
무 | 6532 | 8.2% |
인 | 6232 | 7.8% |
공 | 5820 | 7.3% |
동 | 3029 | 3.8% |
부 | 2765 | 3.5% |
산 | 2746 | 3.4% |
Other values (498) | 18979 |
Latin
Value | Count | Frequency (%) |
K | 128 | |
S | 91 | 10.4% |
L | 61 | 6.9% |
e | 60 | 6.8% |
T | 57 | 6.5% |
B | 46 | 5.2% |
H | 42 | 4.8% |
W | 42 | 4.8% |
O | 35 | 4.0% |
C | 33 | 3.8% |
Other values (31) | 283 |
Common
Value | Count | Frequency (%) |
377 | ||
( | 189 | |
) | 189 | |
1 | 129 | 11.9% |
8 | 43 | 4.0% |
2 | 32 | 3.0% |
4 | 28 | 2.6% |
3 | 25 | 2.3% |
9 | 16 | 1.5% |
& | 12 | 1.1% |
Other values (12) | 40 | 3.7% |
Han
Value | Count | Frequency (%) |
秀 | 1 | |
氷 | 1 | |
甲 | 1 | |
福 | 1 | |
該 | 1 | |
太 | 1 | |
利 | 1 | |
本 | 1 | |
炫 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 79913 | |
ASCII | 1952 | 2.4% |
CJK | 9 | < 0.1% |
None | 3 | < 0.1% |
Number Forms | 2 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 12599 | |
개 | 7326 | 9.2% |
중 | 7302 | 9.1% |
소 | 6583 | 8.2% |
무 | 6532 | 8.2% |
인 | 6232 | 7.8% |
공 | 5820 | 7.3% |
동 | 3029 | 3.8% |
부 | 2765 | 3.5% |
산 | 2746 | 3.4% |
Other values (498) | 18979 |
ASCII
Value | Count | Frequency (%) |
377 | ||
( | 189 | 9.7% |
) | 189 | 9.7% |
1 | 129 | 6.6% |
K | 128 | 6.6% |
S | 91 | 4.7% |
L | 61 | 3.1% |
e | 60 | 3.1% |
T | 57 | 2.9% |
B | 46 | 2.4% |
Other values (49) | 625 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 2 |
None
Value | Count | Frequency (%) |
? | 2 | |
· | 1 |
CJK
Value | Count | Frequency (%) |
秀 | 1 | |
氷 | 1 | |
甲 | 1 | |
福 | 1 | |
該 | 1 | |
太 | 1 | |
利 | 1 | |
本 | 1 | |
炫 | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓡ | 1 |
crqfcacqdt
Text
MISSING
 
Distinct | 644 |
---|---|
Distinct (%) | 11.0% |
Missing | 4152 |
Missing (%) | 41.5% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.999316 |
Min length | 8 |
Characters and Unicode
Total characters | 58476 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 425 |
---|---|
Unique (%) | 7.3% |
Sample
1st row | 1989-02-08 |
---|---|
2nd row | 2015-10-24 |
3rd row | 1985-11-14 |
4th row | 2019-12-09 |
5th row | 1985-11-04 |
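The length statistics (minimum 8, maximum 10) and the handful of space characters counted for this column suggest that a few values are not clean `YYYY-MM-DD` strings. The report does not show those values, so the following is only a sketch of how the column might be validated before use. The function name `parse_acquisition_date` is hypothetical, and trimming surrounding whitespace is an assumption about what cleanup is appropriate.

```python
from datetime import datetime


def parse_acquisition_date(raw):
    """Return a date for a YYYY-MM-DD string, or None if it does not parse."""
    if raw is None:
        return None
    text = raw.strip()  # assumption: stray spaces are safe to trim
    try:
        return datetime.strptime(text, "%Y-%m-%d").date()
    except ValueError:
        return None


# Sample values from the report, plus hypothetical malformed inputs.
for raw in ["1989-02-08", " 2015-10-24 ", "19890208", None]:
    print(repr(raw), "->", parse_acquisition_date(raw))
```

Values that return `None` can then be counted alongside the 4152 values the report already lists as missing.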
Value | Count | Frequency (%) |
2005-07-20 | 419 | 7.2% |
2017-12-11 | 318 | 5.4% |
2016-12-12 | 311 | 5.3% |
2019-12-09 | 243 | 4.2% |
2015-12-09 | 211 | 3.6% |
2003-11-07 | 200 | 3.4% |
2018-12-10 | 194 | 3.3% |
2005-12-12 | 191 | 3.3% |
2001-12-10 | 158 | 2.7% |
2000-11-20 | 154 | 2.6% |
Other values (634) | 3449 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 13265 | |
0 | 12335 | |
- | 11692 | |
2 | 11069 | |
9 | 2725 | 4.7% |
5 | 1785 | 3.1% |
7 | 1647 | 2.8% |
8 | 1434 | 2.5% |
3 | 1088 | 1.9% |
6 | 919 | 1.6% |
Other values (2) | 517 | 0.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 46780 | |
Dash Punctuation | 11692 | 20.0% |
Space Separator | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 13265 | |
0 | 12335 | |
2 | 11069 | |
9 | 2725 | 5.8% |
5 | 1785 | 3.8% |
7 | 1647 | 3.5% |
8 | 1434 | 3.1% |
3 | 1088 | 2.3% |
6 | 919 | 2.0% |
4 | 513 | 1.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11692 |
Space Separator
Value | Count | Frequency (%) |
4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 58476 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 13265 | |
0 | 12335 | |
- | 11692 | |
2 | 11069 | |
9 | 2725 | 4.7% |
5 | 1785 | 3.1% |
7 | 1647 | 2.8% |
8 | 1434 | 2.5% |
3 | 1088 | 1.9% |
6 | 919 | 1.6% |
Other values (2) | 517 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 58476 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 13265 | |
0 | 12335 | |
- | 11692 | |
2 | 11069 | |
9 | 2725 | 4.7% |
5 | 1785 | 3.1% |
7 | 1647 | 2.8% |
8 | 1434 | 2.5% |
3 | 1088 | 1.9% |
6 | 919 | 1.6% |
Other values (2) | 517 | 0.9% |
crqfcno
Text
MISSING
 
Distinct | 5713 |
---|---|
Distinct (%) | 96.1% |
Missing | 4053 |
Missing (%) | 40.5% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 20 |
Mean length | 9.151505 |
Min length | 1 |
Characters and Unicode
Total characters | 54424 |
---|---|
Distinct characters | 55 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 5497 |
---|---|
Unique (%) | 92.4% |
Sample
1st row | 부산88-536 |
---|---|
2nd row | 26-2015-978(부산) |
3rd row | 4335 |
4th row | 26-2019-01910 |
5th row | 1727 |
Value | Count | Frequency (%) |
부산 | 352 | 5.4% |
부산시 | 57 | 0.9% |
부산광역시 | 30 | 0.5% |
부산광역시장 | 23 | 0.4% |
경남 | 18 | 0.3% |
경상남도 | 6 | 0.1% |
21 | 5 | 0.1% |
1236 | 4 | 0.1% |
제 | 4 | 0.1% |
410 | 4 | 0.1% |
Other values (5674) | 5963 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8581 | |
2 | 6752 | |
1 | 6667 | |
- | 6065 | |
6 | 3804 | 7.0% |
4 | 2461 | 4.5% |
3 | 2459 | 4.5% |
8 | 2256 | 4.1% |
5 | 2229 | 4.1% |
9 | 2216 | 4.1% |
Other values (45) | 10934 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39631 | |
Other Letter | 6793 | 12.5% |
Dash Punctuation | 6065 | 11.1% |
Open Punctuation | 702 | 1.3% |
Close Punctuation | 702 | 1.3% |
Space Separator | 525 | 1.0% |
Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 2019 | |
부 | 2002 | |
호 | 512 | 7.5% |
제 | 458 | 6.7% |
시 | 410 | 6.0% |
광 | 310 | 4.6% |
역 | 310 | 4.6% |
장 | 167 | 2.5% |
경 | 144 | 2.1% |
남 | 121 | 1.8% |
Other values (27) | 340 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 8581 | |
2 | 6752 | |
1 | 6667 | |
6 | 3804 | |
4 | 2461 | 6.2% |
3 | 2459 | 6.2% |
8 | 2256 | 5.7% |
5 | 2229 | 5.6% |
9 | 2216 | 5.6% |
7 | 2206 | 5.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 654 | |
[ | 48 | 6.8% |
Close Punctuation
Value | Count | Frequency (%) |
) | 654 | |
] | 48 | 6.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 4 | |
: | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6065 |
Space Separator
Value | Count | Frequency (%) |
525 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47631 | |
Hangul | 6793 | 12.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 2019 | |
부 | 2002 | |
호 | 512 | 7.5% |
제 | 458 | 6.7% |
시 | 410 | 6.0% |
광 | 310 | 4.6% |
역 | 310 | 4.6% |
장 | 167 | 2.5% |
경 | 144 | 2.1% |
남 | 121 | 1.8% |
Other values (27) | 340 | 5.0% |
Common
Value | Count | Frequency (%) |
0 | 8581 | |
2 | 6752 | |
1 | 6667 | |
- | 6065 | |
6 | 3804 | |
4 | 2461 | 5.2% |
3 | 2459 | 5.2% |
8 | 2256 | 4.7% |
5 | 2229 | 4.7% |
9 | 2216 | 4.7% |
Other values (8) | 4141 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 47631 | |
Hangul | 6793 | 12.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8581 | |
2 | 6752 | |
1 | 6667 | |
- | 6065 | |
6 | 3804 | |
4 | 2461 | 5.2% |
3 | 2459 | 5.2% |
8 | 2256 | 4.7% |
5 | 2229 | 4.7% |
9 | 2216 | 4.7% |
Other values (8) | 4141 |
Hangul
Value | Count | Frequency (%) |
산 | 2019 | |
부 | 2002 | |
호 | 512 | 7.5% |
제 | 458 | 6.7% |
시 | 410 | 6.0% |
광 | 310 | 4.6% |
역 | 310 | 4.6% |
장 | 167 | 2.5% |
경 | 144 | 2.1% |
남 | 121 | 1.8% |
Other values (27) | 340 | 5.0% |
jurirno
Text
MISSING
 
Distinct | 4742 |
---|---|
Distinct (%) | 65.1% |
Missing | 2721 |
Missing (%) | 27.2% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 13.720841 |
Min length | 6 |
Characters and Unicode
Total characters | 99874 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 3463 |
---|---|
Unique (%) | 47.6% |
Sample
1st row | 26290-2018-00022 |
---|---|
2nd row | 26290-2017-00083 |
3rd row | 26230-2017-00191 |
4th row | 26350-2018-00140 |
5th row | 가-08-1292 |
Value | Count | Frequency (%) |
26470-2018-00085 | 40 | 0.5% |
26230-2016-00137 | 33 | 0.5% |
26470-2016-00066 | 31 | 0.4% |
가-13-1490 | 31 | 0.4% |
26470-2015-00027 | 28 | 0.4% |
26530-2017-00027 | 28 | 0.4% |
가-13-1750 | 25 | 0.3% |
가-05-3566 | 23 | 0.3% |
26470-2018-00103 | 22 | 0.3% |
가-13-1947 | 15 | 0.2% |
Other values (4735) | 7007 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 28514 | |
2 | 16072 | |
- | 14513 | |
1 | 9939 | 10.0% |
6 | 7805 | 7.8% |
3 | 4408 | 4.4% |
4 | 3729 | 3.7% |
5 | 3631 | 3.6% |
7 | 3312 | 3.3% |
9 | 2978 | 3.0% |
Other values (4) | 4973 | 5.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83091 | |
Dash Punctuation | 14513 | 14.5% |
Other Letter | 2266 | 2.3% |
Space Separator | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 28514 | |
2 | 16072 | |
1 | 9939 | 12.0% |
6 | 7805 | 9.4% |
3 | 4408 | 5.3% |
4 | 3729 | 4.5% |
5 | 3631 | 4.4% |
7 | 3312 | 4.0% |
9 | 2978 | 3.6% |
8 | 2703 | 3.3% |
Other Letter
Value | Count | Frequency (%) |
가 | 2245 | |
나 | 21 | 0.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14513 |
Space Separator
Value | Count | Frequency (%) |
4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 97608 | |
Hangul | 2266 | 2.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 28514 | |
2 | 16072 | |
- | 14513 | |
1 | 9939 | 10.2% |
6 | 7805 | 8.0% |
3 | 4408 | 4.5% |
4 | 3729 | 3.8% |
5 | 3631 | 3.7% |
7 | 3312 | 3.4% |
9 | 2978 | 3.1% |
Other values (2) | 2707 | 2.8% |
Hangul
Value | Count | Frequency (%) |
가 | 2245 | |
나 | 21 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97608 | |
Hangul | 2266 | 2.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 28514 | |
2 | 16072 | |
- | 14513 | |
1 | 9939 | 10.2% |
6 | 7805 | 8.0% |
3 | 4408 | 4.5% |
4 | 3729 | 3.8% |
5 | 3631 | 3.7% |
7 | 3312 | 3.4% |
9 | 2978 | 3.1% |
Other values (2) | 2707 | 2.8% |
Hangul
Value | Count | Frequency (%) |
가 | 2245 | |
나 | 21 | 0.9% |
lastupdtdt
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-01-02 | 10000 |
---|---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01-02 |
---|---|
2nd row | 2021-01-02 |
3rd row | 2021-01-02 |
4th row | 2021-01-02 |
5th row | 2021-01-02 |
Common Values
Value | Count | Frequency (%) |
2021-01-02 | 10000 | 100.0% |
ldcode
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26361.871 |
Minimum | 26110 |
---|---|
Maximum | 26710 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 26110 |
---|---|
5-th percentile | 26200 |
Q1 | 26260 |
median | 26350 |
Q3 | 26440 |
95-th percentile | 26530 |
Maximum | 26710 |
Range | 600 |
Interquartile range (IQR) | 180 |
Descriptive statistics
Standard deviation | 127.01418 |
---|---|
Coefficient of variation (CV) | 0.004818102 |
Kurtosis | 0.56325798 |
Mean | 26361.871 |
Median Absolute Deviation (MAD) | 90 |
Skewness | 0.6251349 |
Sum | 2.6361871 × 10^8 |
Variance | 16132.603 |
Monotonicity | Not monotonic |
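Several of the derived figures above can be recomputed from the reported mean, standard deviation, and quantiles. The sketch below is only an arithmetic check on the published values, with illustrative variable names. Note that ldcode appears to be a district code (each value maps to a district name in ldcodenm), so these statistics describe the codes rather than a measured quantity.

```python
# Figures copied from the ldcode statistics tables.
mean = 26361.871
std = 127.01418
q1, q3 = 26260, 26440
minimum, maximum = 26110, 26710
n = 10_000

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n                 # sum of all values

print(f"CV = {cv:.9f}")
print(f"IQR = {iqr}, range = {value_range}")
print(f"Sum = {total:.4e}")
```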
Value | Count | Frequency (%) |
26230 | 1436 | |
26350 | 1414 | |
26260 | 1078 | |
26470 | 925 | |
26410 | 839 | |
26440 | 749 | |
26380 | 679 | |
26500 | 615 | |
26290 | 545 | 5.5% |
26710 | 454 | 4.5% |
Other values (6) | 1266 |
Value | Count | Frequency (%) |
26110 | 174 | 1.7% |
26140 | 164 | 1.6% |
26170 | 159 | 1.6% |
26200 | 151 | 1.5% |
26230 | 1436 | |
26260 | 1078 | |
26290 | 545 | 5.5% |
26320 | 323 | 3.2% |
26350 | 1414 | |
26380 | 679 |
Value | Count | Frequency (%) |
26710 | 454 | 4.5% |
26530 | 295 | 2.9% |
26500 | 615 | |
26470 | 925 | |
26440 | 749 | |
26410 | 839 | |
26380 | 679 | |
26350 | 1414 | |
26320 | 323 | 3.2% |
26290 | 545 | 5.5% |
ldcodenm
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
부산광역시 부산진구 | 1436 |
---|---|
부산광역시 해운대구 | 1414 |
부산광역시 동래구 | 1078 |
부산광역시 연제구 | 925 |
부산광역시 금정구 | 839 |
Other values (11) | 4308 |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.1485 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 남구 |
---|---|
2nd row | 부산광역시 해운대구 |
3rd row | 부산광역시 남구 |
4th row | 부산광역시 기장군 |
5th row | 부산광역시 부산진구 |
Common Values
Value | Count | Frequency (%) |
부산광역시 부산진구 | 1436 | |
부산광역시 해운대구 | 1414 | |
부산광역시 동래구 | 1078 | |
부산광역시 연제구 | 925 | |
부산광역시 금정구 | 839 | |
부산광역시 강서구 | 749 | |
부산광역시 사하구 | 679 | |
부산광역시 수영구 | 615 | |
부산광역시 남구 | 545 | 5.5% |
부산광역시 기장군 | 454 | 4.5% |
Other values (6) | 1266 |
Length
Value | Count | Frequency (%) |
부산광역시 | 10000 | |
부산진구 | 1436 | 7.2% |
해운대구 | 1414 | 7.1% |
동래구 | 1078 | 5.4% |
연제구 | 925 | 4.6% |
금정구 | 839 | 4.2% |
강서구 | 749 | 3.7% |
사하구 | 679 | 3.4% |
수영구 | 615 | 3.1% |
남구 | 545 | 2.7% |
Other values (7) | 1720 | 8.6% |
ofcpssecode
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | 3686 |
---|---|
4 | 3576 |
<NA> | 2717 |
3 | 15 |
2 | 6 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.8151 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | <NA> |
3rd row | 1 |
4th row | <NA> |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
1 | 3686 | 36.9% |
4 | 3576 | 35.8% |
<NA> | 2717 | 27.2% |
3 | 15 | 0.1% |
2 | 6 | 0.1% |
ofcpssecodenm
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
대표 | 3686 |
---|---|
일반 | 3576 |
<NA> | 2717 |
이사 | 15 |
감사 | 6 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.5434 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | <NA> |
3rd row | 대표 |
4th row | <NA> |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
대표 | 3686 | 36.9% |
일반 | 3576 | 35.8% |
<NA> | 2717 | 27.2% |
이사 | 15 | 0.1% |
감사 | 6 | 0.1% |
last_load_dttm
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-01-06 13:28:01 | 5447 |
---|---|
2021-01-06 13:28:02 | 3779 |
2021-01-06 13:28:00 | 774 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01-06 13:28:01 |
---|---|
2nd row | 2021-01-06 13:28:01 |
3rd row | 2021-01-06 13:28:01 |
4th row | 2021-01-06 13:28:02 |
5th row | 2021-01-06 13:28:01 |
Common Values
Value | Count | Frequency (%) |
2021-01-06 13:28:01 | 5447 | 54.5% |
2021-01-06 13:28:02 | 3779 | 37.8% |
2021-01-06 13:28:00 | 774 | 7.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-01-06 | 10000 | |
13:28:01 | 5447 | |
13:28:02 | 3779 | 18.9% |
13:28:00 | 774 | 3.9% |
Variable | brkrasortcode | brkrasortcodenm | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|
brkrasortcode | 1.000 | 1.000 | 0.233 | 0.287 | 0.831 | 0.831 | 0.032 |
brkrasortcodenm | 1.000 | 1.000 | 0.233 | 0.287 | 0.831 | 0.831 | 0.032 |
ldcode | 0.233 | 0.233 | 1.000 | 1.000 | 0.226 | 0.226 | 0.873 |
ldcodenm | 0.287 | 0.287 | 1.000 | 1.000 | 0.251 | 0.251 | 0.971 |
ofcpssecode | 0.831 | 0.831 | 0.226 | 0.251 | 1.000 | 1.000 | 0.008 |
ofcpssecodenm | 0.831 | 0.831 | 0.226 | 0.251 | 1.000 | 1.000 | 0.008 |
last_load_dttm | 0.032 | 0.032 | 0.873 | 0.971 | 0.008 | 0.008 | 1.000 |
Variable | ofcpssecodenm | ldcodenm | brkrasortcode | ofcpssecode | last_load_dttm | brkrasortcodenm |
---|---|---|---|---|---|---|
ofcpssecodenm | 1.000 | 0.120 | 0.479 | 1.000 | 0.007 | 0.479 |
ldcodenm | 0.120 | 1.000 | 0.138 | 0.120 | 0.946 | 0.138 |
brkrasortcode | 0.479 | 0.138 | 1.000 | 0.479 | 0.030 | 1.000 |
ofcpssecode | 1.000 | 0.120 | 0.479 | 1.000 | 0.007 | 0.479 |
last_load_dttm | 0.007 | 0.946 | 0.030 | 0.007 | 1.000 | 0.030 |
brkrasortcodenm | 0.479 | 0.138 | 1.000 | 0.479 | 0.030 | 1.000 |
Variable | ldcode | brkrasortcode | brkrasortcodenm | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|
ldcode | 1.000 | 0.106 | 0.106 | 1.000 | 0.102 | 0.102 | 0.864 |
brkrasortcode | 0.106 | 1.000 | 1.000 | 0.138 | 0.479 | 0.479 | 0.030 |
brkrasortcodenm | 0.106 | 1.000 | 1.000 | 0.138 | 0.479 | 0.479 | 0.030 |
ldcodenm | 1.000 | 0.138 | 0.138 | 1.000 | 0.120 | 0.120 | 0.946 |
ofcpssecode | 0.102 | 0.479 | 0.479 | 0.120 | 1.000 | 1.000 | 0.007 |
ofcpssecodenm | 0.102 | 0.479 | 0.479 | 0.120 | 1.000 | 1.000 | 0.007 |
last_load_dttm | 0.864 | 0.030 | 0.030 | 0.946 | 0.007 | 0.007 | 1.000 |
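The report does not say which coefficient each matrix above uses. For pairs of categorical variables, a common choice is Cramér's V, which is 1.0 when one variable fully determines the other. That would be consistent with the 1.000 entries for code and name pairs such as brkrasortcode and brkrasortcodenm. The sketch below is a plain, uncorrected Cramér's V written with the standard library only. It is an assumption that the report uses a comparable measure, and a bias-corrected variant would give somewhat different values.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))


# Code and name pairs as they appear in the sample rows: a one-to-one mapping.
codes = ["4", "2", "2", "1"]
names = ["중개보조원", "공인중개사", "공인중개사", "중개인"]
print(cramers_v(codes, names))
```

Because each code maps to exactly one name, the two columns carry the same information, and one of each pair could be dropped before modelling.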
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6291 | 4 | 중개보조원 | 한정욱 | 정인공인중개사사무소 | <NA> | <NA> | 26290-2018-00022 | 2021-01-02 | 26290 | 부산광역시 남구 | 4 | 일반 | 2021-01-06 13:28:01 |
10337 | 2 | 공인중개사 | 김길홍 | <NA> | 1989-02-08 | 부산88-536 | <NA> | 2021-01-02 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2021-01-06 13:28:01 |
6433 | 2 | 공인중개사 | 정진홍 | 골드공인중개사사무소 | 2015-10-24 | 26-2015-978(부산) | 26290-2017-00083 | 2021-01-02 | 26290 | 부산광역시 남구 | 1 | 대표 | 2021-01-06 13:28:01 |
19347 | 2 | 공인중개사 | 김홍석 | <NA> | 1985-11-14 | 4335 | <NA> | 2021-01-02 | 26710 | 부산광역시 기장군 | <NA> | <NA> | 2021-01-06 13:28:02 |
2936 | 4 | 중개보조원 | 변용현 | 삼보공인중개사사무소 | <NA> | <NA> | 26230-2017-00191 | 2021-01-02 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-01-06 13:28:01 |
10019 | 2 | 공인중개사 | 김윤경 | 유한회사맥비스타부동산중개법인 | 2019-12-09 | 26-2019-01910 | 26350-2018-00140 | 2021-01-02 | 26350 | 부산광역시 해운대구 | 4 | 일반 | 2021-01-06 13:28:01 |
4179 | 2 | 공인중개사 | 강권칠 | <NA> | 1985-11-04 | 1727 | <NA> | 2021-01-02 | 26230 | 부산광역시 부산진구 | <NA> | <NA> | 2021-01-06 13:28:01 |
10233 | 2 | 공인중개사 | 김순희 | <NA> | 2003-11-19 | 서울14-07930 | <NA> | 2021-01-02 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2021-01-06 13:28:01 |
11102 | 2 | 공인중개사 | 이동찬 | <NA> | 2005-12-12 | 제16-822호 | <NA> | 2021-01-02 | 26380 | 부산광역시 사하구 | <NA> | <NA> | 2021-01-06 13:28:01 |
11698 | 2 | 공인중개사 | 김종대 | 신세계공인중개사사무소 | 2005-12-12 | 부산16-828 | 가-08-1292 | 2021-01-02 | 26380 | 부산광역시 사하구 | 1 | 대표 | 2021-01-06 13:28:01 |
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5181 | 2 | 공인중개사 | 신옥자 | <NA> | 2005-07-20 | 336 | <NA> | 2021-01-02 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-01-06 13:28:01 |
16709 | 4 | 중개보조원 | 김병주 | 탑공인중개사사무소 | <NA> | <NA> | 26470-2020-00012 | 2021-01-02 | 26470 | 부산광역시 연제구 | 4 | 일반 | 2021-01-06 13:28:02 |
18417 | 4 | 중개보조원 | 신영자 | 대흥공인중개사 사무소 | <NA> | <NA> | 가-15-908 | 2021-01-02 | 26530 | 부산광역시 사상구 | 4 | 일반 | 2021-01-06 13:28:02 |
5692 | 2 | 공인중개사 | 김종득 | 새우성공인중개사사무소 | 1985-11-06 | 2468 | 가-06-2303 | 2021-01-02 | 26260 | 부산광역시 동래구 | 1 | 대표 | 2021-01-06 13:28:01 |
4133 | 2 | 공인중개사 | 홍승휘 | <NA> | 2000-11-20 | 부산시 11-143 | <NA> | 2021-01-02 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-01-06 13:28:01 |
12329 | 2 | 공인중개사 | 임수경(任壽京) | <NA> | 2005-07-20 | 1145(부산광역시) | <NA> | 2021-01-02 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2021-01-06 13:28:02 |
8687 | 2 | 공인중개사 | 이종석 | <NA> | 2005-07-20 | 부산124 | <NA> | 2021-01-02 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2021-01-06 13:28:01 |
5669 | 4 | 중개보조원 | 노병흡 | 박사공인중개사사무소 | <NA> | <NA> | 26260-2020-00042 | 2021-01-02 | 26260 | 부산광역시 동래구 | 4 | 일반 | 2021-01-06 13:28:01 |
2425 | 2 | 공인중개사 | 이경준 | 주식회사 삼오부동산중개법인 | 2014-12-10 | 26-2014-726 | 26230-2016-00137 | 2021-01-02 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-01-06 13:28:01 |
19259 | 4 | 중개보조원 | 송정숙 | 예율공인중개사사무소 | <NA> | <NA> | 26710-2017-00020 | 2021-01-02 | 26710 | 부산광역시 기장군 | 4 | 일반 | 2021-01-06 13:28:02 |
Most frequently occurring
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2 | 공인중개사 | 박창호 | <NA> | <NA> | <NA> | <NA> | 2021-01-02 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-01-06 13:28:01 | 2 |
1 | 4 | 중개보조원 | 김덕환 | <NA> | <NA> | <NA> | <NA> | 2021-01-02 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2021-01-06 13:28:02 | 2 |
2 | 4 | 중개보조원 | 김승모 | 부산상가채널공인중개사사무소 | <NA> | <NA> | 26500-2018-00097 | 2021-01-02 | 26500 | 부산광역시 수영구 | 4 | 일반 | 2021-01-06 13:28:02 | 2 |
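The table above lists each row that occurs more than once, together with its occurrence count. A minimal sketch of that kind of check, using only the standard library, is shown below. The helper name `duplicate_rows` is hypothetical, and the rows are shortened to three fields taken from the table for illustration.

```python
from collections import Counter


def duplicate_rows(rows):
    """Map each row that occurs more than once to its occurrence count."""
    counts = Counter(tuple(row) for row in rows)
    return {row: count for row, count in counts.items() if count > 1}


# Shortened rows: (brkrasortcode, brkrnm, ldcode).
rows = [
    ("2", "박창호", "26260"),
    ("4", "김덕환", "26410"),
    ("2", "박창호", "26260"),
    ("4", "김승모", "26500"),
    ("4", "김덕환", "26410"),
]
dupes = duplicate_rows(rows)
for row, count in dupes.items():
    print(row, count)
```

Whether such rows are true duplicates or distinct people who share a name and district cannot be determined from the report alone.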