Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 13696 |
Missing cells (%) | 10.5% |
Duplicate rows | 5 |
Duplicate rows (%) | 0.1% |
Total size in memory | 1.1 MiB |
Average record size in memory | 115.0 B |
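The percentages in this table can be re-derived from the raw counts. The following is a minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (variables × observations); the variable names are illustrative and not part of the report.

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 13
n_observations = 10_000
missing_cells = 13_696
duplicate_rows = 5

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.2f}%")
print(f"Duplicate rows: {duplicate_pct:.2f}%")
```

The duplicate share works out to 0.05%, which the report displays as 0.1% at one decimal place.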
Variable types
Categorical | 5 |
---|---|
Text | 5 |
DateTime | 2 |
Numeric | 1 |
Dataset
Description | 2020-12-23 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901062201148000 |
Alerts
lastupdtdt has constant value "2020-12-22" | Constant |
Dataset has 5 (0.1%) duplicate rows | Duplicates |
ofcpssecodenm is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
brkrasortcode is highly overall correlated with brkrasortcodenm and 2 other fields | High correlation |
ofcpssecode is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
brkrasortcodenm is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
ldcode is highly overall correlated with ldcodenm | High correlation |
ldcodenm is highly overall correlated with ldcode | High correlation |
bsnmcmpnm has 2705 (27.1%) missing values | Missing |
crqfcacqdt has 4204 (42.0%) missing values | Missing |
crqfcno has 4082 (40.8%) missing values | Missing |
jurirno has 2705 (27.1%) missing values | Missing |
Reproduction
Analysis started | 2024-04-16 10:26:03.845030 |
---|---|
Analysis finished | 2024-04-16 10:26:05.588745 |
Duration | 1.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
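The reported duration is the difference between the two timestamps above. A short sketch of that check using only the standard library (the format string is an assumption based on how the timestamps are printed here):

```python
from datetime import datetime

# Timestamp layout as printed in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-16 10:26:03.845030", FMT)
finished = datetime.strptime("2024-04-16 10:26:05.588745", FMT)

# Elapsed wall-clock time of the analysis run, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```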
brkrasortcode
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 4 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 6541 | 65.4% |
4 | 3256 | 32.6% |
1 | 201 | 2.0% |
3 | 2 | < 0.1% |
brkrasortcodenm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9592 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 중개보조원 |
---|---|
2nd row | 공인중개사 |
3rd row | 공인중개사 |
4th row | 중개보조원 |
5th row | 공인중개사 |
Common Values
Value | Count | Frequency (%) |
공인중개사 | 6541 | 65.4% |
중개보조원 | 3256 | 32.6% |
중개인 | 201 | 2.0% |
법인 | 2 | < 0.1% |
brkrnm
Text
Distinct | 7919 |
---|---|
Distinct (%) | 79.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
김영희 | 19 | 0.2% |
김정숙 | 14 | 0.1% |
김민정 | 11 | 0.1% |
김경희 | 11 | 0.1% |
이영주 | 11 | 0.1% |
김인숙 | 10 | 0.1% |
이정희 | 10 | 0.1% |
정영희 | 9 | 0.1% |
이정훈 | 9 | 0.1% |
김정희 | 9 | 0.1% |
Other values (7914) | 9894 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 2264 | 7.5% |
이 | 1412 | 4.7% |
정 | 1349 | 4.5% |
영 | 1001 | 3.3% |
박 | 904 | 3.0% |
희 | 702 | 2.3% |
경 | 614 | 2.0% |
미 | 560 | 1.9% |
성 | 551 | 1.8% |
숙 | 549 | 1.8% |
Other values (375) | 20231 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30038 | |
Open Punctuation | 41 | 0.1% |
Close Punctuation | 41 | 0.1% |
Uppercase Letter | 9 | < 0.1% |
Space Separator | 8 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 2264 | 7.5% |
이 | 1412 | 4.7% |
정 | 1349 | 4.5% |
영 | 1001 | 3.3% |
박 | 904 | 3.0% |
희 | 702 | 2.3% |
경 | 614 | 2.0% |
미 | 560 | 1.9% |
성 | 551 | 1.8% |
숙 | 549 | 1.8% |
Other values (366) | 20132 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 3 | |
N | 2 | |
Y | 1 | 11.1% |
A | 1 | 11.1% |
T | 1 | 11.1% |
J | 1 | 11.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 41 |
Close Punctuation
Value | Count | Frequency (%) |
) | 41 |
Space Separator
Value | Count | Frequency (%) |
(space) | 8 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29920 | |
Han | 118 | 0.4% |
Common | 90 | 0.3% |
Latin | 9 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 2264 | 7.6% |
이 | 1412 | 4.7% |
정 | 1349 | 4.5% |
영 | 1001 | 3.3% |
박 | 904 | 3.0% |
희 | 702 | 2.3% |
경 | 614 | 2.1% |
미 | 560 | 1.9% |
성 | 551 | 1.8% |
숙 | 549 | 1.8% |
Other values (284) | 20014 |
Han
Value | Count | Frequency (%) |
金 | 8 | 6.8% |
李 | 6 | 5.1% |
鄭 | 5 | 4.2% |
崔 | 5 | 4.2% |
子 | 4 | 3.4% |
朴 | 3 | 2.5% |
順 | 3 | 2.5% |
基 | 3 | 2.5% |
根 | 2 | 1.7% |
成 | 2 | 1.7% |
Other values (72) | 77 |
Latin
Value | Count | Frequency (%) |
I | 3 | |
N | 2 | |
Y | 1 | 11.1% |
A | 1 | 11.1% |
T | 1 | 11.1% |
J | 1 | 11.1% |
Common
Value | Count | Frequency (%) |
( | 41 | |
) | 41 | |
(space) | 8 | 8.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29919 | |
CJK | 111 | 0.4% |
ASCII | 99 | 0.3% |
CJK Compat Ideographs | 7 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 2264 | 7.6% |
이 | 1412 | 4.7% |
정 | 1349 | 4.5% |
영 | 1001 | 3.3% |
박 | 904 | 3.0% |
희 | 702 | 2.3% |
경 | 614 | 2.1% |
미 | 560 | 1.9% |
성 | 551 | 1.8% |
숙 | 549 | 1.8% |
Other values (283) | 20013 |
ASCII
Value | Count | Frequency (%) |
( | 41 | |
) | 41 | |
(space) | 8 | 8.1% |
I | 3 | 3.0% |
N | 2 | 2.0% |
Y | 1 | 1.0% |
A | 1 | 1.0% |
T | 1 | 1.0% |
J | 1 | 1.0% |
CJK
Value | Count | Frequency (%) |
金 | 8 | 7.2% |
鄭 | 5 | 4.5% |
崔 | 5 | 4.5% |
子 | 4 | 3.6% |
朴 | 3 | 2.7% |
順 | 3 | 2.7% |
基 | 3 | 2.7% |
根 | 2 | 1.8% |
成 | 2 | 1.8% |
永 | 2 | 1.8% |
Other values (70) | 74 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 6 | |
林 | 1 | 14.3% |
Compat Jamo
Value | Count | Frequency (%) |
ㅂ | 1 |
bsnmcmpnm
Text
MISSING
 
Distinct | 3317 |
---|---|
Distinct (%) | 45.5% |
Missing | 2705 |
Missing (%) | 27.1% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 22 |
Mean length | 11.267992 |
Min length | 4 |
Characters and Unicode
Total characters | 82200 |
---|---|
Distinct characters | 574 |
Distinct categories | 12 |
Distinct scripts | 4 |
Distinct blocks | 6 |
Unique
Unique | 1988 |
---|---|
Unique (%) | 27.3% |
Sample
1st row | 왕우공인중개사사무소 |
---|---|
2nd row | The파크공인중개사사무소 |
3rd row | 소정공인중개사사무소 |
4th row | 명품공인중개사사무소 |
5th row | 토박이공인중개사사무소 |
Value | Count | Frequency (%) |
주식회사 | 103 | 1.4% |
공인중개사사무소 | 79 | 1.0% |
사무소 | 64 | 0.8% |
조은공인중개사사무소 | 46 | 0.6% |
주)부동산중개법인개벽 | 40 | 0.5% |
현대공인중개사사무소 | 37 | 0.5% |
주)온나라부동산중개법인 | 36 | 0.5% |
삼성공인중개사사무소 | 34 | 0.4% |
삼오부동산중개법인 | 33 | 0.4% |
미래공인중개사사무소 | 32 | 0.4% |
Other values (3310) | 7102 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 12696 | |
개 | 7339 | 8.9% |
중 | 7314 | 8.9% |
소 | 6601 | 8.0% |
무 | 6560 | 8.0% |
인 | 6323 | 7.7% |
공 | 5913 | 7.2% |
동 | 3001 | 3.7% |
부 | 2751 | 3.3% |
산 | 2719 | 3.3% |
Other values (564) | 20983 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 80171 | |
Uppercase Letter | 753 | 0.9% |
Space Separator | 383 | 0.5% |
Decimal Number | 319 | 0.4% |
Close Punctuation | 208 | 0.3% |
Open Punctuation | 208 | 0.3% |
Lowercase Letter | 131 | 0.2% |
Other Punctuation | 19 | < 0.1% |
Dash Punctuation | 5 | < 0.1% |
Letter Number | 1 | < 0.1% |
Other values (2) | 2 | < 0.1% |
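The category labels in these tables (Other Letter, Uppercase Letter, Space Separator, and so on) correspond to Unicode general categories. As a rough illustration of how such counts can be obtained, the sketch below uses Python's `unicodedata` module; `CATEGORY_NAMES` and `category_counts` are hypothetical helpers, and this is not necessarily how the profiler itself computes them.

```python
import unicodedata
from collections import Counter

# Long names for a few general-category codes, matching the report's labels.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Two office names that appear in the sample rows of this report.
sample = ["The파크공인중개사사무소", "(주)온나라부동산중개법인"]
print(category_counts(sample).most_common())
```

Hangul syllables fall under Other Letter, which is why that category dominates every text column in this dataset.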
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 12696 | |
개 | 7339 | 9.2% |
중 | 7314 | 9.1% |
소 | 6601 | 8.2% |
무 | 6560 | 8.2% |
인 | 6323 | 7.9% |
공 | 5913 | 7.4% |
동 | 3001 | 3.7% |
부 | 2751 | 3.4% |
산 | 2719 | 3.4% |
Other values (502) | 18954 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 138 | |
S | 91 | |
T | 59 | 7.8% |
L | 59 | 7.8% |
W | 43 | 5.7% |
C | 43 | 5.7% |
H | 38 | 5.0% |
O | 35 | 4.6% |
B | 32 | 4.2% |
E | 29 | 3.9% |
Other values (14) | 186 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 58 | |
h | 20 | 15.3% |
w | 10 | 7.6% |
t | 9 | 6.9% |
s | 7 | 5.3% |
c | 6 | 4.6% |
k | 5 | 3.8% |
b | 4 | 3.1% |
i | 3 | 2.3% |
o | 3 | 2.3% |
Other values (5) | 6 | 4.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 134 | |
8 | 42 | 13.2% |
2 | 39 | 12.2% |
4 | 38 | 11.9% |
3 | 23 | 7.2% |
9 | 19 | 6.0% |
5 | 9 | 2.8% |
0 | 6 | 1.9% |
7 | 5 | 1.6% |
6 | 4 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 7 | |
& | 6 | |
? | 2 | 10.5% |
· | 2 | 10.5% |
! | 1 | 5.3% |
# | 1 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 383 |
Close Punctuation
Value | Count | Frequency (%) |
) | 208 |
Open Punctuation
Value | Count | Frequency (%) |
( | 208 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Other Symbol
Value | Count | Frequency (%) |
ⓡ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 80159 | |
Common | 1144 | 1.4% |
Latin | 885 | 1.1% |
Han | 12 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 12696 | |
개 | 7339 | 9.2% |
중 | 7314 | 9.1% |
소 | 6601 | 8.2% |
무 | 6560 | 8.2% |
인 | 6323 | 7.9% |
공 | 5913 | 7.4% |
동 | 3001 | 3.7% |
부 | 2751 | 3.4% |
산 | 2719 | 3.4% |
Other values (492) | 18942 |
Latin
Value | Count | Frequency (%) |
K | 138 | |
S | 91 | 10.3% |
T | 59 | 6.7% |
L | 59 | 6.7% |
e | 58 | 6.6% |
W | 43 | 4.9% |
C | 43 | 4.9% |
H | 38 | 4.3% |
O | 35 | 4.0% |
B | 32 | 3.6% |
Other values (30) | 289 |
Common
Value | Count | Frequency (%) |
(space) | 383 | |
) | 208 | |
( | 208 | |
1 | 134 | 11.7% |
8 | 42 | 3.7% |
2 | 39 | 3.4% |
4 | 38 | 3.3% |
3 | 23 | 2.0% |
9 | 19 | 1.7% |
5 | 9 | 0.8% |
Other values (12) | 41 | 3.6% |
Han
Value | Count | Frequency (%) |
福 | 3 | |
甲 | 1 | 8.3% |
秀 | 1 | 8.3% |
該 | 1 | 8.3% |
氷 | 1 | 8.3% |
人 | 1 | 8.3% |
本 | 1 | 8.3% |
炫 | 1 | 8.3% |
明 | 1 | 8.3% |
堂 | 1 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 80159 | |
ASCII | 2023 | 2.5% |
CJK | 12 | < 0.1% |
None | 4 | < 0.1% |
Number Forms | 1 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 12696 | |
개 | 7339 | 9.2% |
중 | 7314 | 9.1% |
소 | 6601 | 8.2% |
무 | 6560 | 8.2% |
인 | 6323 | 7.9% |
공 | 5913 | 7.4% |
동 | 3001 | 3.7% |
부 | 2751 | 3.4% |
산 | 2719 | 3.4% |
Other values (492) | 18942 |
ASCII
Value | Count | Frequency (%) |
(space) | 383 | |
) | 208 | 10.3% |
( | 208 | 10.3% |
K | 138 | 6.8% |
1 | 134 | 6.6% |
S | 91 | 4.5% |
T | 59 | 2.9% |
L | 59 | 2.9% |
e | 58 | 2.9% |
W | 43 | 2.1% |
Other values (48) | 642 |
CJK
Value | Count | Frequency (%) |
福 | 3 | |
甲 | 1 | 8.3% |
秀 | 1 | 8.3% |
該 | 1 | 8.3% |
氷 | 1 | 8.3% |
人 | 1 | 8.3% |
本 | 1 | 8.3% |
炫 | 1 | 8.3% |
明 | 1 | 8.3% |
堂 | 1 | 8.3% |
None
Value | Count | Frequency (%) |
? | 2 | |
· | 2 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓡ | 1 |
crqfcacqdt
Text
MISSING
 
Distinct | 641 |
---|---|
Distinct (%) | 11.1% |
Missing | 4204 |
Missing (%) | 42.0% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9993099 |
Min length | 8 |
Characters and Unicode
Total characters | 57956 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 400 |
---|---|
Unique (%) | 6.9% |
Sample
1st row | 2014-10-02 |
---|---|
2nd row | 2012-12-10 |
3rd row | 1985-11-18 |
4th row | 2007-12-17 |
5th row | 2016-12-12 |
Value | Count | Frequency (%) |
2005-07-20 | 382 | 6.6% |
2017-12-11 | 350 | 6.0% |
2016-12-12 | 288 | 5.0% |
2019-12-09 | 247 | 4.3% |
2018-12-10 | 205 | 3.5% |
2003-11-07 | 197 | 3.4% |
2015-12-09 | 195 | 3.4% |
2005-12-12 | 170 | 2.9% |
2001-12-10 | 156 | 2.7% |
2011-12-19 | 153 | 2.6% |
Other values (631) | 3453 |
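crqfcacqdt is typed as Text even though its values look like ISO dates. The minimum length of 8 and the four space characters counted for this column suggest a few entries deviate from the `YYYY-MM-DD` pattern. A minimal sketch of a tolerant parser follows, assuming the intended format is `YYYY-MM-DD`; `parse_acquisition_date` is a hypothetical helper and the leading-space input is an invented example, not a value taken from the data.

```python
from datetime import datetime

def parse_acquisition_date(raw):
    """Return a date for a YYYY-MM-DD string, or None if it cannot be parsed."""
    if raw is None:
        return None
    text = raw.strip()  # tolerate stray surrounding spaces
    try:
        return datetime.strptime(text, "%Y-%m-%d").date()
    except ValueError:
        return None  # placeholder strings such as "<NA>" end up here

values = ["2014-10-02", " 2005-07-20", "<NA>", None]
parsed = [parse_acquisition_date(v) for v in values]
print(parsed)
```

Values that come back as `None` can then be inspected by hand before the column is converted to a date type.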
Most occurring characters
Value | Count | Frequency (%) |
1 | 13299 | |
0 | 12139 | |
- | 11588 | |
2 | 10884 | |
9 | 2745 | 4.7% |
5 | 1701 | 2.9% |
7 | 1572 | 2.7% |
8 | 1449 | 2.5% |
3 | 1086 | 1.9% |
6 | 944 | 1.6% |
Other values (2) | 549 | 0.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 46364 | |
Dash Punctuation | 11588 | 20.0% |
Space Separator | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 13299 | |
0 | 12139 | |
2 | 10884 | |
9 | 2745 | 5.9% |
5 | 1701 | 3.7% |
7 | 1572 | 3.4% |
8 | 1449 | 3.1% |
3 | 1086 | 2.3% |
6 | 944 | 2.0% |
4 | 545 | 1.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11588 |
Space Separator
Value | Count | Frequency (%) |
(space) | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 57956 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 13299 | |
0 | 12139 | |
- | 11588 | |
2 | 10884 | |
9 | 2745 | 4.7% |
5 | 1701 | 2.9% |
7 | 1572 | 2.7% |
8 | 1449 | 2.5% |
3 | 1086 | 1.9% |
6 | 944 | 1.6% |
Other values (2) | 549 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 57956 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 13299 | |
0 | 12139 | |
- | 11588 | |
2 | 10884 | |
9 | 2745 | 4.7% |
5 | 1701 | 2.9% |
7 | 1572 | 2.7% |
8 | 1449 | 2.5% |
3 | 1086 | 1.9% |
6 | 944 | 1.6% |
Other values (2) | 549 | 0.9% |
crqfcno
Text
MISSING
 
Distinct | 5693 |
---|---|
Distinct (%) | 96.2% |
Missing | 4082 |
Missing (%) | 40.8% |
Memory size | 156.2 KiB |
Length
Max length | 24 |
---|---|
Median length | 21 |
Mean length | 9.1809733 |
Min length | 1 |
Characters and Unicode
Total characters | 54333 |
---|---|
Distinct characters | 62 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 5485 |
---|---|
Unique (%) | 92.7% |
Sample
1st row | 10-00771 |
---|---|
2nd row | 23-00361 |
3rd row | 5119 |
4th row | 18-170 |
5th row | 26-2016-02198 |
Value | Count | Frequency (%) |
부산 | 376 | 5.8% |
부산시 | 46 | 0.7% |
부산광역시 | 22 | 0.3% |
경남 | 19 | 0.3% |
부산광역시장 | 17 | 0.3% |
경상남도 | 6 | 0.1% |
1154 | 5 | 0.1% |
경기도 | 5 | 0.1% |
제 | 5 | 0.1% |
716 | 4 | 0.1% |
Other values (5655) | 5942 |
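The token table above shows region names (부산, 부산시, 경남, ...) mixed into crqfcno, and the sample rows contain several layouts such as `10-00771`, `제19-342호` and `26-2017-00867(부산)`. One possible way to compare such values is to keep only their digit groups. The sketch below is an assumption about a useful normalization, not a rule stated by the data provider, and `digit_groups` is a hypothetical helper.

```python
import re

def digit_groups(raw):
    """Join the digit runs of a license number with hyphens; None if no digits."""
    groups = re.findall(r"\d+", raw)
    return "-".join(groups) if groups else None

# Values taken from the sample rows of this report.
examples = ["10-00771", "제19-342호", "26-2017-00867(부산)", "부산광역시 - 3175"]
for value in examples:
    print(value, "->", digit_groups(value))
```

Dropping the region text loses information about the issuing authority, so the original string should be kept alongside any normalized form.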
Most occurring characters
Value | Count | Frequency (%) |
0 | 8643 | |
2 | 6809 | |
1 | 6575 | |
- | 6056 | |
6 | 3784 | 7.0% |
4 | 2507 | 4.6% |
3 | 2441 | 4.5% |
8 | 2248 | 4.1% |
7 | 2241 | 4.1% |
5 | 2241 | 4.1% |
Other values (52) | 10788 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39691 | |
Other Letter | 6660 | 12.3% |
Dash Punctuation | 6056 | 11.1% |
Open Punctuation | 692 | 1.3% |
Close Punctuation | 691 | 1.3% |
Space Separator | 532 | 1.0% |
Other Punctuation | 11 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 1969 | |
부 | 1949 | |
호 | 525 | 7.9% |
제 | 471 | 7.1% |
시 | 380 | 5.7% |
광 | 288 | 4.3% |
역 | 287 | 4.3% |
장 | 161 | 2.4% |
경 | 138 | 2.1% |
남 | 112 | 1.7% |
Other values (33) | 380 | 5.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 8643 | |
2 | 6809 | |
1 | 6575 | |
6 | 3784 | |
4 | 2507 | 6.3% |
3 | 2441 | 6.2% |
8 | 2248 | 5.7% |
7 | 2241 | 5.6% |
5 | 2241 | 5.6% |
9 | 2202 | 5.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 5 | |
, | 4 | |
: | 2 | 18.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 633 | |
[ | 59 | 8.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 632 | |
] | 59 | 8.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6056 |
Space Separator
Value | Count | Frequency (%) |
(space) | 532 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47673 | |
Hangul | 6660 | 12.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 1969 | |
부 | 1949 | |
호 | 525 | 7.9% |
제 | 471 | 7.1% |
시 | 380 | 5.7% |
광 | 288 | 4.3% |
역 | 287 | 4.3% |
장 | 161 | 2.4% |
경 | 138 | 2.1% |
남 | 112 | 1.7% |
Other values (33) | 380 | 5.7% |
Common
Value | Count | Frequency (%) |
0 | 8643 | |
2 | 6809 | |
1 | 6575 | |
- | 6056 | |
6 | 3784 | |
4 | 2507 | 5.3% |
3 | 2441 | 5.1% |
8 | 2248 | 4.7% |
7 | 2241 | 4.7% |
5 | 2241 | 4.7% |
Other values (9) | 4128 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 47673 | |
Hangul | 6660 | 12.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8643 | |
2 | 6809 | |
1 | 6575 | |
- | 6056 | |
6 | 3784 | |
4 | 2507 | 5.3% |
3 | 2441 | 5.1% |
8 | 2248 | 4.7% |
7 | 2241 | 4.7% |
5 | 2241 | 4.7% |
Other values (9) | 4128 |
Hangul
Value | Count | Frequency (%) |
산 | 1969 | |
부 | 1949 | |
호 | 525 | 7.9% |
제 | 471 | 7.1% |
시 | 380 | 5.7% |
광 | 288 | 4.3% |
역 | 287 | 4.3% |
장 | 161 | 2.4% |
경 | 138 | 2.1% |
남 | 112 | 1.7% |
Other values (33) | 380 | 5.7% |
jurirno
Text
MISSING
 
Distinct | 4730 |
---|---|
Distinct (%) | 64.8% |
Missing | 2705 |
Missing (%) | 27.1% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 13.727759 |
Min length | 6 |
Characters and Unicode
Total characters | 100144 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 3393 |
---|---|
Unique (%) | 46.5% |
Sample
1st row | 26470-2020-00141 |
---|---|
2nd row | 가-10-3423 |
3rd row | 가-11-1977 |
4th row | 26320-2019-00052 |
5th row | 26440-2020-00094 |
Value | Count | Frequency (%) |
26470-2018-00085 | 40 | 0.5% |
26470-2016-00066 | 36 | 0.5% |
26230-2016-00137 | 33 | 0.5% |
26470-2015-00027 | 32 | 0.4% |
가-13-1490 | 27 | 0.4% |
26470-2018-00103 | 24 | 0.3% |
26530-2017-00027 | 23 | 0.3% |
가-05-3566 | 20 | 0.3% |
26230-2020-00110 | 17 | 0.2% |
26230-2016-00096 | 17 | 0.2% |
Other values (4723) | 7030 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 28632 | |
2 | 16146 | |
- | 14532 | |
1 | 10058 | 10.0% |
6 | 7886 | 7.9% |
3 | 4269 | 4.3% |
4 | 3783 | 3.8% |
5 | 3601 | 3.6% |
7 | 3336 | 3.3% |
9 | 3006 | 3.0% |
Other values (4) | 4895 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83347 | |
Dash Punctuation | 14532 | 14.5% |
Other Letter | 2261 | 2.3% |
Space Separator | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 28632 | |
2 | 16146 | |
1 | 10058 | 12.1% |
6 | 7886 | 9.5% |
3 | 4269 | 5.1% |
4 | 3783 | 4.5% |
5 | 3601 | 4.3% |
7 | 3336 | 4.0% |
9 | 3006 | 3.6% |
8 | 2630 | 3.2% |
Other Letter
Value | Count | Frequency (%) |
가 | 2244 | |
나 | 17 | 0.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14532 |
Space Separator
Value | Count | Frequency (%) |
(space) | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 97883 | |
Hangul | 2261 | 2.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 28632 | |
2 | 16146 | |
- | 14532 | |
1 | 10058 | 10.3% |
6 | 7886 | 8.1% |
3 | 4269 | 4.4% |
4 | 3783 | 3.9% |
5 | 3601 | 3.7% |
7 | 3336 | 3.4% |
9 | 3006 | 3.1% |
Other values (2) | 2634 | 2.7% |
Hangul
Value | Count | Frequency (%) |
가 | 2244 | |
나 | 17 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97883 | |
Hangul | 2261 | 2.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 28632 | |
2 | 16146 | |
- | 14532 | |
1 | 10058 | 10.3% |
6 | 7886 | 8.1% |
3 | 4269 | 4.4% |
4 | 3783 | 3.9% |
5 | 3601 | 3.7% |
7 | 3336 | 3.4% |
9 | 3006 | 3.1% |
Other values (2) | 2634 | 2.7% |
Hangul
Value | Count | Frequency (%) |
가 | 2244 | |
나 | 17 | 0.8% |
lastupdtdt
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-12-22 00:00:00 |
---|---|
Maximum | 2020-12-22 00:00:00 |
ldcode
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26363.647 |
Minimum | 26110 |
---|---|
Maximum | 26710 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 26110 |
---|---|
5-th percentile | 26170 |
Q1 | 26260 |
median | 26350 |
Q3 | 26440 |
95-th percentile | 26530 |
Maximum | 26710 |
Range | 600 |
Interquartile range (IQR) | 180 |
Descriptive statistics
Standard deviation | 128.77309 |
---|---|
Coefficient of variation (CV) | 0.0048844944 |
Kurtosis | 0.5607224 |
Mean | 26363.647 |
Median Absolute Deviation (MAD) | 90 |
Skewness | 0.63410638 |
Sum | 2.6363647 × 10⁸ |
Variance | 16582.508 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
26230 | 1398 | |
26350 | 1395 | |
26260 | 1063 | |
26470 | 901 | |
26410 | 860 | |
26440 | 763 | |
26380 | 684 | |
26500 | 642 | |
26290 | 544 | 5.4% |
26710 | 495 | 5.0% |
Other values (6) | 1255 |
Value | Count | Frequency (%) |
26110 | 185 | 1.8% |
26140 | 155 | 1.6% |
26170 | 173 | 1.7% |
26200 | 147 | 1.5% |
26230 | 1398 | |
26260 | 1063 | |
26290 | 544 | 5.4% |
26320 | 323 | 3.2% |
26350 | 1395 | |
26380 | 684 |
Value | Count | Frequency (%) |
26710 | 495 | 5.0% |
26530 | 272 | 2.7% |
26500 | 642 | |
26470 | 901 | |
26440 | 763 | |
26410 | 860 | |
26380 | 684 | |
26350 | 1395 | |
26320 | 323 | 3.2% |
26290 | 544 | 5.4% |
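ldcode is profiled as a real number, but its 16 distinct values behave like district codes, each paired with one ldcodenm value. Statistics such as the mean or skewness are therefore of limited use, and treating the column as a categorical key may be more appropriate. The sketch below checks that each code maps to exactly one name; `is_functional_mapping` is a hypothetical helper, and the pairs are copied from the sample rows of this report.

```python
def is_functional_mapping(pairs):
    """True if every code in the (code, name) pairs maps to exactly one name."""
    seen = {}
    for code, name in pairs:
        # setdefault stores the first name seen for a code and returns it afterwards.
        if seen.setdefault(code, name) != name:
            return False
    return True

# (ldcode, ldcodenm) pairs from the sample rows.
pairs = [
    (26470, "부산광역시 연제구"),
    (26350, "부산광역시 해운대구"),
    (26410, "부산광역시 금정구"),
    (26230, "부산광역시 부산진구"),
    (26410, "부산광역시 금정구"),
    (26320, "부산광역시 북구"),
]
print(is_functional_mapping(pairs))
```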
ldcodenm
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.1413 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 연제구 |
---|---|
2nd row | 부산광역시 해운대구 |
3rd row | 부산광역시 금정구 |
4th row | 부산광역시 부산진구 |
5th row | 부산광역시 금정구 |
Common Values
Value | Count | Frequency (%) |
부산광역시 부산진구 | 1398 | |
부산광역시 해운대구 | 1395 | |
부산광역시 동래구 | 1063 | |
부산광역시 연제구 | 901 | |
부산광역시 금정구 | 860 | |
부산광역시 강서구 | 763 | |
부산광역시 사하구 | 684 | |
부산광역시 수영구 | 642 | |
부산광역시 남구 | 544 | 5.4% |
부산광역시 기장군 | 495 | 5.0% |
Other values (6) | 1255 |
Words
Value | Count | Frequency (%) |
부산광역시 | 10000 | |
부산진구 | 1398 | 7.0% |
해운대구 | 1395 | 7.0% |
동래구 | 1063 | 5.3% |
연제구 | 901 | 4.5% |
금정구 | 860 | 4.3% |
강서구 | 763 | 3.8% |
사하구 | 684 | 3.4% |
수영구 | 642 | 3.2% |
남구 | 544 | 2.7% |
Other values (7) | 1750 | 8.8% |
ofcpssecode
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.8097 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 4 |
3rd row | 1 |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
1 | 3687 | 36.9% |
4 | 3592 | 35.9% |
<NA> | 2699 | 27.0% |
3 | 16 | 0.2% |
2 | 6 | 0.1% |
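The report lists Missing as 0 for ofcpssecode, yet `<NA>` appears as a category with 2699 rows and the maximum length is 4. This suggests the placeholder was loaded as a literal string rather than as a missing value. The sketch below recounts missing values under that assumption; the `SENTINELS` set and `count_missing` are hypothetical and would need to be adjusted to the actual source file.

```python
# Placeholder spellings assumed to stand for a missing value.
SENTINELS = {"<NA>", "na", ""}

def count_missing(values, sentinels=SENTINELS):
    """Count entries that are None or match one of the placeholder strings."""
    return sum(1 for v in values if v is None or v.strip() in sentinels)

# Rebuild the column from the counts in the Common Values table above.
column = ["1"] * 3687 + ["4"] * 3592 + ["<NA>"] * 2699 + ["3"] * 16 + ["2"] * 6

missing = count_missing(column)
print(missing, f"{100 * missing / len(column):.1f}%")
```

Under this reading, roughly 27% of ofcpssecode (and likewise ofcpssecodenm) is missing, close to the missing rate reported for bsnmcmpnm and jurirno.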
ofcpssecodenm
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.5398 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | 일반 |
3rd row | 대표 |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
대표 | 3687 | 36.9% |
일반 | 3592 | 35.9% |
<NA> | 2699 | 27.0% |
이사 | 16 | 0.2% |
감사 | 6 | 0.1% |
last_load_dttm
Date
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-12-23 12:10:31 |
---|---|
Maximum | 2020-12-23 12:10:32 |
Variable | brkrasortcode | brkrasortcodenm | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|
brkrasortcode | 1.000 | 1.000 | 0.243 | 0.311 | 0.577 | 0.577 | 0.043 |
brkrasortcodenm | 1.000 | 1.000 | 0.243 | 0.311 | 0.577 | 0.577 | 0.043 |
ldcode | 0.243 | 0.243 | 1.000 | 1.000 | 0.216 | 0.216 | 0.996 |
ldcodenm | 0.311 | 0.311 | 1.000 | 1.000 | 0.254 | 0.254 | 0.996 |
ofcpssecode | 0.577 | 0.577 | 0.216 | 0.254 | 1.000 | 1.000 | 0.022 |
ofcpssecodenm | 0.577 | 0.577 | 0.216 | 0.254 | 1.000 | 1.000 | 0.022 |
last_load_dttm | 0.043 | 0.043 | 0.996 | 0.996 | 0.022 | 0.022 | 1.000 |
Variable | ofcpssecodenm | ldcodenm | brkrasortcode | ofcpssecode | brkrasortcodenm |
---|---|---|---|---|---|
ofcpssecodenm | 1.000 | 0.122 | 0.587 | 1.000 | 0.587 |
ldcodenm | 0.122 | 1.000 | 0.150 | 0.122 | 0.150 |
brkrasortcode | 0.587 | 0.150 | 1.000 | 0.587 | 1.000 |
ofcpssecode | 1.000 | 0.122 | 0.587 | 1.000 | 0.587 |
brkrasortcodenm | 0.587 | 0.150 | 1.000 | 0.587 | 1.000 |
Variable | ldcode | brkrasortcode | brkrasortcodenm | ldcodenm | ofcpssecode | ofcpssecodenm |
---|---|---|---|---|---|---|
ldcode | 1.000 | 0.110 | 0.110 | 1.000 | 0.097 | 0.097 |
brkrasortcode | 0.110 | 1.000 | 1.000 | 0.150 | 0.587 | 0.587 |
brkrasortcodenm | 0.110 | 1.000 | 1.000 | 0.150 | 0.587 | 0.587 |
ldcodenm | 1.000 | 0.150 | 0.150 | 1.000 | 0.122 | 0.122 |
ofcpssecode | 0.097 | 0.587 | 0.587 | 0.122 | 1.000 | 1.000 |
ofcpssecodenm | 0.097 | 0.587 | 0.587 | 0.122 | 1.000 | 1.000 |
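The 1.000 entries between each code column and its name column (brkrasortcode and brkrasortcodenm, ofcpssecode and ofcpssecodenm, ldcode and ldcodenm) are what one would expect when one column is a relabeling of the other. The sketch below computes a plain, uncorrected Cramér's V to illustrate this. Which association statistic the report uses for each matrix, and whether it applies a bias correction, is an assumption not confirmed by this document; `cramers_v` is a hypothetical helper.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_n in row_totals.items():
        for y, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Rebuild brkrasortcode and brkrasortcodenm from their Common Values counts.
name_of = {"2": "공인중개사", "4": "중개보조원", "1": "중개인", "3": "법인"}
codes = ["2"] * 6541 + ["4"] * 3256 + ["1"] * 201 + ["3"] * 2
names = [name_of[c] for c in codes]

v = cramers_v(codes, names)
print(round(v, 3))
```

Because each such pair carries the same information twice, one column of the pair can usually be dropped before modeling.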
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
16674 | 4 | 중개보조원 | 김영홍 | 왕우공인중개사사무소 | <NA> | <NA> | 26470-2020-00141 | 2020-12-22 | 26470 | 부산광역시 연제구 | 4 | 일반 | 2020-12-23 12:10:32 |
8221 | 2 | 공인중개사 | 최미숙 | The파크공인중개사사무소 | 2014-10-02 | 10-00771 | 가-10-3423 | 2020-12-22 | 26350 | 부산광역시 해운대구 | 4 | 일반 | 2020-12-23 12:10:31 |
13614 | 2 | 공인중개사 | 강수영 | 소정공인중개사사무소 | 2012-12-10 | 23-00361 | 가-11-1977 | 2020-12-22 | 26410 | 부산광역시 금정구 | 1 | 대표 | 2020-12-23 12:10:32 |
3749 | 4 | 중개보조원 | 김석제 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26230 | 부산광역시 부산진구 | <NA> | <NA> | 2020-12-23 12:10:31 |
13291 | 2 | 공인중개사 | 김우용 | <NA> | 1985-11-18 | 5119 | <NA> | 2020-12-22 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2020-12-23 12:10:32 |
7739 | 2 | 공인중개사 | 김성찬 | 명품공인중개사사무소 | 2007-12-17 | 18-170 | 26320-2019-00052 | 2020-12-22 | 26320 | 부산광역시 북구 | 1 | 대표 | 2020-12-23 12:10:31 |
5972 | 2 | 공인중개사 | 김명숙 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2020-12-23 12:10:31 |
14858 | 2 | 공인중개사 | 김재순 | 토박이공인중개사사무소 | 2016-12-12 | 26-2016-02198 | 26440-2020-00094 | 2020-12-22 | 26440 | 부산광역시 강서구 | 1 | 대표 | 2020-12-23 12:10:32 |
18790 | 2 | 공인중개사 | 최정아 | 해뜨는 공인중개사사무소 | 2008-12-15 | 제19-342호 | 가-16-1236 | 2020-12-22 | 26710 | 부산광역시 기장군 | 1 | 대표 | 2020-12-23 12:10:32 |
13462 | 4 | 중개보조원 | 김동은 | 금정더샵공인중개사사무소 | <NA> | <NA> | 26410-2019-00076 | 2020-12-22 | 26410 | 부산광역시 금정구 | 4 | 일반 | 2020-12-23 12:10:32 |
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
16840 | 4 | 중개보조원 | 공용진 | (주)온나라부동산중개법인 | <NA> | <NA> | 26470-2016-00066 | 2020-12-22 | 26470 | 부산광역시 연제구 | 4 | 일반 | 2020-12-23 12:10:32 |
10948 | 2 | 공인중개사 | 이혜정 | <NA> | 2008-10-26 | 19-434 | <NA> | 2020-12-22 | 26380 | 부산광역시 사하구 | <NA> | <NA> | 2020-12-23 12:10:32 |
102 | 1 | 중개인 | 이상화 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26110 | 부산광역시 중구 | <NA> | <NA> | 2020-12-23 12:10:31 |
8898 | 2 | 공인중개사 | 이수연 | 앳홈공인중개사사무소 | 2015-12-07 | 31-2015-00337 | 26350-2020-00148 | 2020-12-22 | 26350 | 부산광역시 해운대구 | 1 | 대표 | 2020-12-23 12:10:31 |
3186 | 2 | 공인중개사 | 박성진 | 서면1번지부동산공인중개사사무소 | 2017-12-11 | 26-2017-01219 | 26230-2018-00083 | 2020-12-22 | 26230 | 부산광역시 부산진구 | 1 | 대표 | 2020-12-23 12:10:31 |
4844 | 2 | 공인중개사 | 이병동 | 장군공인중개사사무소 | 1985-11-08 | 부산광역시 - 3175 | 26260-2015-00084 | 2020-12-22 | 26260 | 부산광역시 동래구 | 1 | 대표 | 2020-12-23 12:10:31 |
1782 | 2 | 공인중개사 | 조영옥 | 빛나라공인중개사사무소 | 2017-12-11 | 26-2017-00867(부산) | 26230-2018-00091 | 2020-12-22 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2020-12-23 12:10:31 |
8105 | 2 | 공인중개사 | 최용희 | <NA> | 2000-11-20 | 부산 12 | <NA> | 2020-12-22 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2020-12-23 12:10:31 |
15415 | 4 | 중개보조원 | 정회연 | 세종공인중개사사무소 | <NA> | <NA> | 가-13-2104 | 2020-12-22 | 26470 | 부산광역시 연제구 | 4 | 일반 | 2020-12-23 12:10:32 |
3027 | 4 | 중개보조원 | 박한수 | 해피공인중개사사무소 | <NA> | <NA> | 26230-2017-00103 | 2020-12-22 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2020-12-23 12:10:31 |
Most frequently occurring
Index | brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2 | 공인중개사 | 권의현 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2020-12-23 12:10:32 | 2 |
1 | 2 | 공인중개사 | 박성진 | 법무공인중개사사무소 | 2011-12-19 | 부산22-00193 | 가-14-1196 | 2020-12-22 | 26500 | 부산광역시 수영구 | 1 | 대표 | 2020-12-23 12:10:32 | 2 |
2 | 2 | 공인중개사 | 박창호 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2020-12-23 12:10:31 | 2 |
3 | 2 | 공인중개사 | 한영수 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2020-12-23 12:10:31 | 2 |
4 | 4 | 중개보조원 | 이기옥 | <NA> | <NA> | <NA> | <NA> | 2020-12-22 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2020-12-23 12:10:32 | 2 |
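Each of the five rows above occurs twice in the dataset. Such rows can be found by counting whole records. The following is a minimal sketch with the standard library; `duplicate_rows` is a hypothetical helper, and the rows are shortened to three fields (brkrasortcode, brkrnm, ldcode) for readability.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return {row: count} for every row that occurs more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Shortened records: (brkrasortcode, brkrnm, ldcode).
rows = [
    ("2", "권의현", 26410),
    ("2", "권의현", 26410),
    ("2", "박창호", 26260),
    ("4", "이기옥", 26410),
]
dupes = duplicate_rows(rows)
print(dupes)
```

Most of the duplicated records have no office or license information, so they may be distinct people who share a name and district rather than true duplicates; that cannot be decided from this report alone.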