Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 16653 |
Missing cells (%) | 12.8% |
Duplicate rows | 5 |
Duplicate rows (%) | 0.1% |
Total size in memory | 1.1 MiB |
Average record size in memory | 115.0 B |
Variable types
Categorical | 7 |
---|---|
Text | 5 |
Numeric | 1 |
Dataset
Description | 2021-04-01 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901062201148000 |
lastupdtdt has constant value "" | Constant |
Dataset has 5 (0.1%) duplicate rows | Duplicates |
ofcpssecodenm is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
ldcodenm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcode is highly overall correlated with brkrasortcodenm and 2 other fields | High correlation |
ofcpssecode is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
last_load_dttm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcodenm is highly overall correlated with brkrasortcode and 2 other fields | High correlation |
ldcode is highly overall correlated with ldcodenm and 1 other fields | High correlation |
bsnmcmpnm has 4168 (41.7%) missing values | Missing |
crqfcacqdt has 4207 (42.1%) missing values | Missing |
crqfcno has 4110 (41.1%) missing values | Missing |
jurirno has 4168 (41.7%) missing values | Missing |
Reproduction
Analysis started | 2024-04-16 10:25:29.899498 |
---|---|
Analysis finished | 2024-04-16 10:25:31.539409 |
Duration | 1.64 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
brkrasortcode
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | |
---|---|
4 | |
1 | 215 |
3 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 4 |
---|---|
2nd row | 2 |
3rd row | 4 |
4th row | 2 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
2 | 6507 | |
4 | 3277 | |
1 | 215 | 2.1% |
3 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 6507 | |
4 | 3277 | |
1 | 215 | 2.1% |
3 | 1 | < 0.1% |
brkrasortcodenm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
공인중개사 | |
---|---|
중개보조원 | |
중개인 | 215 |
법인 | 1 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9567 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 중개보조원 |
---|---|
2nd row | 공인중개사 |
3rd row | 중개보조원 |
4th row | 공인중개사 |
5th row | 중개보조원 |
Common Values
Value | Count | Frequency (%) |
공인중개사 | 6507 | |
중개보조원 | 3277 | |
중개인 | 215 | 2.1% |
법인 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공인중개사 | 6507 | |
중개보조원 | 3277 | |
중개인 | 215 | 2.1% |
법인 | 1 | < 0.1% |
brkrnm
Text
Distinct | 7938 |
---|---|
Distinct (%) | 79.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
김정숙 | 15 | 0.1% |
김미경 | 13 | 0.1% |
김영희 | 13 | 0.1% |
김정희 | 12 | 0.1% |
김미숙 | 11 | 0.1% |
이영주 | 11 | 0.1% |
김경희 | 10 | 0.1% |
정영희 | 10 | 0.1% |
김명희 | 10 | 0.1% |
이정숙 | 9 | 0.1% |
Other values (7934) | 9892 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 2134 | 7.1% |
이 | 1522 | 5.1% |
정 | 1336 | 4.4% |
영 | 1022 | 3.4% |
박 | 950 | 3.2% |
희 | 683 | 2.3% |
경 | 602 | 2.0% |
성 | 538 | 1.8% |
숙 | 536 | 1.8% |
미 | 522 | 1.7% |
Other values (381) | 20287 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30029 | |
Open Punctuation | 37 | 0.1% |
Close Punctuation | 37 | 0.1% |
Uppercase Letter | 13 | < 0.1% |
Lowercase Letter | 10 | < 0.1% |
Space Separator | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 2134 | 7.1% |
이 | 1522 | 5.1% |
정 | 1336 | 4.4% |
영 | 1022 | 3.4% |
박 | 950 | 3.2% |
희 | 683 | 2.3% |
경 | 602 | 2.0% |
성 | 538 | 1.8% |
숙 | 536 | 1.8% |
미 | 522 | 1.7% |
Other values (362) | 20184 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 2 | |
A | 2 | |
Y | 2 | |
T | 2 | |
E | 1 | |
H | 1 | |
L | 1 | |
C | 1 | |
S | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4 | |
g | 1 | 10.0% |
a | 1 | 10.0% |
s | 1 | 10.0% |
y | 1 | 10.0% |
u | 1 | 10.0% |
n | 1 | 10.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 |
Close Punctuation
Value | Count | Frequency (%) |
) | 37 |
Space Separator
Value | Count | Frequency (%) |
6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29919 | |
Han | 110 | 0.4% |
Common | 80 | 0.3% |
Latin | 23 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 2134 | 7.1% |
이 | 1522 | 5.1% |
정 | 1336 | 4.5% |
영 | 1022 | 3.4% |
박 | 950 | 3.2% |
희 | 683 | 2.3% |
경 | 602 | 2.0% |
성 | 538 | 1.8% |
숙 | 536 | 1.8% |
미 | 522 | 1.7% |
Other values (283) | 20074 |
Han
Value | Count | Frequency (%) |
金 | 9 | 8.2% |
崔 | 4 | 3.6% |
順 | 3 | 2.7% |
榮 | 3 | 2.7% |
子 | 3 | 2.7% |
鄭 | 3 | 2.7% |
李 | 3 | 2.7% |
朴 | 2 | 1.8% |
永 | 2 | 1.8% |
東 | 2 | 1.8% |
Other values (69) | 76 |
Latin
Value | Count | Frequency (%) |
e | 4 | |
N | 2 | 8.7% |
A | 2 | 8.7% |
Y | 2 | 8.7% |
T | 2 | 8.7% |
E | 1 | 4.3% |
H | 1 | 4.3% |
L | 1 | 4.3% |
C | 1 | 4.3% |
g | 1 | 4.3% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
( | 37 | |
) | 37 | |
6 | 7.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29919 | |
CJK | 104 | 0.3% |
ASCII | 103 | 0.3% |
CJK Compat Ideographs | 6 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 2134 | 7.1% |
이 | 1522 | 5.1% |
정 | 1336 | 4.5% |
영 | 1022 | 3.4% |
박 | 950 | 3.2% |
희 | 683 | 2.3% |
경 | 602 | 2.0% |
성 | 538 | 1.8% |
숙 | 536 | 1.8% |
미 | 522 | 1.7% |
Other values (283) | 20074 |
ASCII
Value | Count | Frequency (%) |
( | 37 | |
) | 37 | |
6 | 5.8% | |
e | 4 | 3.9% |
N | 2 | 1.9% |
A | 2 | 1.9% |
Y | 2 | 1.9% |
T | 2 | 1.9% |
E | 1 | 1.0% |
H | 1 | 1.0% |
Other values (9) | 9 | 8.7% |
CJK
Value | Count | Frequency (%) |
金 | 9 | 8.7% |
崔 | 4 | 3.8% |
順 | 3 | 2.9% |
榮 | 3 | 2.9% |
子 | 3 | 2.9% |
鄭 | 3 | 2.9% |
朴 | 2 | 1.9% |
永 | 2 | 1.9% |
東 | 2 | 1.9% |
文 | 2 | 1.9% |
Other values (65) | 71 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 3 | |
連 | 1 | 16.7% |
梁 | 1 | 16.7% |
林 | 1 | 16.7% |
bsnmcmpnm
Text
MISSING
 
Distinct | 2711 |
---|---|
Distinct (%) | 46.5% |
Missing | 4168 |
Missing (%) | 41.7% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 22 |
Mean length | 11.356653 |
Min length | 4 |
Characters and Unicode
Total characters | 66232 |
---|---|
Distinct characters | 543 |
Distinct categories | 10 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 1611 ? |
---|---|
Unique (%) | 27.6% |
Sample
1st row | 나이스부동산공인중개사사무소 |
---|---|
2nd row | 대한공인중개사사무소 |
3rd row | 럭키합동공인중개사사무소 |
4th row | 주식회사 부동산중개법인 더트럼프 |
5th row | 아주공인중개사사무소 |
Value | Count | Frequency (%) |
주식회사 | 140 | 2.3% |
공인중개사사무소 | 59 | 1.0% |
주)부동산중개법인개벽 | 45 | 0.7% |
사무소 | 42 | 0.7% |
현대공인중개사사무소 | 35 | 0.6% |
대명합동공인중개사사무소 | 32 | 0.5% |
삼오부동산중개법인 | 31 | 0.5% |
조은부동산중개 | 30 | 0.5% |
주)온나라부동산중개법인 | 29 | 0.5% |
삼성공인중개사사무소 | 26 | 0.4% |
Other values (2709) | 5679 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 9854 | |
개 | 5877 | 8.9% |
중 | 5843 | 8.8% |
소 | 5121 | 7.7% |
무 | 5096 | 7.7% |
인 | 4941 | 7.5% |
공 | 4535 | 6.8% |
동 | 2621 | 4.0% |
부 | 2396 | 3.6% |
산 | 2391 | 3.6% |
Other values (533) | 17557 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 64294 | |
Uppercase Letter | 743 | 1.1% |
Space Separator | 393 | 0.6% |
Decimal Number | 249 | 0.4% |
Close Punctuation | 194 | 0.3% |
Open Punctuation | 194 | 0.3% |
Lowercase Letter | 136 | 0.2% |
Other Punctuation | 23 | < 0.1% |
Dash Punctuation | 4 | < 0.1% |
Letter Number | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 9854 | |
개 | 5877 | 9.1% |
중 | 5843 | 9.1% |
소 | 5121 | 8.0% |
무 | 5096 | 7.9% |
인 | 4941 | 7.7% |
공 | 4535 | 7.1% |
동 | 2621 | 4.1% |
부 | 2396 | 3.7% |
산 | 2391 | 3.7% |
Other values (479) | 15619 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 123 | |
S | 78 | |
L | 68 | 9.2% |
T | 64 | 8.6% |
C | 53 | 7.1% |
B | 42 | 5.7% |
H | 39 | 5.2% |
G | 32 | 4.3% |
O | 30 | 4.0% |
W | 29 | 3.9% |
Other values (13) | 185 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 63 | |
h | 22 | 16.2% |
w | 10 | 7.4% |
t | 10 | 7.4% |
s | 8 | 5.9% |
c | 7 | 5.1% |
k | 6 | 4.4% |
b | 5 | 3.7% |
y | 2 | 1.5% |
i | 1 | 0.7% |
Other values (2) | 2 | 1.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 96 | |
2 | 37 | 14.9% |
4 | 31 | 12.4% |
8 | 28 | 11.2% |
3 | 24 | 9.6% |
9 | 19 | 7.6% |
5 | 6 | 2.4% |
6 | 4 | 1.6% |
7 | 3 | 1.2% |
0 | 1 | 0.4% |
Other Punctuation
Value | Count | Frequency (%) |
& | 20 | |
· | 1 | 4.3% |
# | 1 | 4.3% |
. | 1 | 4.3% |
Space Separator
Value | Count | Frequency (%) |
393 |
Close Punctuation
Value | Count | Frequency (%) |
) | 194 |
Open Punctuation
Value | Count | Frequency (%) |
( | 194 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 64286 | |
Common | 1057 | 1.6% |
Latin | 881 | 1.3% |
Han | 8 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 9854 | |
개 | 5877 | 9.1% |
중 | 5843 | 9.1% |
소 | 5121 | 8.0% |
무 | 5096 | 7.9% |
인 | 4941 | 7.7% |
공 | 4535 | 7.1% |
동 | 2621 | 4.1% |
부 | 2396 | 3.7% |
산 | 2391 | 3.7% |
Other values (471) | 15611 |
Latin
Value | Count | Frequency (%) |
K | 123 | |
S | 78 | 8.9% |
L | 68 | 7.7% |
T | 64 | 7.3% |
e | 63 | 7.2% |
C | 53 | 6.0% |
B | 42 | 4.8% |
H | 39 | 4.4% |
G | 32 | 3.6% |
O | 30 | 3.4% |
Other values (26) | 289 |
Common
Value | Count | Frequency (%) |
393 | ||
) | 194 | |
( | 194 | |
1 | 96 | 9.1% |
2 | 37 | 3.5% |
4 | 31 | 2.9% |
8 | 28 | 2.6% |
3 | 24 | 2.3% |
& | 20 | 1.9% |
9 | 19 | 1.8% |
Other values (8) | 21 | 2.0% |
Han
Value | Count | Frequency (%) |
本 | 1 | |
利 | 1 | |
太 | 1 | |
甲 | 1 | |
堂 | 1 | |
明 | 1 | |
秀 | 1 | |
福 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 64286 | |
ASCII | 1935 | 2.9% |
CJK | 8 | < 0.1% |
Number Forms | 2 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 9854 | |
개 | 5877 | 9.1% |
중 | 5843 | 9.1% |
소 | 5121 | 8.0% |
무 | 5096 | 7.9% |
인 | 4941 | 7.7% |
공 | 4535 | 7.1% |
동 | 2621 | 4.1% |
부 | 2396 | 3.7% |
산 | 2391 | 3.7% |
Other values (471) | 15611 |
ASCII
Value | Count | Frequency (%) |
393 | ||
) | 194 | 10.0% |
( | 194 | 10.0% |
K | 123 | 6.4% |
1 | 96 | 5.0% |
S | 78 | 4.0% |
L | 68 | 3.5% |
T | 64 | 3.3% |
e | 63 | 3.3% |
C | 53 | 2.7% |
Other values (42) | 609 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 2 |
None
Value | Count | Frequency (%) |
· | 1 |
CJK
Value | Count | Frequency (%) |
本 | 1 | |
利 | 1 | |
太 | 1 | |
甲 | 1 | |
堂 | 1 | |
明 | 1 | |
秀 | 1 | |
福 | 1 |
crqfcacqdt
Text
MISSING
 
Distinct | 632 |
---|---|
Distinct (%) | 10.9% |
Missing | 4207 |
Missing (%) | 42.1% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9993095 |
Min length | 8 |
Characters and Unicode
Total characters | 57926 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 401 ? |
---|---|
Unique (%) | 6.9% |
Sample
1st row | 1985-10-25 |
---|---|
2nd row | 2013-10-27 |
3rd row | 2016-10-29 |
4th row | 2003-11-07 |
5th row | 2021-03-22 |
Value | Count | Frequency (%) |
2005-07-20 | 391 | 6.7% |
2017-12-11 | 332 | 5.7% |
2016-12-12 | 321 | 5.5% |
2019-12-09 | 238 | 4.1% |
2003-11-07 | 198 | 3.4% |
2015-12-09 | 197 | 3.4% |
2018-12-10 | 183 | 3.2% |
2005-12-12 | 182 | 3.1% |
2001-12-10 | 160 | 2.8% |
2007-12-17 | 150 | 2.6% |
Other values (622) | 3441 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 13253 | |
0 | 12042 | |
- | 11582 | |
2 | 11067 | |
9 | 2650 | 4.6% |
5 | 1685 | 2.9% |
7 | 1613 | 2.8% |
8 | 1394 | 2.4% |
3 | 1072 | 1.9% |
6 | 938 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 46344 | |
Dash Punctuation | 11582 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 13253 | |
0 | 12042 | |
2 | 11067 | |
9 | 2650 | 5.7% |
5 | 1685 | 3.6% |
7 | 1613 | 3.5% |
8 | 1394 | 3.0% |
3 | 1072 | 2.3% |
6 | 938 | 2.0% |
4 | 630 | 1.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11582 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 57926 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 13253 | |
0 | 12042 | |
- | 11582 | |
2 | 11067 | |
9 | 2650 | 4.6% |
5 | 1685 | 2.9% |
7 | 1613 | 2.8% |
8 | 1394 | 2.4% |
3 | 1072 | 1.9% |
6 | 938 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 57926 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 13253 | |
0 | 12042 | |
- | 11582 | |
2 | 11067 | |
9 | 2650 | 4.6% |
5 | 1685 | 2.9% |
7 | 1613 | 2.8% |
8 | 1394 | 2.4% |
3 | 1072 | 1.9% |
6 | 938 | 1.6% |
crqfcno
Text
MISSING
 
Distinct | 5699 |
---|---|
Distinct (%) | 96.8% |
Missing | 4110 |
Missing (%) | 41.1% |
Memory size | 156.2 KiB |
Length
Max length | 21 |
---|---|
Median length | 19 |
Mean length | 9.2239389 |
Min length | 1 |
Characters and Unicode
Total characters | 54329 |
---|---|
Distinct characters | 60 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 5526 ? |
---|---|
Unique (%) | 93.8% |
Sample
1st row | 410 |
---|---|
2nd row | 24-0646(부산) |
3rd row | 26-2016-01102(부산) |
4th row | [부산]449 |
5th row | 26-2020-00828 |
Value | Count | Frequency (%) |
부산 | 355 | 5.5% |
부산시 | 49 | 0.8% |
부산광역시장 | 24 | 0.4% |
부산광역시 | 22 | 0.3% |
경남 | 13 | 0.2% |
울산 | 9 | 0.1% |
1154 | 5 | 0.1% |
경기도 | 4 | 0.1% |
제 | 4 | 0.1% |
6 | 4 | 0.1% |
Other values (5657) | 5918 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8812 | |
2 | 6923 | |
1 | 6609 | |
- | 6089 | |
6 | 3851 | 7.1% |
3 | 2427 | 4.5% |
4 | 2427 | 4.5% |
8 | 2233 | 4.1% |
5 | 2176 | 4.0% |
9 | 2168 | 4.0% |
Other values (50) | 10614 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39793 | |
Other Letter | 6483 | 11.9% |
Dash Punctuation | 6089 | 11.2% |
Open Punctuation | 711 | 1.3% |
Close Punctuation | 711 | 1.3% |
Space Separator | 524 | 1.0% |
Other Punctuation | 17 | < 0.1% |
Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 1954 | |
부 | 1935 | |
호 | 498 | 7.7% |
제 | 439 | 6.8% |
시 | 368 | 5.7% |
광 | 281 | 4.3% |
역 | 280 | 4.3% |
장 | 161 | 2.5% |
경 | 123 | 1.9% |
남 | 94 | 1.4% |
Other values (29) | 350 | 5.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 8812 | |
2 | 6923 | |
1 | 6609 | |
6 | 3851 | |
3 | 2427 | 6.1% |
4 | 2427 | 6.1% |
8 | 2233 | 5.6% |
5 | 2176 | 5.5% |
9 | 2168 | 5.4% |
7 | 2167 | 5.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 6 | |
, | 6 | |
: | 3 | |
? | 2 | 11.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 658 | |
[ | 53 | 7.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 658 | |
] | 53 | 7.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6089 |
Space Separator
Value | Count | Frequency (%) |
524 |
Uppercase Letter
Value | Count | Frequency (%) |
Ы | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 47845 | |
Hangul | 6483 | 11.9% |
Cyrillic | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 1954 | |
부 | 1935 | |
호 | 498 | 7.7% |
제 | 439 | 6.8% |
시 | 368 | 5.7% |
광 | 281 | 4.3% |
역 | 280 | 4.3% |
장 | 161 | 2.5% |
경 | 123 | 1.9% |
남 | 94 | 1.4% |
Other values (29) | 350 | 5.4% |
Common
Value | Count | Frequency (%) |
0 | 8812 | |
2 | 6923 | |
1 | 6609 | |
- | 6089 | |
6 | 3851 | |
3 | 2427 | 5.1% |
4 | 2427 | 5.1% |
8 | 2233 | 4.7% |
5 | 2176 | 4.5% |
9 | 2168 | 4.5% |
Other values (10) | 4130 |
Cyrillic
Value | Count | Frequency (%) |
Ы | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 47843 | |
Hangul | 6483 | 11.9% |
None | 2 | < 0.1% |
Cyrillic | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8812 | |
2 | 6923 | |
1 | 6609 | |
- | 6089 | |
6 | 3851 | |
3 | 2427 | 5.1% |
4 | 2427 | 5.1% |
8 | 2233 | 4.7% |
5 | 2176 | 4.5% |
9 | 2168 | 4.5% |
Other values (9) | 4128 |
Hangul
Value | Count | Frequency (%) |
산 | 1954 | |
부 | 1935 | |
호 | 498 | 7.7% |
제 | 439 | 6.8% |
시 | 368 | 5.7% |
광 | 281 | 4.3% |
역 | 280 | 4.3% |
장 | 161 | 2.5% |
경 | 123 | 1.9% |
남 | 94 | 1.4% |
Other values (29) | 350 | 5.4% |
None
Value | Count | Frequency (%) |
? | 2 |
Cyrillic
Value | Count | Frequency (%) |
Ы | 1 |
jurirno
Text
MISSING
 
Distinct | 3684 |
---|---|
Distinct (%) | 63.2% |
Missing | 4168 |
Missing (%) | 41.7% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 13.835219 |
Min length | 6 |
Characters and Unicode
Total characters | 80687 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 2609 ? |
---|---|
Unique (%) | 44.7% |
Sample
1st row | 26410-2019-00041 |
---|---|
2nd row | 26140-2017-00014 |
3rd row | 가-05-2297 |
4th row | 가-05-3636 |
5th row | 가-7-1843 |
Value | Count | Frequency (%) |
26470-2018-00085 | 45 | 0.8% |
26470-2015-00027 | 32 | 0.5% |
26230-2016-00137 | 31 | 0.5% |
26470-2016-00066 | 29 | 0.5% |
26470-2021-00017 | 29 | 0.5% |
26470-2018-00103 | 21 | 0.4% |
26230-2020-00171 | 19 | 0.3% |
26290-2017-00018 | 18 | 0.3% |
가-05-3566 | 18 | 0.3% |
26230-2016-00096 | 17 | 0.3% |
Other values (3677) | 5577 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 23155 | |
2 | 13049 | |
- | 11619 | |
1 | 8392 | 10.4% |
6 | 6151 | 7.6% |
3 | 3318 | 4.1% |
4 | 3289 | 4.1% |
5 | 2997 | 3.7% |
7 | 2806 | 3.5% |
9 | 2282 | 2.8% |
Other values (4) | 3629 | 4.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 67355 | |
Dash Punctuation | 11619 | 14.4% |
Other Letter | 1709 | 2.1% |
Space Separator | 4 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 23155 | |
2 | 13049 | |
1 | 8392 | 12.5% |
6 | 6151 | 9.1% |
3 | 3318 | 4.9% |
4 | 3289 | 4.9% |
5 | 2997 | 4.4% |
7 | 2806 | 4.2% |
9 | 2282 | 3.4% |
8 | 1916 | 2.8% |
Other Letter
Value | Count | Frequency (%) |
가 | 1689 | |
나 | 20 | 1.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11619 |
Space Separator
Value | Count | Frequency (%) |
4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 78978 | |
Hangul | 1709 | 2.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 23155 | |
2 | 13049 | |
- | 11619 | |
1 | 8392 | 10.6% |
6 | 6151 | 7.8% |
3 | 3318 | 4.2% |
4 | 3289 | 4.2% |
5 | 2997 | 3.8% |
7 | 2806 | 3.6% |
9 | 2282 | 2.9% |
Other values (2) | 1920 | 2.4% |
Hangul
Value | Count | Frequency (%) |
가 | 1689 | |
나 | 20 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 78978 | |
Hangul | 1709 | 2.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 23155 | |
2 | 13049 | |
- | 11619 | |
1 | 8392 | 10.6% |
6 | 6151 | 7.8% |
3 | 3318 | 4.2% |
4 | 3289 | 4.2% |
5 | 2997 | 3.8% |
7 | 2806 | 3.6% |
9 | 2282 | 2.9% |
Other values (2) | 1920 | 2.4% |
Hangul
Value | Count | Frequency (%) |
가 | 1689 | |
나 | 20 | 1.2% |
lastupdtdt
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-03-29 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-03-29 |
---|---|
2nd row | 2021-03-29 |
3rd row | 2021-03-29 |
4th row | 2021-03-29 |
5th row | 2021-03-29 |
Common Values
Value | Count | Frequency (%) |
2021-03-29 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-03-29 | 10000 |
ldcode
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26362.876 |
Minimum | 26110 |
---|---|
Maximum | 26710 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 26110 |
---|---|
5-th percentile | 26170 |
Q1 | 26260 |
median | 26350 |
Q3 | 26440 |
95-th percentile | 26530 |
Maximum | 26710 |
Range | 600 |
Interquartile range (IQR) | 180 |
Descriptive statistics
Standard deviation | 128.9436 |
---|---|
Coefficient of variation (CV) | 0.0048911051 |
Kurtosis | 0.50592679 |
Mean | 26362.876 |
Median Absolute Deviation (MAD) | 90 |
Skewness | 0.62322584 |
Sum | 2.6362876 × 108 |
Variance | 16626.451 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
26230 | 1454 | |
26350 | 1383 | |
26260 | 1041 | |
26470 | 948 | |
26410 | 819 | |
26440 | 763 | |
26380 | 668 | |
26500 | 620 | |
26290 | 514 | 5.1% |
26710 | 483 | 4.8% |
Other values (6) | 1307 |
Value | Count | Frequency (%) |
26110 | 173 | 1.7% |
26140 | 165 | 1.7% |
26170 | 181 | 1.8% |
26200 | 169 | 1.7% |
26230 | 1454 | |
26260 | 1041 | |
26290 | 514 | 5.1% |
26320 | 324 | 3.2% |
26350 | 1383 | |
26380 | 668 |
Value | Count | Frequency (%) |
26710 | 483 | 4.8% |
26530 | 295 | 2.9% |
26500 | 620 | |
26470 | 948 | |
26440 | 763 | |
26410 | 819 | |
26380 | 668 | |
26350 | 1383 | |
26320 | 324 | 3.2% |
26290 | 514 | 5.1% |
ldcodenm
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
부산광역시 부산진구 | |
---|---|
부산광역시 해운대구 | |
부산광역시 동래구 | |
부산광역시 연제구 | |
부산광역시 금정구 | |
Other values (11) |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.148 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 금정구 |
---|---|
2nd row | 부산광역시 서구 |
3rd row | 부산광역시 부산진구 |
4th row | 부산광역시 부산진구 |
5th row | 부산광역시 북구 |
Common Values
Value | Count | Frequency (%) |
부산광역시 부산진구 | 1454 | |
부산광역시 해운대구 | 1383 | |
부산광역시 동래구 | 1041 | |
부산광역시 연제구 | 948 | |
부산광역시 금정구 | 819 | |
부산광역시 강서구 | 763 | |
부산광역시 사하구 | 668 | |
부산광역시 수영구 | 620 | |
부산광역시 남구 | 514 | 5.1% |
부산광역시 기장군 | 483 | 4.8% |
Other values (6) | 1307 |
Length
Value | Count | Frequency (%) |
부산광역시 | 10000 | |
부산진구 | 1454 | 7.3% |
해운대구 | 1383 | 6.9% |
동래구 | 1041 | 5.2% |
연제구 | 948 | 4.7% |
금정구 | 819 | 4.1% |
강서구 | 763 | 3.8% |
사하구 | 668 | 3.3% |
수영구 | 620 | 3.1% |
남구 | 514 | 2.6% |
Other values (7) | 1790 | 8.9% |
ofcpssecode
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
4 | |
<NA> | |
3 | 16 |
2 | 10 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.7944 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 1 |
3rd row | 4 |
4th row | 2 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
1 | 3722 | |
4 | 3604 | |
<NA> | 2648 | |
3 | 16 | 0.2% |
2 | 10 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 3722 | |
4 | 3604 | |
na | 2648 | |
3 | 16 | 0.2% |
2 | 10 | 0.1% |
ofcpssecodenm
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
대표 | |
---|---|
일반 | |
<NA> | |
이사 | 16 |
감사 | 10 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.5296 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | 대표 |
3rd row | 일반 |
4th row | 감사 |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
대표 | 3722 | |
일반 | 3604 | |
<NA> | 2648 | |
이사 | 16 | 0.2% |
감사 | 10 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
대표 | 3722 | |
일반 | 3604 | |
na | 2648 | |
이사 | 16 | 0.2% |
감사 | 10 | 0.1% |
last_load_dttm
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-04-01 06:22:03 | |
---|---|
2021-04-01 06:22:04 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-04-01 06:22:04 |
---|---|
2nd row | 2021-04-01 06:22:03 |
3rd row | 2021-04-01 06:22:03 |
4th row | 2021-04-01 06:22:03 |
5th row | 2021-04-01 06:22:03 |
Common Values
Value | Count | Frequency (%) |
2021-04-01 06:22:03 | 5005 | |
2021-04-01 06:22:04 | 4995 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-04-01 | 10000 | |
06:22:03 | 5005 | |
06:22:04 | 4995 |
brkrasortcode | brkrasortcodenm | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|
brkrasortcode | 1.000 | 1.000 | 0.243 | 0.300 | 0.574 | 0.574 | 0.056 |
brkrasortcodenm | 1.000 | 1.000 | 0.243 | 0.300 | 0.574 | 0.574 | 0.056 |
ldcode | 0.243 | 0.243 | 1.000 | 1.000 | 0.215 | 0.215 | 0.995 |
ldcodenm | 0.300 | 0.300 | 1.000 | 1.000 | 0.255 | 0.255 | 0.995 |
ofcpssecode | 0.574 | 0.574 | 0.215 | 0.255 | 1.000 | 1.000 | 0.047 |
ofcpssecodenm | 0.574 | 0.574 | 0.215 | 0.255 | 1.000 | 1.000 | 0.047 |
last_load_dttm | 0.056 | 0.056 | 0.995 | 0.995 | 0.047 | 0.047 | 1.000 |
ofcpssecodenm | ldcodenm | brkrasortcode | ofcpssecode | last_load_dttm | brkrasortcodenm | |
---|---|---|---|---|---|---|
ofcpssecodenm | 1.000 | 0.122 | 0.584 | 1.000 | 0.031 | 0.584 |
ldcodenm | 0.122 | 1.000 | 0.145 | 0.122 | 0.941 | 0.145 |
brkrasortcode | 0.584 | 0.145 | 1.000 | 0.584 | 0.037 | 1.000 |
ofcpssecode | 1.000 | 0.122 | 0.584 | 1.000 | 0.031 | 0.584 |
last_load_dttm | 0.031 | 0.941 | 0.037 | 0.031 | 1.000 | 0.037 |
brkrasortcodenm | 0.584 | 0.145 | 1.000 | 0.584 | 0.037 | 1.000 |
ldcode | brkrasortcode | brkrasortcodenm | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|
ldcode | 1.000 | 0.112 | 0.112 | 1.000 | 0.097 | 0.097 | 0.937 |
brkrasortcode | 0.112 | 1.000 | 1.000 | 0.145 | 0.584 | 0.584 | 0.037 |
brkrasortcodenm | 0.112 | 1.000 | 1.000 | 0.145 | 0.584 | 0.584 | 0.037 |
ldcodenm | 1.000 | 0.145 | 0.145 | 1.000 | 0.122 | 0.122 | 0.941 |
ofcpssecode | 0.097 | 0.584 | 0.584 | 0.122 | 1.000 | 1.000 | 0.031 |
ofcpssecodenm | 0.097 | 0.584 | 0.584 | 0.122 | 1.000 | 1.000 | 0.031 |
last_load_dttm | 0.937 | 0.037 | 0.037 | 0.941 | 0.031 | 0.031 | 1.000 |
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
13385 | 4 | 중개보조원 | 김정숙 | 나이스부동산공인중개사사무소 | <NA> | <NA> | 26410-2019-00041 | 2021-03-29 | 26410 | 부산광역시 금정구 | 4 | 일반 | 2021-04-01 06:22:04 |
578 | 2 | 공인중개사 | 박상기 | 대한공인중개사사무소 | 1985-10-25 | 410 | 26140-2017-00014 | 2021-03-29 | 26140 | 부산광역시 서구 | 1 | 대표 | 2021-04-01 06:22:03 |
3341 | 4 | 중개보조원 | 문두홍 | 럭키합동공인중개사사무소 | <NA> | <NA> | 가-05-2297 | 2021-03-29 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-04-01 06:22:03 |
3390 | 2 | 공인중개사 | 김혜영 | 주식회사 부동산중개법인 더트럼프 | 2013-10-27 | 24-0646(부산) | 가-05-3636 | 2021-03-29 | 26230 | 부산광역시 부산진구 | 2 | 감사 | 2021-04-01 06:22:03 |
7603 | 4 | 중개보조원 | 유인규 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26320 | 부산광역시 북구 | 4 | 일반 | 2021-04-01 06:22:03 |
2758 | 2 | 공인중개사 | 신준호 | <NA> | 2016-10-29 | 26-2016-01102(부산) | <NA> | 2021-03-29 | 26230 | 부산광역시 부산진구 | 1 | 대표 | 2021-04-01 06:22:03 |
11334 | 2 | 공인중개사 | 이선미 | <NA> | 2003-11-07 | [부산]449 | <NA> | 2021-03-29 | 26380 | 부산광역시 사하구 | <NA> | <NA> | 2021-04-01 06:22:04 |
11885 | 4 | 중개보조원 | 김미숙 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26380 | 부산광역시 사하구 | 4 | 일반 | 2021-04-01 06:22:04 |
274 | 2 | 공인중개사 | 김외선 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26110 | 부산광역시 중구 | <NA> | <NA> | 2021-04-01 06:22:03 |
10001 | 2 | 공인중개사 | 김혜린 | <NA> | 2021-03-22 | 26-2020-00828 | <NA> | 2021-03-29 | 26350 | 부산광역시 해운대구 | 1 | 대표 | 2021-04-01 06:22:04 |
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3891 | 1 | 중개인 | 김동수 | 밀양부동산중개사무소 | <NA> | <NA> | 나-05-155 | 2021-03-29 | 26230 | 부산광역시 부산진구 | 1 | 대표 | 2021-04-01 06:22:03 |
11904 | 4 | 중개보조원 | 김복희 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26380 | 부산광역시 사하구 | 4 | 일반 | 2021-04-01 06:22:04 |
6927 | 4 | 중개보조원 | 박현숙 | W럭키부동산중개사무소 | <NA> | <NA> | 26290-2019-00051 | 2021-03-29 | 26290 | 부산광역시 남구 | 4 | 일반 | 2021-04-01 06:22:03 |
19512 | 4 | 중개보조원 | 윤병수 | 금나라부동산공인중개사사무소 | <NA> | <NA> | 26710-2015-00102 | 2021-03-29 | 26710 | 부산광역시 기장군 | 4 | 일반 | 2021-04-01 06:22:04 |
5791 | 2 | 공인중개사 | 김흥도 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-04-01 06:22:03 |
5288 | 2 | 공인중개사 | 안성희 | <NA> | 2017-12-11 | 26-2017-00352 | <NA> | 2021-03-29 | 26260 | 부산광역시 동래구 | 4 | 일반 | 2021-04-01 06:22:03 |
11629 | 2 | 공인중개사 | 박미숙 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26380 | 부산광역시 사하구 | <NA> | <NA> | 2021-04-01 06:22:04 |
1336 | 4 | 중개보조원 | 고윤미 | 신도 공인 중개사 사무소 | <NA> | <NA> | 가-4-391 | 2021-03-29 | 26200 | 부산광역시 영도구 | 4 | 일반 | 2021-04-01 06:22:03 |
10627 | 2 | 공인중개사 | 권은정 | 센텀에이스공인중개사사무소 | 2003-11-06 | 부산1486 | 가-10-1660 | 2021-03-29 | 26350 | 부산광역시 해운대구 | 1 | 대표 | 2021-04-01 06:22:04 |
6842 | 4 | 중개보조원 | 선정주 | 대우공인중개사사무소 | <NA> | <NA> | 가-7-1586 | 2021-03-29 | 26290 | 부산광역시 남구 | 4 | 일반 | 2021-04-01 06:22:03 |
Most frequently occurring
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2 | 공인중개사 | 김미정 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-04-01 06:22:03 | 2 |
1 | 2 | 공인중개사 | 김성수 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2021-04-01 06:22:04 | 2 |
2 | 2 | 공인중개사 | 박창호 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-04-01 06:22:03 | 2 |
3 | 4 | 중개보조원 | 김명자 | <NA> | <NA> | <NA> | <NA> | 2021-03-29 | 26230 | 부산광역시 부산진구 | <NA> | <NA> | 2021-04-01 06:22:03 | 2 |
4 | 4 | 중개보조원 | 최혜빈 | (주)고명부동산중개법인 | <NA> | <NA> | 가-12-1115 | 2021-03-29 | 26440 | 부산광역시 강서구 | 4 | 일반 | 2021-04-01 06:22:04 | 2 |