Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 13528 |
Missing cells (%) | 10.4% |
Duplicate rows | 4 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 1.1 MiB |
Average record size in memory | 115.0 B |
Variable types
Categorical | 7 |
---|---|
Text | 5 |
Numeric | 1 |
Dataset
Description | 2021-02-01 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901062201148000 |
lastupdtdt has constant value "" | Constant |
Dataset has 4 (< 0.1%) duplicate rows | Duplicates |
ofcpssecodenm is highly overall correlated with ofcpssecode | High correlation |
ldcodenm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcode is highly overall correlated with brkrasortcodenm | High correlation |
ofcpssecode is highly overall correlated with ofcpssecodenm | High correlation |
last_load_dttm is highly overall correlated with ldcode and 1 other fields | High correlation |
brkrasortcodenm is highly overall correlated with brkrasortcode | High correlation |
ldcode is highly overall correlated with ldcodenm and 1 other fields | High correlation |
bsnmcmpnm has 2712 (27.1%) missing values | Missing |
crqfcacqdt has 4104 (41.0%) missing values | Missing |
crqfcno has 4000 (40.0%) missing values | Missing |
jurirno has 2712 (27.1%) missing values | Missing |
Reproduction
Analysis started | 2024-04-16 10:25:46.930675 |
---|---|
Analysis finished | 2024-04-16 10:25:48.429954 |
Duration | 1.5 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
brkrasortcode
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | |
---|---|
4 | |
1 | 201 |
3 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 4 |
3rd row | 2 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
2 | 6621 | |
4 | 3174 | |
1 | 201 | 2.0% |
3 | 4 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 6621 | |
4 | 3174 | |
1 | 201 | 2.0% |
3 | 4 | < 0.1% |
brkrasortcodenm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
공인중개사 | |
---|---|
중개보조원 | |
중개인 | 201 |
법인 | 4 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9586 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공인중개사 |
---|---|
2nd row | 중개보조원 |
3rd row | 공인중개사 |
4th row | 중개보조원 |
5th row | 중개보조원 |
Common Values
Value | Count | Frequency (%) |
공인중개사 | 6621 | |
중개보조원 | 3174 | |
중개인 | 201 | 2.0% |
법인 | 4 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공인중개사 | 6621 | |
중개보조원 | 3174 | |
중개인 | 201 | 2.0% |
법인 | 4 | < 0.1% |
brkrnm
Text
Distinct | 7909 |
---|---|
Distinct (%) | 79.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
김영희 | 17 | 0.2% |
김정희 | 15 | 0.1% |
김선희 | 13 | 0.1% |
이영주 | 12 | 0.1% |
김정숙 | 12 | 0.1% |
이정희 | 11 | 0.1% |
정영희 | 11 | 0.1% |
김미숙 | 11 | 0.1% |
이미경 | 10 | 0.1% |
김미경 | 10 | 0.1% |
Other values (7909) | 9890 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 2197 | 7.3% |
이 | 1492 | 5.0% |
정 | 1344 | 4.5% |
영 | 1032 | 3.4% |
박 | 910 | 3.0% |
희 | 690 | 2.3% |
경 | 631 | 2.1% |
숙 | 552 | 1.8% |
미 | 542 | 1.8% |
수 | 542 | 1.8% |
Other values (364) | 20198 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30011 | |
Close Punctuation | 37 | 0.1% |
Open Punctuation | 37 | 0.1% |
Uppercase Letter | 22 | 0.1% |
Space Separator | 13 | < 0.1% |
Lowercase Letter | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 2197 | 7.3% |
이 | 1492 | 5.0% |
정 | 1344 | 4.5% |
영 | 1032 | 3.4% |
박 | 910 | 3.0% |
희 | 690 | 2.3% |
경 | 631 | 2.1% |
숙 | 552 | 1.8% |
미 | 542 | 1.8% |
수 | 542 | 1.8% |
Other values (343) | 20079 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 4 | |
Y | 3 | |
A | 3 | |
T | 3 | |
I | 3 | |
J | 1 | 4.5% |
E | 1 | 4.5% |
H | 1 | 4.5% |
S | 1 | 4.5% |
C | 1 | 4.5% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4 | |
g | 1 | 10.0% |
n | 1 | 10.0% |
u | 1 | 10.0% |
y | 1 | 10.0% |
s | 1 | 10.0% |
a | 1 | 10.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 37 |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 |
Space Separator
Value | Count | Frequency (%) |
13 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29902 | |
Han | 109 | 0.4% |
Common | 87 | 0.3% |
Latin | 32 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 2197 | 7.3% |
이 | 1492 | 5.0% |
정 | 1344 | 4.5% |
영 | 1032 | 3.5% |
박 | 910 | 3.0% |
희 | 690 | 2.3% |
경 | 631 | 2.1% |
숙 | 552 | 1.8% |
미 | 542 | 1.8% |
수 | 542 | 1.8% |
Other values (273) | 19970 |
Han
Value | Count | Frequency (%) |
金 | 10 | 9.2% |
李 | 7 | 6.4% |
鄭 | 4 | 3.7% |
崔 | 4 | 3.7% |
永 | 3 | 2.8% |
泰 | 3 | 2.8% |
慶 | 3 | 2.8% |
朴 | 2 | 1.8% |
守 | 2 | 1.8% |
東 | 2 | 1.8% |
Other values (60) | 69 |
Latin
Value | Count | Frequency (%) |
e | 4 | |
N | 4 | |
Y | 3 | 9.4% |
A | 3 | 9.4% |
T | 3 | 9.4% |
I | 3 | 9.4% |
J | 1 | 3.1% |
E | 1 | 3.1% |
H | 1 | 3.1% |
g | 1 | 3.1% |
Other values (8) | 8 |
Common
Value | Count | Frequency (%) |
) | 37 | |
( | 37 | |
13 | 14.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29902 | |
ASCII | 119 | 0.4% |
CJK | 100 | 0.3% |
CJK Compat Ideographs | 9 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 2197 | 7.3% |
이 | 1492 | 5.0% |
정 | 1344 | 4.5% |
영 | 1032 | 3.5% |
박 | 910 | 3.0% |
희 | 690 | 2.3% |
경 | 631 | 2.1% |
숙 | 552 | 1.8% |
미 | 542 | 1.8% |
수 | 542 | 1.8% |
Other values (273) | 19970 |
ASCII
Value | Count | Frequency (%) |
) | 37 | |
( | 37 | |
13 | 10.9% | |
e | 4 | 3.4% |
N | 4 | 3.4% |
Y | 3 | 2.5% |
A | 3 | 2.5% |
T | 3 | 2.5% |
I | 3 | 2.5% |
J | 1 | 0.8% |
Other values (11) | 11 | 9.2% |
CJK
Value | Count | Frequency (%) |
金 | 10 | 10.0% |
鄭 | 4 | 4.0% |
崔 | 4 | 4.0% |
永 | 3 | 3.0% |
泰 | 3 | 3.0% |
慶 | 3 | 3.0% |
朴 | 2 | 2.0% |
守 | 2 | 2.0% |
東 | 2 | 2.0% |
淑 | 2 | 2.0% |
Other values (57) | 65 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 7 | |
梁 | 1 | 11.1% |
連 | 1 | 11.1% |
bsnmcmpnm
Text
MISSING
 
Distinct | 3322 |
---|---|
Distinct (%) | 45.6% |
Missing | 2712 |
Missing (%) | 27.1% |
Memory size | 156.2 KiB |
Length
Max length | 21 |
---|---|
Median length | 20 |
Mean length | 11.280461 |
Min length | 4 |
Characters and Unicode
Total characters | 82212 |
---|---|
Distinct characters | 580 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 6 ? |
Unique
Unique | 2029 ? |
---|---|
Unique (%) | 27.8% |
Sample
1st row | 행복한부동산공인중개사사무소 |
---|---|
2nd row | 거산공인중개사사무소 |
3rd row | 신우공인중개사사무소 |
4th row | 더원공인중개사사무소 |
5th row | 하늘부동산공인중개사사무소 |
Value | Count | Frequency (%) |
주식회사 | 96 | 1.3% |
공인중개사사무소 | 72 | 0.9% |
사무소 | 72 | 0.9% |
주)부동산중개법인개벽 | 41 | 0.5% |
주)온나라부동산중개법인 | 38 | 0.5% |
삼성공인중개사사무소 | 36 | 0.5% |
현대공인중개사사무소 | 33 | 0.4% |
굿모닝공인중개사사무소 | 33 | 0.4% |
대명합동공인중개사사무소 | 30 | 0.4% |
우리공인중개사사무소 | 28 | 0.4% |
Other values (3320) | 7124 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 12590 | |
개 | 7336 | 8.9% |
중 | 7306 | 8.9% |
소 | 6587 | 8.0% |
무 | 6541 | 8.0% |
인 | 6247 | 7.6% |
공 | 5835 | 7.1% |
동 | 3031 | 3.7% |
부 | 2800 | 3.4% |
산 | 2783 | 3.4% |
Other values (570) | 21156 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 80122 | |
Uppercase Letter | 792 | 1.0% |
Space Separator | 396 | 0.5% |
Decimal Number | 299 | 0.4% |
Open Punctuation | 211 | 0.3% |
Close Punctuation | 211 | 0.3% |
Lowercase Letter | 149 | 0.2% |
Other Punctuation | 26 | < 0.1% |
Dash Punctuation | 3 | < 0.1% |
Letter Number | 1 | < 0.1% |
Other values (2) | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 12590 | |
개 | 7336 | 9.2% |
중 | 7306 | 9.1% |
소 | 6587 | 8.2% |
무 | 6541 | 8.2% |
인 | 6247 | 7.8% |
공 | 5835 | 7.3% |
동 | 3031 | 3.8% |
부 | 2800 | 3.5% |
산 | 2783 | 3.5% |
Other values (504) | 19066 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 149 | |
S | 93 | |
T | 62 | 7.8% |
L | 57 | 7.2% |
C | 48 | 6.1% |
B | 46 | 5.8% |
H | 43 | 5.4% |
W | 38 | 4.8% |
O | 38 | 4.8% |
G | 25 | 3.2% |
Other values (15) | 193 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 55 | |
h | 22 | 14.8% |
t | 11 | 7.4% |
s | 8 | 5.4% |
c | 7 | 4.7% |
w | 7 | 4.7% |
k | 7 | 4.7% |
i | 6 | 4.0% |
b | 5 | 3.4% |
o | 5 | 3.4% |
Other values (8) | 16 | 10.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 129 | |
2 | 42 | 14.0% |
8 | 39 | 13.0% |
4 | 34 | 11.4% |
3 | 25 | 8.4% |
9 | 11 | 3.7% |
5 | 9 | 3.0% |
6 | 6 | 2.0% |
0 | 4 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
& | 15 | |
. | 3 | 11.5% |
# | 2 | 7.7% |
· | 2 | 7.7% |
? | 2 | 7.7% |
! | 1 | 3.8% |
, | 1 | 3.8% |
Space Separator
Value | Count | Frequency (%) |
396 |
Open Punctuation
Value | Count | Frequency (%) |
( | 211 |
Close Punctuation
Value | Count | Frequency (%) |
) | 211 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Other Symbol
Value | Count | Frequency (%) |
ⓡ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 80111 | |
Common | 1148 | 1.4% |
Latin | 942 | 1.1% |
Han | 11 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 12590 | |
개 | 7336 | 9.2% |
중 | 7306 | 9.1% |
소 | 6587 | 8.2% |
무 | 6541 | 8.2% |
인 | 6247 | 7.8% |
공 | 5835 | 7.3% |
동 | 3031 | 3.8% |
부 | 2800 | 3.5% |
산 | 2783 | 3.5% |
Other values (493) | 19055 |
Latin
Value | Count | Frequency (%) |
K | 149 | |
S | 93 | 9.9% |
T | 62 | 6.6% |
L | 57 | 6.1% |
e | 55 | 5.8% |
C | 48 | 5.1% |
B | 46 | 4.9% |
H | 43 | 4.6% |
W | 38 | 4.0% |
O | 38 | 4.0% |
Other values (34) | 313 |
Common
Value | Count | Frequency (%) |
396 | ||
( | 211 | |
) | 211 | |
1 | 129 | 11.2% |
2 | 42 | 3.7% |
8 | 39 | 3.4% |
4 | 34 | 3.0% |
3 | 25 | 2.2% |
& | 15 | 1.3% |
9 | 11 | 1.0% |
Other values (12) | 35 | 3.0% |
Han
Value | Count | Frequency (%) |
人 | 1 | |
利 | 1 | |
太 | 1 | |
秀 | 1 | |
本 | 1 | |
氷 | 1 | |
明 | 1 | |
堂 | 1 | |
甲 | 1 | |
該 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 80111 | |
ASCII | 2084 | 2.5% |
CJK | 11 | < 0.1% |
None | 4 | < 0.1% |
Number Forms | 1 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 12590 | |
개 | 7336 | 9.2% |
중 | 7306 | 9.1% |
소 | 6587 | 8.2% |
무 | 6541 | 8.2% |
인 | 6247 | 7.8% |
공 | 5835 | 7.3% |
동 | 3031 | 3.8% |
부 | 2800 | 3.5% |
산 | 2783 | 3.5% |
Other values (493) | 19055 |
ASCII
Value | Count | Frequency (%) |
396 | ||
( | 211 | 10.1% |
) | 211 | 10.1% |
K | 149 | 7.1% |
1 | 129 | 6.2% |
S | 93 | 4.5% |
T | 62 | 3.0% |
L | 57 | 2.7% |
e | 55 | 2.6% |
C | 48 | 2.3% |
Other values (52) | 673 |
None
Value | Count | Frequency (%) |
· | 2 | |
? | 2 |
CJK
Value | Count | Frequency (%) |
人 | 1 | |
利 | 1 | |
太 | 1 | |
秀 | 1 | |
本 | 1 | |
氷 | 1 | |
明 | 1 | |
堂 | 1 | |
甲 | 1 | |
該 | 1 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓡ | 1 |
crqfcacqdt
Text
MISSING
 
Distinct | 658 |
---|---|
Distinct (%) | 11.2% |
Missing | 4104 |
Missing (%) | 41.0% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9989824 |
Min length | 8 |
Characters and Unicode
Total characters | 58954 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 405 ? |
---|---|
Unique (%) | 6.9% |
Sample
1st row | 2016-12-15 |
---|---|
2nd row | 2019-12-09 |
3rd row | 1985-11-06 |
4th row | 2016-12-12 |
5th row | 1985-11-18 |
Value | Count | Frequency (%) |
2005-07-20 | 377 | 6.4% |
2017-12-11 | 330 | 5.6% |
2016-12-12 | 306 | 5.2% |
2019-12-09 | 242 | 4.1% |
2015-12-09 | 220 | 3.7% |
2003-11-07 | 212 | 3.6% |
2005-12-12 | 191 | 3.2% |
2018-12-10 | 184 | 3.1% |
2000-11-20 | 154 | 2.6% |
2001-12-10 | 154 | 2.6% |
Other values (648) | 3526 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 13421 | |
0 | 12350 | |
- | 11786 | |
2 | 11189 | |
9 | 2800 | 4.7% |
5 | 1704 | 2.9% |
7 | 1632 | 2.8% |
8 | 1425 | 2.4% |
3 | 1136 | 1.9% |
6 | 944 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47168 | |
Dash Punctuation | 11786 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 13421 | |
0 | 12350 | |
2 | 11189 | |
9 | 2800 | 5.9% |
5 | 1704 | 3.6% |
7 | 1632 | 3.5% |
8 | 1425 | 3.0% |
3 | 1136 | 2.4% |
6 | 944 | 2.0% |
4 | 567 | 1.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11786 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 58954 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 13421 | |
0 | 12350 | |
- | 11786 | |
2 | 11189 | |
9 | 2800 | 4.7% |
5 | 1704 | 2.9% |
7 | 1632 | 2.8% |
8 | 1425 | 2.4% |
3 | 1136 | 1.9% |
6 | 944 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 58954 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 13421 | |
0 | 12350 | |
- | 11786 | |
2 | 11189 | |
9 | 2800 | 4.7% |
5 | 1704 | 2.9% |
7 | 1632 | 2.8% |
8 | 1425 | 2.4% |
3 | 1136 | 1.9% |
6 | 944 | 1.6% |
crqfcno
Text
MISSING
 
Distinct | 5774 |
---|---|
Distinct (%) | 96.2% |
Missing | 4000 |
Missing (%) | 40.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 20 |
Mean length | 9.2301667 |
Min length | 1 |
Characters and Unicode
Total characters | 55381 |
---|---|
Distinct characters | 63 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 5575 ? |
---|---|
Unique (%) | 92.9% |
Sample
1st row | 26-2016-01898 |
---|---|
2nd row | 26-2019-00518 |
3rd row | 2560 |
4th row | 26-2016-01358 |
5th row | 4988 |
Value | Count | Frequency (%) |
부산 | 361 | 5.5% |
부산시 | 52 | 0.8% |
부산광역시 | 25 | 0.4% |
부산광역시장 | 18 | 0.3% |
경남 | 16 | 0.2% |
울산 | 5 | 0.1% |
455 | 4 | 0.1% |
제 | 4 | 0.1% |
경상남도 | 4 | 0.1% |
662 | 4 | 0.1% |
Other values (5730) | 6023 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8902 | |
2 | 6995 | |
1 | 6693 | |
- | 6177 | |
6 | 3938 | 7.1% |
4 | 2539 | 4.6% |
3 | 2445 | 4.4% |
5 | 2260 | 4.1% |
8 | 2246 | 4.1% |
7 | 2235 | 4.0% |
Other values (53) | 10951 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40434 | |
Other Letter | 6830 | 12.3% |
Dash Punctuation | 6177 | 11.2% |
Close Punctuation | 703 | 1.3% |
Open Punctuation | 703 | 1.3% |
Space Separator | 518 | 0.9% |
Other Punctuation | 16 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 2016 | |
부 | 1999 | |
호 | 543 | 8.0% |
제 | 484 | 7.1% |
시 | 395 | 5.8% |
광 | 300 | 4.4% |
역 | 299 | 4.4% |
장 | 167 | 2.4% |
경 | 142 | 2.1% |
남 | 117 | 1.7% |
Other values (33) | 368 | 5.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 8902 | |
2 | 6995 | |
1 | 6693 | |
6 | 3938 | |
4 | 2539 | 6.3% |
3 | 2445 | 6.0% |
5 | 2260 | 5.6% |
8 | 2246 | 5.6% |
7 | 2235 | 5.5% |
9 | 2181 | 5.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 8 | |
, | 4 | |
: | 3 | 18.8% |
? | 1 | 6.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 657 | |
] | 46 | 6.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 657 | |
[ | 46 | 6.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6177 |
Space Separator
Value | Count | Frequency (%) |
518 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 48551 | |
Hangul | 6830 | 12.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 2016 | |
부 | 1999 | |
호 | 543 | 8.0% |
제 | 484 | 7.1% |
시 | 395 | 5.8% |
광 | 300 | 4.4% |
역 | 299 | 4.4% |
장 | 167 | 2.4% |
경 | 142 | 2.1% |
남 | 117 | 1.7% |
Other values (33) | 368 | 5.4% |
Common
Value | Count | Frequency (%) |
0 | 8902 | |
2 | 6995 | |
1 | 6693 | |
- | 6177 | |
6 | 3938 | |
4 | 2539 | 5.2% |
3 | 2445 | 5.0% |
5 | 2260 | 4.7% |
8 | 2246 | 4.6% |
7 | 2235 | 4.6% |
Other values (10) | 4121 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 48551 | |
Hangul | 6830 | 12.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8902 | |
2 | 6995 | |
1 | 6693 | |
- | 6177 | |
6 | 3938 | |
4 | 2539 | 5.2% |
3 | 2445 | 5.0% |
5 | 2260 | 4.7% |
8 | 2246 | 4.6% |
7 | 2235 | 4.6% |
Other values (10) | 4121 |
Hangul
Value | Count | Frequency (%) |
산 | 2016 | |
부 | 1999 | |
호 | 543 | 8.0% |
제 | 484 | 7.1% |
시 | 395 | 5.8% |
광 | 300 | 4.4% |
역 | 299 | 4.4% |
장 | 167 | 2.4% |
경 | 142 | 2.1% |
남 | 117 | 1.7% |
Other values (33) | 368 | 5.4% |
jurirno
Text
MISSING
 
Distinct | 4756 |
---|---|
Distinct (%) | 65.3% |
Missing | 2712 |
Missing (%) | 27.1% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 13.727772 |
Min length | 6 |
Characters and Unicode
Total characters | 100048 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 3434 ? |
---|---|
Unique (%) | 47.1% |
Sample
1st row | 26170-2020-00034 |
---|---|
2nd row | 26500-2019-00125 |
3rd row | 26140-2017-00010 |
4th row | 26230-2019-00034 |
5th row | 26440-2020-00034 |
Value | Count | Frequency (%) |
26470-2018-00085 | 41 | 0.6% |
26470-2016-00066 | 38 | 0.5% |
26470-2015-00027 | 30 | 0.4% |
26530-2017-00027 | 25 | 0.3% |
가-13-1750 | 23 | 0.3% |
26470-2018-00103 | 23 | 0.3% |
가-05-4212 | 23 | 0.3% |
26230-2016-00137 | 20 | 0.3% |
26230-2020-00171 | 18 | 0.2% |
26290-2017-00018 | 18 | 0.2% |
Other values (4747) | 7030 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 28702 | |
2 | 16338 | |
- | 14534 | |
1 | 9889 | 9.9% |
6 | 7847 | 7.8% |
3 | 4313 | 4.3% |
4 | 3812 | 3.8% |
5 | 3538 | 3.5% |
7 | 3278 | 3.3% |
9 | 2922 | 2.9% |
Other values (4) | 4875 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 83256 | |
Dash Punctuation | 14534 | 14.5% |
Other Letter | 2257 | 2.3% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 28702 | |
2 | 16338 | |
1 | 9889 | 11.9% |
6 | 7847 | 9.4% |
3 | 4313 | 5.2% |
4 | 3812 | 4.6% |
5 | 3538 | 4.2% |
7 | 3278 | 3.9% |
9 | 2922 | 3.5% |
8 | 2617 | 3.1% |
Other Letter
Value | Count | Frequency (%) |
가 | 2231 | |
나 | 26 | 1.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14534 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 97791 | |
Hangul | 2257 | 2.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 28702 | |
2 | 16338 | |
- | 14534 | |
1 | 9889 | 10.1% |
6 | 7847 | 8.0% |
3 | 4313 | 4.4% |
4 | 3812 | 3.9% |
5 | 3538 | 3.6% |
7 | 3278 | 3.4% |
9 | 2922 | 3.0% |
Other values (2) | 2618 | 2.7% |
Hangul
Value | Count | Frequency (%) |
가 | 2231 | |
나 | 26 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97791 | |
Hangul | 2257 | 2.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 28702 | |
2 | 16338 | |
- | 14534 | |
1 | 9889 | 10.1% |
6 | 7847 | 8.0% |
3 | 4313 | 4.4% |
4 | 3812 | 3.9% |
5 | 3538 | 3.6% |
7 | 3278 | 3.4% |
9 | 2922 | 3.0% |
Other values (2) | 2618 | 2.7% |
Hangul
Value | Count | Frequency (%) |
가 | 2231 | |
나 | 26 | 1.2% |
lastupdtdt
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-01-31 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01-31 |
---|---|
2nd row | 2021-01-31 |
3rd row | 2021-01-31 |
4th row | 2021-01-31 |
5th row | 2021-01-31 |
Common Values
Value | Count | Frequency (%) |
2021-01-31 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-01-31 | 10000 |
ldcode
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26361.157 |
Minimum | 26110 |
---|---|
Maximum | 26710 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 26110 |
---|---|
5-th percentile | 26170 |
Q1 | 26260 |
median | 26350 |
Q3 | 26440 |
95-th percentile | 26530 |
Maximum | 26710 |
Range | 600 |
Interquartile range (IQR) | 180 |
Descriptive statistics
Standard deviation | 127.02151 |
---|---|
Coefficient of variation (CV) | 0.0048185106 |
Kurtosis | 0.57838593 |
Mean | 26361.157 |
Median Absolute Deviation (MAD) | 90 |
Skewness | 0.63547105 |
Sum | 2.6361157 × 108 |
Variance | 16134.465 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
26230 | 1417 | |
26350 | 1388 | |
26260 | 1086 | |
26470 | 900 | |
26410 | 881 | |
26440 | 773 | |
26380 | 650 | |
26500 | 591 | |
26290 | 552 | 5.5% |
26710 | 455 | 4.5% |
Other values (6) | 1307 |
Value | Count | Frequency (%) |
26110 | 161 | 1.6% |
26140 | 177 | 1.8% |
26170 | 175 | 1.8% |
26200 | 164 | 1.6% |
26230 | 1417 | |
26260 | 1086 | |
26290 | 552 | 5.5% |
26320 | 340 | 3.4% |
26350 | 1388 | |
26380 | 650 |
Value | Count | Frequency (%) |
26710 | 455 | 4.5% |
26530 | 290 | 2.9% |
26500 | 591 | |
26470 | 900 | |
26440 | 773 | |
26410 | 881 | |
26380 | 650 | |
26350 | 1388 | |
26320 | 340 | 3.4% |
26290 | 552 | 5.5% |
ldcodenm
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
부산광역시 부산진구 | |
---|---|
부산광역시 해운대구 | |
부산광역시 동래구 | |
부산광역시 연제구 | |
부산광역시 금정구 | |
Other values (11) |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.14 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 동구 |
---|---|
2nd row | 부산광역시 수영구 |
3rd row | 부산광역시 서구 |
4th row | 부산광역시 부산진구 |
5th row | 부산광역시 강서구 |
Common Values
Value | Count | Frequency (%) |
부산광역시 부산진구 | 1417 | |
부산광역시 해운대구 | 1388 | |
부산광역시 동래구 | 1086 | |
부산광역시 연제구 | 900 | |
부산광역시 금정구 | 881 | |
부산광역시 강서구 | 773 | |
부산광역시 사하구 | 650 | |
부산광역시 수영구 | 591 | |
부산광역시 남구 | 552 | 5.5% |
부산광역시 기장군 | 455 | 4.5% |
Other values (6) | 1307 |
Length
Value | Count | Frequency (%) |
부산광역시 | 10000 | |
부산진구 | 1417 | 7.1% |
해운대구 | 1388 | 6.9% |
동래구 | 1086 | 5.4% |
연제구 | 900 | 4.5% |
금정구 | 881 | 4.4% |
강서구 | 773 | 3.9% |
사하구 | 650 | 3.2% |
수영구 | 591 | 3.0% |
남구 | 552 | 2.8% |
Other values (7) | 1762 | 8.8% |
ofcpssecode
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
4 | |
<NA> | |
2 | 10 |
3 | 10 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.8118 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 4 |
3rd row | 4 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
1 | 3739 | |
4 | 3535 | |
<NA> | 2706 | |
2 | 10 | 0.1% |
3 | 10 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 3739 | |
4 | 3535 | |
na | 2706 | |
2 | 10 | 0.1% |
3 | 10 | 0.1% |
ofcpssecodenm
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
대표 | |
---|---|
일반 | |
<NA> | |
감사 | 10 |
이사 | 10 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.5412 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 대표 |
---|---|
2nd row | 일반 |
3rd row | 일반 |
4th row | 일반 |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
대표 | 3739 | |
일반 | 3535 | |
<NA> | 2706 | |
감사 | 10 | 0.1% |
이사 | 10 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
대표 | 3739 | |
일반 | 3535 | |
na | 2706 | |
감사 | 10 | 0.1% |
이사 | 10 | 0.1% |
last_load_dttm
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2021-02-01 06:22:03 | |
---|---|
2021-02-01 06:22:04 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-02-01 06:22:03 |
---|---|
2nd row | 2021-02-01 06:22:04 |
3rd row | 2021-02-01 06:22:03 |
4th row | 2021-02-01 06:22:03 |
5th row | 2021-02-01 06:22:04 |
Common Values
Value | Count | Frequency (%) |
2021-02-01 06:22:03 | 5134 | |
2021-02-01 06:22:04 | 4866 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-02-01 | 10000 | |
06:22:03 | 5134 | |
06:22:04 | 4866 |
brkrasortcode | brkrasortcodenm | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|
brkrasortcode | 1.000 | 1.000 | 0.231 | 0.293 | 0.829 | 0.829 | 0.048 |
brkrasortcodenm | 1.000 | 1.000 | 0.231 | 0.293 | 0.829 | 0.829 | 0.048 |
ldcode | 0.231 | 0.231 | 1.000 | 1.000 | 0.217 | 0.217 | 0.996 |
ldcodenm | 0.293 | 0.293 | 1.000 | 1.000 | 0.245 | 0.245 | 0.997 |
ofcpssecode | 0.829 | 0.829 | 0.217 | 0.245 | 1.000 | 1.000 | 0.034 |
ofcpssecodenm | 0.829 | 0.829 | 0.217 | 0.245 | 1.000 | 1.000 | 0.034 |
last_load_dttm | 0.048 | 0.048 | 0.996 | 0.997 | 0.034 | 0.034 | 1.000 |
ofcpssecodenm | ldcodenm | brkrasortcode | ofcpssecode | last_load_dttm | brkrasortcodenm | |
---|---|---|---|---|---|---|
ofcpssecodenm | 1.000 | 0.117 | 0.476 | 1.000 | 0.022 | 0.476 |
ldcodenm | 0.117 | 1.000 | 0.141 | 0.117 | 0.948 | 0.141 |
brkrasortcode | 0.476 | 0.141 | 1.000 | 0.476 | 0.032 | 1.000 |
ofcpssecode | 1.000 | 0.117 | 0.476 | 1.000 | 0.022 | 0.476 |
last_load_dttm | 0.022 | 0.948 | 0.032 | 0.022 | 1.000 | 0.032 |
brkrasortcodenm | 0.476 | 0.141 | 1.000 | 0.476 | 0.032 | 1.000 |
ldcode | brkrasortcode | brkrasortcodenm | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|
ldcode | 1.000 | 0.107 | 0.107 | 1.000 | 0.099 | 0.099 | 0.945 |
brkrasortcode | 0.107 | 1.000 | 1.000 | 0.141 | 0.476 | 0.476 | 0.032 |
brkrasortcodenm | 0.107 | 1.000 | 1.000 | 0.141 | 0.476 | 0.476 | 0.032 |
ldcodenm | 1.000 | 0.141 | 0.141 | 1.000 | 0.117 | 0.117 | 0.948 |
ofcpssecode | 0.099 | 0.476 | 0.476 | 0.117 | 1.000 | 1.000 | 0.022 |
ofcpssecodenm | 0.099 | 0.476 | 0.476 | 0.117 | 1.000 | 1.000 | 0.022 |
last_load_dttm | 0.945 | 0.032 | 0.032 | 0.948 | 0.022 | 0.022 | 1.000 |
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
808 | 2 | 공인중개사 | 여미화 | 행복한부동산공인중개사사무소 | 2016-12-15 | 26-2016-01898 | 26170-2020-00034 | 2021-01-31 | 26170 | 부산광역시 동구 | 1 | 대표 | 2021-02-01 06:22:03 |
17888 | 4 | 중개보조원 | 김정희 | 거산공인중개사사무소 | <NA> | <NA> | 26500-2019-00125 | 2021-01-31 | 26500 | 부산광역시 수영구 | 4 | 일반 | 2021-02-01 06:22:04 |
540 | 2 | 공인중개사 | 송민금 | 신우공인중개사사무소 | 2019-12-09 | 26-2019-00518 | 26140-2017-00010 | 2021-01-31 | 26140 | 부산광역시 서구 | 4 | 일반 | 2021-02-01 06:22:03 |
3212 | 4 | 중개보조원 | 박순근 | 더원공인중개사사무소 | <NA> | <NA> | 26230-2019-00034 | 2021-01-31 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-02-01 06:22:03 |
14205 | 4 | 중개보조원 | 이관훈 | 하늘부동산공인중개사사무소 | <NA> | <NA> | 26440-2020-00034 | 2021-01-31 | 26440 | 부산광역시 강서구 | 4 | 일반 | 2021-02-01 06:22:04 |
4963 | 4 | 중개보조원 | 윤효덕 | 유림부동산공인중개사사무소 | <NA> | <NA> | 가-06-4342 | 2021-01-31 | 26260 | 부산광역시 동래구 | 4 | 일반 | 2021-02-01 06:22:03 |
1720 | 4 | 중개보조원 | 조현태 | 굿모닝공인중개사사무소 | <NA> | <NA> | 가-05-4212 | 2021-01-31 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-02-01 06:22:03 |
17584 | 2 | 공인중개사 | 석영태 | YT부동산공인중개사사무소 | 1985-11-06 | 2560 | 가-14-993 | 2021-01-31 | 26500 | 부산광역시 수영구 | 1 | 대표 | 2021-02-01 06:22:04 |
15323 | 2 | 공인중개사 | 하주현 | 우정부동산공인중개사사무소 | 2016-12-12 | 26-2016-01358 | 26470-2017-00078 | 2021-01-31 | 26470 | 부산광역시 연제구 | 1 | 대표 | 2021-02-01 06:22:04 |
12233 | 2 | 공인중개사 | 조동식 | <NA> | 1985-11-18 | 4988 | <NA> | 2021-01-31 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2021-02-01 06:22:04 |
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
14915 | 2 | 공인중개사 | 김숙 | <NA> | 1991-12-20 | 97 | <NA> | 2021-01-31 | 26440 | 부산광역시 강서구 | <NA> | <NA> | 2021-02-01 06:22:04 |
6032 | 2 | 공인중개사 | 김미정 | 그레이공인중개사사무소 | 2019-12-09 | 26-2019-01632 | 26260-2020-00091 | 2021-01-31 | 26260 | 부산광역시 동래구 | 4 | 일반 | 2021-02-01 06:22:03 |
5605 | 2 | 공인중개사 | 류계향 | <NA> | <NA> | <NA> | <NA> | 2021-01-31 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-02-01 06:22:03 |
16464 | 2 | 공인중개사 | 노재호 | 복덩이공인중개사사무소 | 1989-02-13 | 4-707 | 26470-2020-00139 | 2021-01-31 | 26470 | 부산광역시 연제구 | 4 | 일반 | 2021-02-01 06:22:04 |
10919 | 4 | 중개보조원 | 정민주 | 장수공인중개사사무소 | <NA> | <NA> | 가-08-761 | 2021-01-31 | 26380 | 부산광역시 사하구 | 4 | 일반 | 2021-02-01 06:22:04 |
12782 | 4 | 중개보조원 | 양기영 | 조은 공인중개사사무소 | <NA> | <NA> | 가-11-2031 | 2021-01-31 | 26410 | 부산광역시 금정구 | 4 | 일반 | 2021-02-01 06:22:04 |
6330 | 4 | 중개보조원 | 하귀선 | 홈런공인중개사사무소 | <NA> | <NA> | 26290-2017-00170 | 2021-01-31 | 26290 | 부산광역시 남구 | 4 | 일반 | 2021-02-01 06:22:03 |
13984 | 2 | 공인중개사 | 장재원 | 한라공인중개사사무소 | 2016-12-12 | 48-2016-01284 | 26440-2015-00160 | 2021-01-31 | 26440 | 부산광역시 강서구 | 4 | 일반 | 2021-02-01 06:22:04 |
18965 | 2 | 공인중개사 | 정경희 | <NA> | 2005-07-20 | 1451(부산) | <NA> | 2021-01-31 | 26710 | 부산광역시 기장군 | <NA> | <NA> | 2021-02-01 06:22:04 |
7272 | 4 | 중개보조원 | 최정원 | e편한세상공인중개사사무소 | <NA> | <NA> | 26320-2020-00002 | 2021-01-31 | 26320 | 부산광역시 북구 | 4 | 일반 | 2021-02-01 06:22:03 |
Most frequently occurring
brkrasortcode | brkrasortcodenm | brkrnm | bsnmcmpnm | crqfcacqdt | crqfcno | jurirno | lastupdtdt | ldcode | ldcodenm | ofcpssecode | ofcpssecodenm | last_load_dttm | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2 | 공인중개사 | 김미정 | <NA> | <NA> | <NA> | <NA> | 2021-01-31 | 26260 | 부산광역시 동래구 | <NA> | <NA> | 2021-02-01 06:22:03 | 2 |
1 | 2 | 공인중개사 | 김성수 | <NA> | <NA> | <NA> | <NA> | 2021-01-31 | 26350 | 부산광역시 해운대구 | <NA> | <NA> | 2021-02-01 06:22:04 | 2 |
2 | 4 | 중개보조원 | 박영만 | 대원부동산공인중개사사무소 | <NA> | <NA> | 가-05-2115 | 2021-01-31 | 26230 | 부산광역시 부산진구 | 4 | 일반 | 2021-02-01 06:22:03 | 2 |
3 | 4 | 중개보조원 | 이기옥 | <NA> | <NA> | <NA> | <NA> | 2021-01-31 | 26410 | 부산광역시 금정구 | <NA> | <NA> | 2021-02-01 06:22:04 | 2 |