Overview

Dataset statistics

Number of variables: 5
Number of observations: 256
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.4 KiB
Average record size in memory: 41.5 B

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

Description: Real estate brokerage status (부동산 중개업 현황)
Author: Changwon-si, Gyeongsangnam-do (경상남도 창원시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3077949

Alerts

중개업소구분 is highly imbalanced (74.0%) [Imbalance]
연번 has unique values [Unique]
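
The 74.0% figure in the imbalance alert is consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below assumes that definition, which the report itself does not state, and applies it to the three class counts listed for 중개업소구분 (238, 15, 3). The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    if len(shares) < 2:
        return 0.0  # assumption: a single class is scored as 0
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1.0 - entropy / math.log2(len(shares))

# Class counts for 중개업소구분 as listed in the report
score = imbalance_score([238, 15, 3])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score near 1 would mean almost all rows fall in one class.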

Reproduction

Analysis started: 2023-12-11 00:07:45.411127
Analysis finished: 2023-12-11 00:07:45.973936
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 256
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 128.5
Minimum: 1
Maximum: 256
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 13.75
Q1: 64.75
Median: 128.5
Q3: 192.25
95-th percentile: 243.25
Maximum: 256
Range: 255
Interquartile range (IQR): 127.5
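
These quantiles can be checked by hand if 연번 holds the integers 1 through 256, which the minimum, maximum and distinct count all suggest. The sketch below assumes that content and assumes linear interpolation between order statistics. The helper name `quantile` is hypothetical.

```python
def quantile(sorted_values, q):
    """Linear-interpolation quantile at position q * (n - 1)."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    frac = pos - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[-1])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + frac * step

values = list(range(1, 257))  # assumed contents of 연번: 1, 2, ..., 256

for label, q in [("5%", 0.05), ("Q1", 0.25), ("median", 0.5),
                 ("Q3", 0.75), ("95%", 0.95)]:
    print(label, round(quantile(values, q), 2))

iqr = quantile(values, 0.75) - quantile(values, 0.25)
print("IQR", iqr)
```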

Descriptive statistics

Standard deviation: 74.045031
Coefficient of variation (CV): 0.57622592
Kurtosis: -1.2
Mean: 128.5
Median Absolute Deviation (MAD): 64
Skewness: 0
Sum: 32896
Variance: 5482.6667
Monotonicity: Strictly increasing
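
The descriptive statistics can be recomputed with the standard library under the same assumption that 연번 is the sequence 1, 2, ..., 256. The sketch uses the sample variance (n - 1 denominator) and a bias-adjusted excess kurtosis. Both choices are assumptions about how the report computes these values, made because they match the reported figures.

```python
import statistics

values = list(range(1, 257))  # assumed contents of 연번
n = len(values)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Central moments, used for skewness and excess kurtosis
m2 = sum((v - mean) ** 2 for v in values) / n
m3 = sum((v - mean) ** 3 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n
skewness = m3 / m2 ** 1.5
g2 = m4 / m2 ** 2 - 3  # population excess kurtosis
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8))
print(mad, round(skewness, 6), round(kurtosis, 6))
```

For an evenly spaced sequence the sample variance reduces to n(n + 1)/12, which gives 256 × 257 / 12 ≈ 5482.6667.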
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 0.4%
130 | 1 | 0.4%
164 | 1 | 0.4%
165 | 1 | 0.4%
166 | 1 | 0.4%
167 | 1 | 0.4%
168 | 1 | 0.4%
169 | 1 | 0.4%
170 | 1 | 0.4%
171 | 1 | 0.4%
Other values (246) | 246 | 96.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
256 | 1 | 0.4%
255 | 1 | 0.4%
254 | 1 | 0.4%
253 | 1 | 0.4%
252 | 1 | 0.4%
251 | 1 | 0.4%
250 | 1 | 0.4%
249 | 1 | 0.4%
248 | 1 | 0.4%
247 | 1 | 0.4%

사무소명 (office name)
Text

Distinct: 241
Distinct (%): 94.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 17
Median length: 16
Mean length: 10.835938
Min length: 7

Characters and Unicode

Total characters: 2774
Distinct characters: 205
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
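
As a minimal illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using two values from this variable's sample rows. The helper name `category_counts` is hypothetical, and the standard library does not expose scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# Two sample values of the office-name column
with_digits = category_counts("마산114공인중개사사무소")
with_space = category_counts("마창경매컨설팅 공인중개사사무소")
print(with_digits)
print(with_space)
```

The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Lu` and `Ll` are Uppercase and Lowercase Letter, and `Ps` and `Pe` are Open and Close Punctuation.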

Unique

Unique: 229
Unique (%): 89.5%

Sample

1st row: 구암공인중개사사무소
2nd row: 대동공인중개사사무소
3rd row: 대한공인중개사사무소
4th row: 마산114공인중개사사무소
5th row: 마창경매컨설팅 공인중개사사무소

Value | Count | Frequency (%)
사무소 | 36 | 12.2%
신세계공인중개사사무소 | 4 | 1.4%
스마트공인중개사사무소 | 3 | 1.0%
신우공인중개사사무소 | 2 | 0.7%
명성공인중개사사무소 | 2 | 0.7%
대박공인중개사사무소 | 2 | 0.7%
한솔공인중개사사무소 | 2 | 0.7%
1번지부동산공인중개사사무소 | 2 | 0.7%
명가공인중개사사무소 | 2 | 0.7%
롯데캐슬공인중개사사무소 | 2 | 0.7%
Other values (235) | 238 | 80.7%

Most occurring characters

ValueCountFrequency (%)
452
16.3%
257
 
9.3%
256
 
9.2%
242
 
8.7%
241
 
8.7%
216
 
7.8%
214
 
7.7%
79
 
2.8%
77
 
2.8%
76
 
2.7%
Other values (195) 664
23.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2712 | 97.8%
Space Separator | 39 | 1.4%
Decimal Number | 9 | 0.3%
Uppercase Letter | 9 | 0.3%
Lowercase Letter | 3 | 0.1%
Close Punctuation | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
452
16.7%
257
9.5%
256
9.4%
242
8.9%
241
8.9%
216
 
8.0%
214
 
7.9%
79
 
2.9%
77
 
2.8%
76
 
2.8%
Other values (179) 602
22.2%
Decimal Number
ValueCountFrequency (%)
1 4
44.4%
2 1
 
11.1%
3 1
 
11.1%
5 1
 
11.1%
6 1
 
11.1%
4 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
B 2
22.2%
A 2
22.2%
L 2
22.2%
G 2
22.2%
S 1
11.1%
Lowercase Letter
ValueCountFrequency (%)
o 2
66.7%
d 1
33.3%
Space Separator
ValueCountFrequency (%)
39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2712 | 97.8%
Common | 50 | 1.8%
Latin | 12 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
452
16.7%
257
9.5%
256
9.4%
242
8.9%
241
8.9%
216
 
8.0%
214
 
7.9%
79
 
2.9%
77
 
2.8%
76
 
2.8%
Other values (179) 602
22.2%
Common
ValueCountFrequency (%)
39
78.0%
1 4
 
8.0%
2 1
 
2.0%
3 1
 
2.0%
5 1
 
2.0%
6 1
 
2.0%
4 1
 
2.0%
) 1
 
2.0%
( 1
 
2.0%
Latin
ValueCountFrequency (%)
B 2
16.7%
A 2
16.7%
L 2
16.7%
o 2
16.7%
G 2
16.7%
d 1
8.3%
S 1
8.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2712 | 97.8%
ASCII | 62 | 2.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
452
16.7%
257
9.5%
256
9.4%
242
8.9%
241
8.9%
216
 
8.0%
214
 
7.9%
79
 
2.9%
77
 
2.8%
76
 
2.8%
Other values (179) 602
22.2%
ASCII
ValueCountFrequency (%)
39
62.9%
1 4
 
6.5%
B 2
 
3.2%
A 2
 
3.2%
L 2
 
3.2%
o 2
 
3.2%
G 2
 
3.2%
2 1
 
1.6%
3 1
 
1.6%
d 1
 
1.6%
Other values (6) 6
 
9.7%

중개업소구분 (brokerage type)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
공인중개사 (licensed real estate agent): 238
중개인 (broker): 15
법인 (corporation): 3

Length

Max length: 5
Median length: 5
Mean length: 4.8476562
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공인중개사
2nd row: 공인중개사
3rd row: 공인중개사
4th row: 공인중개사
5th row: 공인중개사

Common Values

Value | Count | Frequency (%)
공인중개사 | 238 | 93.0%
중개인 | 15 | 5.9%
법인 | 3 | 1.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
공인중개사 | 238 | 93.0%
중개인 | 15 | 5.9%
법인 | 3 | 1.2%

중개업자명 (broker name)
Text

Distinct: 252
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9921875
Min length: 2

Characters and Unicode

Total characters: 766
Distinct characters: 145
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 249
Unique (%): 97.3%

Sample

1st row: 안정은
2nd row: 이봉자
3rd row: 김한규
4th row: 김우철
5th row: 김정무

Value | Count | Frequency (%)
김정숙 | 3 | 1.2%
김정화 | 2 | 0.8%
김영미 | 2 | 0.8%
백순흠 | 1 | 0.4%
윤선이 | 1 | 0.4%
이재성 | 1 | 0.4%
이재기 | 1 | 0.4%
김봉수 | 1 | 0.4%
안정은 | 1 | 0.4%
이림 | 1 | 0.4%
Other values (242) | 242 | 94.5%
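
The table above also accounts for the gap between Distinct (252) and Unique (249) for this variable, reading "distinct" as the number of different values and "unique" as the number of values that occur exactly once. The sketch below rebuilds the counts under that reading. The three repeated names come from the table, and the 249 single-occurrence names are replaced by hypothetical placeholder labels because the report does not list them all.

```python
from collections import Counter

# Repeated names as reported in the table
counts = Counter({"김정숙": 3, "김정화": 2, "김영미": 2})
# Placeholder labels (hypothetical) for the names that appear once
counts.update({f"name_{i}": 1 for i in range(249)})

n_rows = sum(counts.values())
n_distinct = len(counts)
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_rows, n_distinct, n_unique)
print(f"{n_distinct / n_rows:.1%}", f"{n_unique / n_rows:.1%}")
```

The three repeated names are distinct but not unique, which is why the two figures differ by exactly three.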

Most occurring characters

ValueCountFrequency (%)
67
 
8.7%
41
 
5.4%
39
 
5.1%
21
 
2.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
18
 
2.3%
17
 
2.2%
17
 
2.2%
Other values (135) 488
63.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 766
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
8.7%
41
 
5.4%
39
 
5.1%
21
 
2.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
18
 
2.3%
17
 
2.2%
17
 
2.2%
Other values (135) 488
63.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 766
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
8.7%
41
 
5.4%
39
 
5.1%
21
 
2.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
18
 
2.3%
17
 
2.2%
17
 
2.2%
Other values (135) 488
63.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 766
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
67
 
8.7%
41
 
5.4%
39
 
5.1%
21
 
2.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
18
 
2.3%
17
 
2.2%
17
 
2.2%
Other values (135) 488
63.7%

사무소주소 (office address)
Text

Distinct: 247
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 55
Median length: 47.5
Mean length: 34.507812
Min length: 24

Characters and Unicode

Total characters: 8834
Distinct characters: 153
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 238
Unique (%): 93.0%

Sample

1st row: 경상남도 창원시 마산회원구 구암북17길 18(구암동)
2nd row: 경상남도 창원시 마산회원구 구암남11길 34(구암동)
3rd row: 경상남도 창원시 마산회원구 구암남14길 5, 101호(구암동, 구암대동타운부상가)
4th row: 경상남도 창원시 마산회원구 구암서6길 35(구암동)
5th row: 경상남도 창원시 마산회원구 구암남12길 1(구암동)

Value | Count | Frequency (%)
경상남도 | 256 | 16.4%
마산회원구 | 256 | 16.4%
창원시 | 256 | 16.4%
내서읍 | 84 | 5.4%
호원로 | 20 | 1.3%
양덕로 | 15 | 1.0%
30 | 14 | 0.9%
양덕서로 | 13 | 0.8%
중리상곡로 | 12 | 0.8%
메트로시티 | 11 | 0.7%
Other values (412) | 625 | 40.0%

Most occurring characters

ValueCountFrequency (%)
1306
 
14.8%
582
 
6.6%
1 330
 
3.7%
315
 
3.6%
312
 
3.5%
295
 
3.3%
290
 
3.3%
275
 
3.1%
266
 
3.0%
263
 
3.0%
Other values (143) 4600
52.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5701 | 64.5%
Space Separator | 1306 | 14.8%
Decimal Number | 1173 | 13.3%
Close Punctuation | 223 | 2.5%
Open Punctuation | 223 | 2.5%
Other Punctuation | 179 | 2.0%
Dash Punctuation | 24 | 0.3%
Uppercase Letter | 5 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
582
 
10.2%
315
 
5.5%
312
 
5.5%
295
 
5.2%
290
 
5.1%
275
 
4.8%
266
 
4.7%
263
 
4.6%
260
 
4.6%
258
 
4.5%
Other values (125) 2585
45.3%
Decimal Number
ValueCountFrequency (%)
1 330
28.1%
2 163
13.9%
0 147
12.5%
3 145
12.4%
5 85
 
7.2%
4 67
 
5.7%
6 65
 
5.5%
7 63
 
5.4%
9 56
 
4.8%
8 52
 
4.4%
Other Punctuation
ValueCountFrequency (%)
, 171
95.5%
· 8
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
A 3
60.0%
B 2
40.0%
Space Separator
ValueCountFrequency (%)
1306
100.0%
Close Punctuation
ValueCountFrequency (%)
) 223
100.0%
Open Punctuation
ValueCountFrequency (%)
( 223
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5701 | 64.5%
Common | 3128 | 35.4%
Latin | 5 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
582
 
10.2%
315
 
5.5%
312
 
5.5%
295
 
5.2%
290
 
5.1%
275
 
4.8%
266
 
4.7%
263
 
4.6%
260
 
4.6%
258
 
4.5%
Other values (125) 2585
45.3%
Common
ValueCountFrequency (%)
1306
41.8%
1 330
 
10.5%
) 223
 
7.1%
( 223
 
7.1%
, 171
 
5.5%
2 163
 
5.2%
0 147
 
4.7%
3 145
 
4.6%
5 85
 
2.7%
4 67
 
2.1%
Other values (6) 268
 
8.6%
Latin
ValueCountFrequency (%)
A 3
60.0%
B 2
40.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5701 | 64.5%
ASCII | 3125 | 35.4%
None | 8 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1306
41.8%
1 330
 
10.6%
) 223
 
7.1%
( 223
 
7.1%
, 171
 
5.5%
2 163
 
5.2%
0 147
 
4.7%
3 145
 
4.6%
5 85
 
2.7%
4 67
 
2.1%
Other values (7) 265
 
8.5%
Hangul
ValueCountFrequency (%)
582
 
10.2%
315
 
5.5%
312
 
5.5%
295
 
5.2%
290
 
5.1%
275
 
4.8%
266
 
4.7%
263
 
4.6%
260
 
4.6%
258
 
4.5%
Other values (125) 2585
45.3%
None
ValueCountFrequency (%)
· 8
100.0%

Interactions


Correlations

Variable | 연번 | 중개업소구분
연번 | 1.000 | 0.373
중개업소구분 | 0.373 | 1.000

Variable | 연번 | 중개업소구분
연번 | 1.000 | 0.243
중개업소구분 | 0.243 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 사무소명 | 중개업소구분 | 중개업자명 | 사무소주소
0 | 1 | 구암공인중개사사무소 | 공인중개사 | 안정은 | 경상남도 창원시 마산회원구 구암북17길 18(구암동)
1 | 2 | 대동공인중개사사무소 | 공인중개사 | 이봉자 | 경상남도 창원시 마산회원구 구암남11길 34(구암동)
2 | 3 | 대한공인중개사사무소 | 공인중개사 | 김한규 | 경상남도 창원시 마산회원구 구암남14길 5, 101호(구암동, 구암대동타운부상가)
3 | 4 | 마산114공인중개사사무소 | 공인중개사 | 김우철 | 경상남도 창원시 마산회원구 구암서6길 35(구암동)
4 | 5 | 마창경매컨설팅 공인중개사사무소 | 공인중개사 | 김정무 | 경상남도 창원시 마산회원구 구암남12길 1(구암동)
5 | 6 | 명성공인중개사사무소 | 공인중개사 | 김옥순 | 경상남도 창원시 마산회원구 구암북9길 26(구암동)
6 | 7 | 부경공인중개사사무소 | 공인중개사 | 김영상 | 경상남도 창원시 마산회원구 구암남14길 8, A동 205호(구암동, 구암대동타운주상가)
7 | 8 | 삼성공인중개사 사무소 | 공인중개사 | 이병철 | 경상남도 창원시 마산회원구 구암남14길 8, 110호(구암동, 대동1차아파트상가)
8 | 9 | 영남공인중개사사무소 | 공인중개사 | 이기수 | 경상남도 창원시 마산회원구 구암서2길 73(구암동)
9 | 10 | 우진공인중개사 사무소 | 공인중개사 | 김종찬 | 경상남도 창원시 마산회원구 구암남14길 8, 101호(구암동,구암대동타운주상가)

Last rows

Index | 연번 | 사무소명 | 중개업소구분 | 중개업자명 | 사무소주소
246 | 247 | 온누리공인중개사사무소 | 공인중개사 | 조미정 | 경상남도 창원시 마산회원구 회원남로 73(회원동)
247 | 248 | 우리부동산공인중개사 사무소 | 공인중개사 | 김영미 | 경상남도 창원시 마산회원구 교방시장1길 33(회원동)
248 | 249 | 육호공인중개사사무소 | 공인중개사 | 박노석 | 경상남도 창원시 마산회원구 3·15대로 480(회원동)
249 | 250 | 일일사부동산중개 사무소 | 중개인 | 김종열 | 경상남도 창원시 마산회원구 회원천북길 101(회원동)
250 | 251 | 지성공인중개사사무소 | 공인중개사 | 김지원 | 경상남도 창원시 마산회원구 회원남로 60(회원동)
251 | 252 | 천해지공인중개사사무소 | 공인중개사 | 김정화 | 경상남도 창원시 마산회원구 회원남로 11(회원동)
252 | 253 | 하나공인중개사무소 | 공인중개사 | 백순흠 | 경상남도 창원시 마산회원구 북성로 196(회원동)
253 | 254 | 한마공인중개사사무소 | 공인중개사 | 김구채 | 경상남도 창원시 마산회원구 무학로 549-1(회원동)
254 | 255 | 회원부동산중개 | 공인중개사 | 양필선 | 경상남도 창원시 마산회원구 회원남로 35(회원동)
255 | 256 | 육일공인중개사사무소 | 공인중개사 | 김혜이 | 경상남도 창원시 마산회원구 북성로 170(회원동)