Overview

Dataset statistics

Number of variables5
Number of observations409
Missing cells110
Missing cells (%)5.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.5 KiB
Average record size in memory41.3 B

Variable types

Numeric1
Text4

Dataset

Description울산광역시 구구별(남구, 중구) 공인중개사 사무소 현황 정보(사무소명, 대표명, 도로명주소, 연락처 등)를 제공하고 있습니다.
Author울산광역시
URLhttps://www.data.go.kr/data/15091285/fileData.do

Alerts

연락처 has 109 (26.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 15:53:02.229011
Analysis finished2024-04-21 15:53:03.673956
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct409
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean205
Minimum1
Maximum409
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-04-22T00:53:03.806436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.4
Q1103
median205
Q3307
95-th percentile388.6
Maximum409
Range408
Interquartile range (IQR)204

Descriptive statistics

Standard deviation118.21238
Coefficient of variation (CV)0.57664575
Kurtosis-1.2
Mean205
Median Absolute Deviation (MAD)102
Skewness0
Sum83845
Variance13974.167
MonotonicityStrictly increasing
2024-04-22T00:53:04.068603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
270 1
 
0.2%
280 1
 
0.2%
279 1
 
0.2%
278 1
 
0.2%
277 1
 
0.2%
276 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
Other values (399) 399
97.6%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
409 1
0.2%
408 1
0.2%
407 1
0.2%
406 1
0.2%
405 1
0.2%
404 1
0.2%
403 1
0.2%
402 1
0.2%
401 1
0.2%
400 1
0.2%
Distinct408
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-04-22T00:53:04.894650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length11.127139
Min length7

Characters and Unicode

Total characters4551
Distinct characters275
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique407 ?
Unique (%)99.5%

Sample

1st row현대부동산중개사무소
2nd row문수2차공인중개사사무소
3rd row키움부동산공인중개사사무소
4th row신태양공인중개사사무소
5th row현대부동산중개인사무소
ValueCountFrequency (%)
대원공인중개사사무소(합동 2
 
0.5%
rs강변공인중개사사무소 1
 
0.2%
kcc부동산공인중개사사무소 1
 
0.2%
국민공인중개사사무소 1
 
0.2%
백곰공인중개사사무소 1
 
0.2%
오름공인중개사사무소 1
 
0.2%
태화컨설팅공인중개사사무소 1
 
0.2%
행복한공인중개사사무소(합동 1
 
0.2%
약사강남공인중개사사무소 1
 
0.2%
신현대공인중개사사무소 1
 
0.2%
Other values (398) 398
97.3%
2024-04-22T00:53:06.053283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
786
17.3%
412
 
9.1%
409
 
9.0%
409
 
9.0%
403
 
8.9%
381
 
8.4%
369
 
8.1%
106
 
2.3%
83
 
1.8%
73
 
1.6%
Other values (265) 1120
24.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4425
97.2%
Uppercase Letter 40
 
0.9%
Close Punctuation 23
 
0.5%
Open Punctuation 23
 
0.5%
Decimal Number 17
 
0.4%
Space Separator 10
 
0.2%
Lowercase Letter 10
 
0.2%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
786
17.8%
412
9.3%
409
9.2%
409
9.2%
403
9.1%
381
 
8.6%
369
 
8.3%
106
 
2.4%
83
 
1.9%
73
 
1.6%
Other values (230) 994
22.5%
Uppercase Letter
ValueCountFrequency (%)
K 6
15.0%
N 4
 
10.0%
L 3
 
7.5%
I 3
 
7.5%
C 3
 
7.5%
A 3
 
7.5%
T 2
 
5.0%
G 2
 
5.0%
H 2
 
5.0%
R 2
 
5.0%
Other values (8) 10
25.0%
Decimal Number
ValueCountFrequency (%)
1 6
35.3%
5 3
17.6%
4 3
17.6%
3 2
 
11.8%
6 2
 
11.8%
2 1
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
e 5
50.0%
w 2
 
20.0%
h 1
 
10.0%
n 1
 
10.0%
i 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4422
97.2%
Common 76
 
1.7%
Latin 50
 
1.1%
Han 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
786
17.8%
412
9.3%
409
9.2%
409
9.2%
403
9.1%
381
 
8.6%
369
 
8.3%
106
 
2.4%
83
 
1.9%
73
 
1.7%
Other values (228) 991
22.4%
Latin
ValueCountFrequency (%)
K 6
 
12.0%
e 5
 
10.0%
N 4
 
8.0%
L 3
 
6.0%
I 3
 
6.0%
C 3
 
6.0%
A 3
 
6.0%
T 2
 
4.0%
G 2
 
4.0%
H 2
 
4.0%
Other values (13) 17
34.0%
Common
ValueCountFrequency (%)
) 23
30.3%
( 23
30.3%
10
13.2%
1 6
 
7.9%
5 3
 
3.9%
4 3
 
3.9%
3 2
 
2.6%
6 2
 
2.6%
, 1
 
1.3%
- 1
 
1.3%
Other values (2) 2
 
2.6%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4422
97.2%
ASCII 126
 
2.8%
CJK 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
786
17.8%
412
9.3%
409
9.2%
409
9.2%
403
9.1%
381
 
8.6%
369
 
8.3%
106
 
2.4%
83
 
1.9%
73
 
1.7%
Other values (228) 991
22.4%
ASCII
ValueCountFrequency (%)
) 23
18.3%
( 23
18.3%
10
 
7.9%
1 6
 
4.8%
K 6
 
4.8%
e 5
 
4.0%
N 4
 
3.2%
L 3
 
2.4%
I 3
 
2.4%
5 3
 
2.4%
Other values (25) 40
31.7%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%

대표
Text

Distinct402
Distinct (%)98.5%
Missing1
Missing (%)0.2%
Memory size3.3 KiB
2024-04-22T00:53:07.328028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9877451
Min length2

Characters and Unicode

Total characters1219
Distinct characters164
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique396 ?
Unique (%)97.1%

Sample

1st row이경태
2nd row김미자
3rd row한학송
4th row이용주
5th row송이천
ValueCountFrequency (%)
김혜진 2
 
0.5%
장경미 2
 
0.5%
이미숙 2
 
0.5%
김정희 2
 
0.5%
최정식 2
 
0.5%
김도영 2
 
0.5%
박은미 1
 
0.2%
이종재 1
 
0.2%
이경희 1
 
0.2%
한영숙 1
 
0.2%
Other values (392) 392
96.1%
2024-04-22T00:53:08.977698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
 
7.4%
64
 
5.3%
55
 
4.5%
49
 
4.0%
44
 
3.6%
39
 
3.2%
36
 
3.0%
28
 
2.3%
26
 
2.1%
24
 
2.0%
Other values (154) 764
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1219
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
7.4%
64
 
5.3%
55
 
4.5%
49
 
4.0%
44
 
3.6%
39
 
3.2%
36
 
3.0%
28
 
2.3%
26
 
2.1%
24
 
2.0%
Other values (154) 764
62.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1219
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
7.4%
64
 
5.3%
55
 
4.5%
49
 
4.0%
44
 
3.6%
39
 
3.2%
36
 
3.0%
28
 
2.3%
26
 
2.1%
24
 
2.0%
Other values (154) 764
62.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1219
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
90
 
7.4%
64
 
5.3%
55
 
4.5%
49
 
4.0%
44
 
3.6%
39
 
3.2%
36
 
3.0%
28
 
2.3%
26
 
2.1%
24
 
2.0%
Other values (154) 764
62.7%
Distinct388
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-04-22T00:53:09.953071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length47
Mean length25.327628
Min length15

Characters and Unicode

Total characters10359
Distinct characters209
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)90.5%

Sample

1st row울산광역시 남구 대암로 104
2nd row울산광역시 남구 동산로 64
3rd row울산광역시 남구 중앙로156번길 33
4th row도산로 9, 상가1동(달동, 이진빌라)
5th row울산광역시 중구 시원길 43(우정동)
ValueCountFrequency (%)
울산광역시 408
21.4%
중구 405
21.2%
화합로 20
 
1.0%
장춘로 17
 
0.9%
종가로 17
 
0.9%
태화로 14
 
0.7%
종가4길 11
 
0.6%
10 11
 
0.6%
번영로 10
 
0.5%
1 9
 
0.5%
Other values (609) 985
51.7%
2024-04-22T00:53:11.381075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1501
 
14.5%
495
 
4.8%
481
 
4.6%
479
 
4.6%
1 444
 
4.3%
417
 
4.0%
415
 
4.0%
409
 
3.9%
409
 
3.9%
409
 
3.9%
Other values (199) 4900
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6239
60.2%
Decimal Number 1622
 
15.7%
Space Separator 1501
 
14.5%
Close Punctuation 325
 
3.1%
Open Punctuation 325
 
3.1%
Other Punctuation 292
 
2.8%
Dash Punctuation 27
 
0.3%
Uppercase Letter 18
 
0.2%
Lowercase Letter 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
495
 
7.9%
481
 
7.7%
479
 
7.7%
417
 
6.7%
415
 
6.7%
409
 
6.6%
409
 
6.6%
409
 
6.6%
218
 
3.5%
190
 
3.0%
Other values (172) 2317
37.1%
Decimal Number
ValueCountFrequency (%)
1 444
27.4%
2 243
15.0%
0 215
13.3%
3 157
 
9.7%
4 128
 
7.9%
5 124
 
7.6%
6 95
 
5.9%
7 90
 
5.5%
8 79
 
4.9%
9 47
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
C 6
33.3%
K 3
16.7%
B 2
 
11.1%
L 2
 
11.1%
H 2
 
11.1%
P 1
 
5.6%
W 1
 
5.6%
A 1
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
e 7
70.0%
k 1
 
10.0%
a 1
 
10.0%
r 1
 
10.0%
Space Separator
ValueCountFrequency (%)
1501
100.0%
Close Punctuation
ValueCountFrequency (%)
) 325
100.0%
Open Punctuation
ValueCountFrequency (%)
( 325
100.0%
Other Punctuation
ValueCountFrequency (%)
, 292
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6239
60.2%
Common 4092
39.5%
Latin 28
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
495
 
7.9%
481
 
7.7%
479
 
7.7%
417
 
6.7%
415
 
6.7%
409
 
6.6%
409
 
6.6%
409
 
6.6%
218
 
3.5%
190
 
3.0%
Other values (172) 2317
37.1%
Common
ValueCountFrequency (%)
1501
36.7%
1 444
 
10.9%
) 325
 
7.9%
( 325
 
7.9%
, 292
 
7.1%
2 243
 
5.9%
0 215
 
5.3%
3 157
 
3.8%
4 128
 
3.1%
5 124
 
3.0%
Other values (5) 338
 
8.3%
Latin
ValueCountFrequency (%)
e 7
25.0%
C 6
21.4%
K 3
10.7%
B 2
 
7.1%
L 2
 
7.1%
H 2
 
7.1%
k 1
 
3.6%
a 1
 
3.6%
r 1
 
3.6%
P 1
 
3.6%
Other values (2) 2
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6239
60.2%
ASCII 4120
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1501
36.4%
1 444
 
10.8%
) 325
 
7.9%
( 325
 
7.9%
, 292
 
7.1%
2 243
 
5.9%
0 215
 
5.2%
3 157
 
3.8%
4 128
 
3.1%
5 124
 
3.0%
Other values (17) 366
 
8.9%
Hangul
ValueCountFrequency (%)
495
 
7.9%
481
 
7.7%
479
 
7.7%
417
 
6.7%
415
 
6.7%
409
 
6.6%
409
 
6.6%
409
 
6.6%
218
 
3.5%
190
 
3.0%
Other values (172) 2317
37.1%

연락처
Text

MISSING 

Distinct289
Distinct (%)96.3%
Missing109
Missing (%)26.7%
Memory size3.3 KiB
2024-04-22T00:53:12.315189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.003333
Min length12

Characters and Unicode

Total characters3601
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique280 ?
Unique (%)93.3%

Sample

1st row052-211-9009
2nd row052-227-4470
3rd row052-211-3430
4th row052-292-5511
5th row052-211-1635
ValueCountFrequency (%)
052-291-0022 3
 
1.0%
052-277-8949 3
 
1.0%
052-276-0090 2
 
0.7%
052-246-5686 2
 
0.7%
052-286-8100 2
 
0.7%
052-294-4949 2
 
0.7%
052-212-2234 2
 
0.7%
052-211-4549 2
 
0.7%
052-242-8989 2
 
0.7%
052-244-5973 1
 
0.3%
Other values (279) 279
93.0%
2024-04-22T00:53:13.662476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 744
20.7%
- 600
16.7%
0 541
15.0%
5 464
12.9%
9 261
 
7.2%
4 226
 
6.3%
8 209
 
5.8%
1 174
 
4.8%
6 133
 
3.7%
7 132
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3001
83.3%
Dash Punctuation 600
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 744
24.8%
0 541
18.0%
5 464
15.5%
9 261
 
8.7%
4 226
 
7.5%
8 209
 
7.0%
1 174
 
5.8%
6 133
 
4.4%
7 132
 
4.4%
3 117
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 600
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3601
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 744
20.7%
- 600
16.7%
0 541
15.0%
5 464
12.9%
9 261
 
7.2%
4 226
 
6.3%
8 209
 
5.8%
1 174
 
4.8%
6 133
 
3.7%
7 132
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3601
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 744
20.7%
- 600
16.7%
0 541
15.0%
5 464
12.9%
9 261
 
7.2%
4 226
 
6.3%
8 209
 
5.8%
1 174
 
4.8%
6 133
 
3.7%
7 132
 
3.7%

Interactions

2024-04-22T00:53:03.007807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-22T00:53:03.276806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T00:53:03.449473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-22T00:53:03.596624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번사무소명대표도로명주소연락처
01현대부동산중개사무소이경태울산광역시 남구 대암로 104052-211-9009
12문수2차공인중개사사무소김미자울산광역시 남구 동산로 64052-227-4470
23키움부동산공인중개사사무소한학송울산광역시 남구 중앙로156번길 33<NA>
34신태양공인중개사사무소이용주도산로 9, 상가1동(달동, 이진빌라)<NA>
45현대부동산중개인사무소송이천울산광역시 중구 시원길 43(우정동)052-211-3430
56제일부동산중개인사무소김재순울산광역시 중구 병영6길 35, 상가2호(동동,성진파크)052-292-5511
67강남부동산중개사무소윤흥갑울산광역시 중구 다운로 132(다운동)052-211-1635
78래미안부동산중개사무소지미자울산광역시 중구 평산1길 2(약사동)052-293-2711
89시민공인중개사사무소김대식울산광역시 중구 중앙길 261(학산동)052-275-9101
910월성부동산중개인사무소홍영복울산광역시 중구 학성공원4길 12(학성동)052-296-9861
연번사무소명대표도로명주소연락처
399400코아루공인중개사사무소황정희울산광역시 중구 명륜로 30(우정동)<NA>
400401참다운공인중개사사무소서성숙울산광역시 중구 서원6길 5<NA>
401402복산아이파크공인중개사사무소김혜진울산광역시 중구 계변로 96, 118동 104호(복산동)052-282-1113
402403금호어울림공인중개사사무소황동태울산광역시 중구 함월12길 29, 1층052-243-2588
403404신365공인중개사사무소고동형울산광역시 중구 장춘로 162<NA>
404405비원부동산중개사무소이경미울산광역시 중구 종가4길 10, 220동 104호052-248-5585
405406혁신푸르지오부동산공인중개사사무소하두옥울산광역시 중구 유곡로 80, 상가514동 103호<NA>
406407새황금공인중개사사무소김진화울산광역시 중구 옥교3길 103(학성동)<NA>
407408정효은공인중개사사무소정효은울산광역시 중구 번영로 435, 110동 B101호(복산동,효성해링턴플레이스1차아파트)052-294-5252
408409종가공인중개사사무소최대식울산광역시 중구 장춘로 56(우정동)052-265-5989