Overview

Dataset statistics

Number of variables7
Number of observations252
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.1 KiB
Average record size in memory61.5 B

Variable types

Categorical2
Text1
Numeric4

Dataset

Description전국 시군구별 개업공인중개사무소의 개업공인중개사, 소속공인중개사, 중개보조원 등 고용인 신고현황, 타법에 의한 중개업자 통계 정보
URLhttps://www.data.go.kr/data/15063948/fileData.do

Alerts

합계 is highly overall correlated with 개업공인중개사 and 3 other fieldsHigh correlation
개업공인중개사 is highly overall correlated with 합계 and 3 other fieldsHigh correlation
소속공인중개사 is highly overall correlated with 합계 and 3 other fieldsHigh correlation
중개보조원 is highly overall correlated with 합계 and 3 other fieldsHigh correlation
타법에의한중개업자 is highly overall correlated with 합계 and 3 other fieldsHigh correlation
타법에의한중개업자 is highly imbalanced (88.9%)Imbalance
소속공인중개사 has 26 (10.3%) zerosZeros
중개보조원 has 3 (1.2%) zerosZeros

Reproduction

Analysis started2023-12-12 12:47:31.284017
Analysis finished2023-12-12 12:47:33.408541
Duration2.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도
Categorical

Distinct17
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
경기도
44 
서울특별시
25 
경상북도
23 
전라남도
22 
경상남도
22 
Other values (12)
116 

Length

Max length7
Median length5
Mean length4.3730159
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 44
17.5%
서울특별시 25
9.9%
경상북도 23
9.1%
전라남도 22
8.7%
경상남도 22
8.7%
강원특별자치도 18
7.1%
부산광역시 16
 
6.3%
충청남도 16
 
6.3%
전라북도 15
 
6.0%
충청북도 14
 
5.6%
Other values (7) 37
14.7%

Length

2023-12-12T21:47:33.489437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 44
17.5%
서울특별시 25
9.9%
경상북도 23
9.1%
전라남도 22
8.7%
경상남도 22
8.7%
강원특별자치도 18
7.1%
부산광역시 16
 
6.3%
충청남도 16
 
6.3%
전라북도 15
 
6.0%
충청북도 14
 
5.6%
Other values (7) 37
14.7%
Distinct230
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:47:33.782076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.3373016
Min length2

Characters and Unicode

Total characters841
Distinct characters142
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique223 ?
Unique (%)88.5%

Sample

1st row종로구
2nd row중구
3rd row용산구
4th row성동구
5th row광진구
ValueCountFrequency (%)
동구 6
 
2.4%
중구 6
 
2.4%
서구 5
 
2.0%
남구 4
 
1.6%
북구 4
 
1.6%
고성군 2
 
0.8%
강서구 2
 
0.8%
나주시 1
 
0.4%
화순군 1
 
0.4%
보성군 1
 
0.4%
Other values (220) 220
87.3%
2023-12-12T21:47:34.240552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
106
 
12.6%
102
 
12.1%
85
 
10.1%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
21
 
2.5%
20
 
2.4%
18
 
2.1%
Other values (132) 396
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 841
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
12.6%
102
 
12.1%
85
 
10.1%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
21
 
2.5%
20
 
2.4%
18
 
2.1%
Other values (132) 396
47.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 841
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
12.6%
102
 
12.1%
85
 
10.1%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
21
 
2.5%
20
 
2.4%
18
 
2.1%
Other values (132) 396
47.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 841
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
106
 
12.6%
102
 
12.1%
85
 
10.1%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
21
 
2.5%
20
 
2.4%
18
 
2.1%
Other values (132) 396
47.1%

합계
Real number (ℝ)

HIGH CORRELATION 

Distinct216
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean788.12302
Minimum1
Maximum8279
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T21:47:34.380299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.55
Q1102.75
median566
Q31252.25
95-th percentile2202.35
Maximum8279
Range8278
Interquartile range (IQR)1149.5

Descriptive statistics

Standard deviation902.89201
Coefficient of variation (CV)1.1456232
Kurtosis18.853548
Mean788.12302
Median Absolute Deviation (MAD)503
Skewness3.0477592
Sum198607
Variance815213.98
MonotonicityNot monotonic
2023-12-12T21:47:34.527294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16 4
 
1.6%
299 3
 
1.2%
127 3
 
1.2%
43 3
 
1.2%
54 3
 
1.2%
60 3
 
1.2%
8 2
 
0.8%
776 2
 
0.8%
30 2
 
0.8%
1450 2
 
0.8%
Other values (206) 225
89.3%
ValueCountFrequency (%)
1 1
 
0.4%
2 1
 
0.4%
8 2
0.8%
13 1
 
0.4%
15 1
 
0.4%
16 4
1.6%
19 1
 
0.4%
20 1
 
0.4%
21 1
 
0.4%
22 1
 
0.4%
ValueCountFrequency (%)
8279 1
0.4%
3971 1
0.4%
3908 1
0.4%
3319 1
0.4%
3206 1
0.4%
3130 1
0.4%
2556 1
0.4%
2502 1
0.4%
2465 1
0.4%
2374 1
0.4%

개업공인중개사
Real number (ℝ)

HIGH CORRELATION 

Distinct218
Distinct (%)86.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean461.25
Minimum1
Maximum2945
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T21:47:34.694200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.1
Q163.75
median341
Q3743.75
95-th percentile1266.25
Maximum2945
Range2944
Interquartile range (IQR)680

Descriptive statistics

Standard deviation473.49883
Coefficient of variation (CV)1.0265557
Kurtosis4.0725156
Mean461.25
Median Absolute Deviation (MAD)295
Skewness1.6257565
Sum116235
Variance224201.14
MonotonicityNot monotonic
2023-12-12T21:47:34.864743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13 3
 
1.2%
63 3
 
1.2%
36 3
 
1.2%
31 3
 
1.2%
11 3
 
1.2%
877 2
 
0.8%
30 2
 
0.8%
64 2
 
0.8%
42 2
 
0.8%
58 2
 
0.8%
Other values (208) 227
90.1%
ValueCountFrequency (%)
1 1
 
0.4%
2 1
 
0.4%
5 1
 
0.4%
8 1
 
0.4%
10 1
 
0.4%
11 3
1.2%
12 1
 
0.4%
13 3
1.2%
14 1
 
0.4%
16 2
0.8%
ValueCountFrequency (%)
2945 1
0.4%
2592 1
0.4%
1969 1
0.4%
1861 1
0.4%
1792 1
0.4%
1766 1
0.4%
1571 1
0.4%
1556 1
0.4%
1488 1
0.4%
1391 1
0.4%

소속공인중개사
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct121
Distinct (%)48.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean76.015873
Minimum0
Maximum1800
Zeros26
Zeros (%)10.3%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T21:47:35.054398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median31.5
Q3110.25
95-th percentile259.35
Maximum1800
Range1800
Interquartile range (IQR)108.25

Descriptive statistics

Standard deviation142.89801
Coefficient of variation (CV)1.8798443
Kurtosis86.08781
Mean76.015873
Median Absolute Deviation (MAD)31
Skewness7.732995
Sum19156
Variance20419.84
MonotonicityNot monotonic
2023-12-12T21:47:35.222304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 26
 
10.3%
1 21
 
8.3%
2 17
 
6.7%
3 9
 
3.6%
8 8
 
3.2%
4 6
 
2.4%
53 4
 
1.6%
106 4
 
1.6%
16 4
 
1.6%
19 3
 
1.2%
Other values (111) 150
59.5%
ValueCountFrequency (%)
0 26
10.3%
1 21
8.3%
2 17
6.7%
3 9
 
3.6%
4 6
 
2.4%
5 2
 
0.8%
6 2
 
0.8%
7 1
 
0.4%
8 8
 
3.2%
9 2
 
0.8%
ValueCountFrequency (%)
1800 1
0.4%
751 1
0.4%
492 1
0.4%
386 1
0.4%
317 1
0.4%
313 1
0.4%
301 1
0.4%
280 1
0.4%
271 1
0.4%
270 1
0.4%

중개보조원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct181
Distinct (%)71.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean250.81349
Minimum0
Maximum3531
Zeros3
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T21:47:35.365674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q129.75
median158
Q3377.5
95-th percentile762.2
Maximum3531
Range3531
Interquartile range (IQR)347.75

Descriptive statistics

Standard deviation325.62198
Coefficient of variation (CV)1.2982634
Kurtosis40.906091
Mean250.81349
Median Absolute Deviation (MAD)141
Skewness4.7344174
Sum63205
Variance106029.67
MonotonicityNot monotonic
2023-12-12T21:47:35.524332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13 5
 
2.0%
10 5
 
2.0%
3 5
 
2.0%
22 4
 
1.6%
15 4
 
1.6%
0 3
 
1.2%
9 3
 
1.2%
16 3
 
1.2%
17 3
 
1.2%
28 3
 
1.2%
Other values (171) 214
84.9%
ValueCountFrequency (%)
0 3
1.2%
3 5
2.0%
4 2
 
0.8%
5 2
 
0.8%
6 2
 
0.8%
7 1
 
0.4%
8 2
 
0.8%
9 3
1.2%
10 5
2.0%
11 2
 
0.8%
ValueCountFrequency (%)
3531 1
0.4%
1365 1
0.4%
1224 1
0.4%
1061 1
0.4%
1028 1
0.4%
1016 1
0.4%
995 1
0.4%
993 1
0.4%
813 1
0.4%
807 1
0.4%

타법에의한중개업자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
0
245 
1
 
4
2
 
2
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row0
2nd row2
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 245
97.2%
1 4
 
1.6%
2 2
 
0.8%
3 1
 
0.4%

Length

2023-12-12T21:47:35.663172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:47:35.765797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 245
97.2%
1 4
 
1.6%
2 2
 
0.8%
3 1
 
0.4%

Interactions

2023-12-12T21:47:32.868148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:31.652244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.080298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.504602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.946837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:31.742523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.178466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.592988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:33.032613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:31.856548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.291934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.683267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:33.123705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:31.972822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.405210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:47:32.776825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:47:35.835404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도합계개업공인중개사소속공인중개사중개보조원타법에의한중개업자
시도1.0000.4180.5340.0930.4340.000
합계0.4181.0000.9440.8630.8530.756
개업공인중개사0.5340.9441.0000.8780.8560.764
소속공인중개사0.0930.8630.8781.0000.9560.648
중개보조원0.4340.8530.8560.9561.0000.652
타법에의한중개업자0.0000.7560.7640.6480.6521.000
2023-12-12T21:47:35.935227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도타법에의한중개업자
시도1.0000.000
타법에의한중개업자0.0001.000
2023-12-12T21:47:36.016642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
합계개업공인중개사소속공인중개사중개보조원시도타법에의한중개업자
합계1.0000.9940.9680.9900.2060.592
개업공인중개사0.9941.0000.9510.9730.2410.606
소속공인중개사0.9680.9511.0000.9630.0440.577
중개보조원0.9900.9730.9631.0000.2340.581
시도0.2060.2410.0440.2341.0000.000
타법에의한중개업자0.5920.6060.5770.5810.0001.000

Missing values

2023-12-12T21:47:33.235847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:47:33.354615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도시군구합계개업공인중개사소속공인중개사중개보조원타법에의한중개업자
0서울특별시종로구1033572953660
1서울특별시중구10695991063622
2서울특별시용산구16079131815130
3서울특별시성동구15258552713990
4서울특별시광진구15509651004850
5서울특별시동대문구15679401314960
6서울특별시중랑구1303800924110
7서울특별시성북구13258781093380
8서울특별시강북구10386281043060
9서울특별시도봉구810523602270
시도시군구합계개업공인중개사소속공인중개사중개보조원타법에의한중개업자
242강원특별자치도횡성군166963670
243강원특별자치도영월군47313130
244강원특별자치도평창군85641200
245강원특별자치도정선군2717190
246강원특별자치도철원군54430110
247강원특별자치도화천군2116050
248강원특별자치도양구군1612040
249강원특별자치도인제군43270160
250강원특별자치도고성군55321220
251강원특별자치도양양군63402210