Overview

Dataset statistics

Number of variables8
Number of observations59
Missing cells5
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory67.2 B

Variable types

Numeric1
Categorical3
Text3
DateTime1

Dataset

Description전라남도 담양군에 소재한 부동산중개업소의 업소명, 소재지, 영업상태 등에 대한 현황 정보를공공데이터로 제공하고 있습니다.
Author전라남도 담양군
URLhttps://www.data.go.kr/data/15043148/fileData.do

Alerts

데이터기준일 has constant value ""Constant
중개업소구분 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
직위 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
행정처분상태 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 행정처분상태 and 2 other fieldsHigh correlation
행정처분상태 is highly imbalanced (87.6%)Imbalance
중개업소구분 is highly imbalanced (87.6%)Imbalance
직위 is highly imbalanced (87.6%)Imbalance
연번 has 1 (1.7%) missing valuesMissing
중개업소명 has 1 (1.7%) missing valuesMissing
중개업자명 has 1 (1.7%) missing valuesMissing
사무소주소 has 1 (1.7%) missing valuesMissing
데이터기준일 has 1 (1.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 02:54:13.931613
Analysis finished2023-12-12 02:54:15.214753
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct58
Distinct (%)100.0%
Missing1
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean29.5
Minimum1
Maximum58
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T11:54:15.326361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.85
Q115.25
median29.5
Q343.75
95-th percentile55.15
Maximum58
Range57
Interquartile range (IQR)28.5

Descriptive statistics

Standard deviation16.886879
Coefficient of variation (CV)0.57243656
Kurtosis-1.2
Mean29.5
Median Absolute Deviation (MAD)14.5
Skewness0
Sum1711
Variance285.16667
MonotonicityStrictly increasing
2023-12-12T11:54:15.527688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
45 1
 
1.7%
33 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
Other values (48) 48
81.4%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%
50 1
1.7%
49 1
1.7%

행정처분상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size604.0 B
영업중
58 
<NA>
 
1

Length

Max length4
Median length3
Mean length3.0169492
Min length3

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 58
98.3%
<NA> 1
 
1.7%

Length

2023-12-12T11:54:15.767391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:54:15.927937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 58
98.3%
na 1
 
1.7%

중개업소명
Text

MISSING 

Distinct58
Distinct (%)100.0%
Missing1
Missing (%)1.7%
Memory size604.0 B
2023-12-12T11:54:16.188740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15.5
Mean length10.5
Min length5

Characters and Unicode

Total characters609
Distinct characters107
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row대우공인중개사
2nd row만남공인중개사사무소
3rd row영일공인중개사사무소
4th row현대부동산
5th row태공인중개사사무소
ValueCountFrequency (%)
대우공인중개사 1
 
1.7%
수북공인중개사사무소 1
 
1.7%
참조은공인중개사사무소 1
 
1.7%
가사문학공인중개사사무소 1
 
1.7%
창평공인중개사사무소 1
 
1.7%
창평황금공인중개사사무소 1
 
1.7%
대지공인중개사사무소 1
 
1.7%
창평믿음공인중개사사무소 1
 
1.7%
금성호수부동산중개사무소 1
 
1.7%
금성공인중개사사무소 1
 
1.7%
Other values (48) 48
82.8%
2023-12-12T11:54:16.684366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
106
17.4%
58
 
9.5%
57
 
9.4%
53
 
8.7%
53
 
8.7%
52
 
8.5%
52
 
8.5%
9
 
1.5%
8
 
1.3%
8
 
1.3%
Other values (97) 153
25.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 606
99.5%
Decimal Number 2
 
0.3%
Space Separator 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
17.5%
58
 
9.6%
57
 
9.4%
53
 
8.7%
53
 
8.7%
52
 
8.6%
52
 
8.6%
9
 
1.5%
8
 
1.3%
8
 
1.3%
Other values (95) 150
24.8%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 606
99.5%
Common 3
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
17.5%
58
 
9.6%
57
 
9.4%
53
 
8.7%
53
 
8.7%
52
 
8.6%
52
 
8.6%
9
 
1.5%
8
 
1.3%
8
 
1.3%
Other values (95) 150
24.8%
Common
ValueCountFrequency (%)
1 2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 606
99.5%
ASCII 3
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
106
17.5%
58
 
9.6%
57
 
9.4%
53
 
8.7%
53
 
8.7%
52
 
8.6%
52
 
8.6%
9
 
1.5%
8
 
1.3%
8
 
1.3%
Other values (95) 150
24.8%
ASCII
ValueCountFrequency (%)
1 2
66.7%
1
33.3%

중개업소구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size604.0 B
공인중개사
58 
<NA>
 
1

Length

Max length5
Median length5
Mean length4.9830508
Min length4

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row공인중개사
2nd row공인중개사
3rd row공인중개사
4th row공인중개사
5th row공인중개사

Common Values

ValueCountFrequency (%)
공인중개사 58
98.3%
<NA> 1
 
1.7%

Length

2023-12-12T11:54:16.856487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:54:16.973248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공인중개사 58
98.3%
na 1
 
1.7%

중개업자명
Text

MISSING 

Distinct58
Distinct (%)100.0%
Missing1
Missing (%)1.7%
Memory size604.0 B
2023-12-12T11:54:17.226558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters174
Distinct characters88
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row이동근
2nd row양은자
3rd row김인숙
4th row오은수
5th row국승근
ValueCountFrequency (%)
이동근 1
 
1.7%
허준석 1
 
1.7%
박상용 1
 
1.7%
위성차 1
 
1.7%
고영필 1
 
1.7%
이승희 1
 
1.7%
김성규 1
 
1.7%
송광자 1
 
1.7%
공충용 1
 
1.7%
이성순 1
 
1.7%
Other values (48) 48
82.8%
2023-12-12T11:54:17.756818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
5.2%
7
 
4.0%
7
 
4.0%
6
 
3.4%
6
 
3.4%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (78) 118
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 174
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
5.2%
7
 
4.0%
7
 
4.0%
6
 
3.4%
6
 
3.4%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (78) 118
67.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 174
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
5.2%
7
 
4.0%
7
 
4.0%
6
 
3.4%
6
 
3.4%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (78) 118
67.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 174
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
5.2%
7
 
4.0%
7
 
4.0%
6
 
3.4%
6
 
3.4%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
Other values (78) 118
67.8%

직위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size604.0 B
대표
58 
<NA>
 
1

Length

Max length4
Median length2
Mean length2.0338983
Min length2

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row대표
2nd row대표
3rd row대표
4th row대표
5th row대표

Common Values

ValueCountFrequency (%)
대표 58
98.3%
<NA> 1
 
1.7%

Length

2023-12-12T11:54:18.044312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:54:18.214338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대표 58
98.3%
na 1
 
1.7%

사무소주소
Text

MISSING 

Distinct56
Distinct (%)96.6%
Missing1
Missing (%)1.7%
Memory size604.0 B
2023-12-12T11:54:18.569998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length22.189655
Min length18

Characters and Unicode

Total characters1287
Distinct characters84
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)93.1%

Sample

1st row전라남도 담양군 담양읍 천변5길 17
2nd row전라남도 담양군 담양읍 죽녹원로 42
3rd row전라남도 담양군 담양읍 중앙로 115
4th row전라남도 담양군 담양읍 추성로 1337
5th row전라남도 담양군 담양읍 추성로 1368
ValueCountFrequency (%)
전라남도 58
19.4%
담양군 58
19.4%
담양읍 30
 
10.0%
추성로 11
 
3.7%
수북면 11
 
3.7%
추성1로 10
 
3.3%
대전면 8
 
2.7%
창평면 4
 
1.3%
한수동로 4
 
1.3%
상가 3
 
1.0%
Other values (88) 102
34.1%
2023-12-12T11:54:19.133423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
241
18.7%
91
 
7.1%
90
 
7.0%
67
 
5.2%
58
 
4.5%
58
 
4.5%
58
 
4.5%
58
 
4.5%
1 58
 
4.5%
43
 
3.3%
Other values (74) 465
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 801
62.2%
Space Separator 241
 
18.7%
Decimal Number 221
 
17.2%
Dash Punctuation 13
 
1.0%
Other Punctuation 5
 
0.4%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
11.4%
90
11.2%
67
 
8.4%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
43
 
5.4%
30
 
3.7%
28
 
3.5%
Other values (59) 220
27.5%
Decimal Number
ValueCountFrequency (%)
1 58
26.2%
3 30
13.6%
2 29
13.1%
5 24
10.9%
6 18
 
8.1%
4 15
 
6.8%
0 14
 
6.3%
7 13
 
5.9%
8 13
 
5.9%
9 7
 
3.2%
Space Separator
ValueCountFrequency (%)
241
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 801
62.2%
Common 486
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
11.4%
90
11.2%
67
 
8.4%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
43
 
5.4%
30
 
3.7%
28
 
3.5%
Other values (59) 220
27.5%
Common
ValueCountFrequency (%)
241
49.6%
1 58
 
11.9%
3 30
 
6.2%
2 29
 
6.0%
5 24
 
4.9%
6 18
 
3.7%
4 15
 
3.1%
0 14
 
2.9%
7 13
 
2.7%
8 13
 
2.7%
Other values (5) 31
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 801
62.2%
ASCII 486
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
241
49.6%
1 58
 
11.9%
3 30
 
6.2%
2 29
 
6.0%
5 24
 
4.9%
6 18
 
3.7%
4 15
 
3.1%
0 14
 
2.9%
7 13
 
2.7%
8 13
 
2.7%
Other values (5) 31
 
6.4%
Hangul
ValueCountFrequency (%)
91
11.4%
90
11.2%
67
 
8.4%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
43
 
5.4%
30
 
3.7%
28
 
3.5%
Other values (59) 220
27.5%

데이터기준일
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)1.7%
Missing1
Missing (%)1.7%
Memory size604.0 B
Minimum2023-09-18 00:00:00
Maximum2023-09-18 00:00:00
2023-12-12T11:54:19.689423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:54:19.869615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T11:54:14.471643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:54:19.976695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번중개업소명중개업자명사무소주소
연번1.0001.0001.0000.967
중개업소명1.0001.0001.0001.000
중개업자명1.0001.0001.0001.000
사무소주소0.9671.0001.0001.000
2023-12-12T11:54:20.107017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중개업소구분직위행정처분상태
중개업소구분1.0001.0001.000
직위1.0001.0001.000
행정처분상태1.0001.0001.000
2023-12-12T11:54:20.241361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정처분상태중개업소구분직위
연번1.0001.0001.0001.000
행정처분상태1.0001.0001.0001.000
중개업소구분1.0001.0001.0001.000
직위1.0001.0001.0001.000

Missing values

2023-12-12T11:54:14.665719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:54:14.863091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:54:15.072541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번행정처분상태중개업소명중개업소구분중개업자명직위사무소주소데이터기준일
01영업중대우공인중개사공인중개사이동근대표전라남도 담양군 담양읍 천변5길 172023-09-18
12영업중만남공인중개사사무소공인중개사양은자대표전라남도 담양군 담양읍 죽녹원로 422023-09-18
23영업중영일공인중개사사무소공인중개사김인숙대표전라남도 담양군 담양읍 중앙로 1152023-09-18
34영업중현대부동산공인중개사오은수대표전라남도 담양군 담양읍 추성로 13372023-09-18
45영업중태공인중개사사무소공인중개사국승근대표전라남도 담양군 담양읍 추성로 13682023-09-18
56영업중스타공인중개사사무소공인중개사양누리대표전라남도 담양군 담양읍 죽향대로 11952023-09-18
67영업중유익한부동산중개사무소공인중개사정영민대표전라남도 담양군 담양읍 추성로 13532023-09-18
78영업중담양이삭공인중개사사무소공인중개사양예정대표전라남도 담양군 담양읍 추성로 13552023-09-18
89영업중향송공인중개사사무소공인중개사서수연대표전라남도 담양군 담양읍 추성로 12072023-09-18
910영업중향송1공인중개사사무소공인중개사임준택대표전라남도 담양군 담양읍 추성로 12072023-09-18
연번행정처분상태중개업소명중개업소구분중개업자명직위사무소주소데이터기준일
4950영업중윤원창공인중개사사무소공인중개사윤원창대표전라남도 담양군 수북면 한수동로 5642023-09-18
5051영업중성공시대공인중개사공인중개사홍성공대표전라남도 담양군 대전면 추성1로 2222023-09-18
5152영업중한결공인중개사사무소공인중개사한보영대표전라남도 담양군 대전면 갑향길 154-12023-09-18
5253영업중형제공인중개사공인중개사이현상대표전라남도 담양군 대전면 대전로 422023-09-18
5354영업중하나로종합부동산공인중개사사무소공인중개사김초록대표전라남도 담양군 대전면 추성1로 4362023-09-18
5455영업중대전공인중개사사무소공인중개사강해원대표전라남도 담양군 대전면 추성1로 2012023-09-18
5556영업중정평공인중개사사무소공인중개사임우훈대표전라남도 담양군 대전면 대치5길 1202023-09-18
5657영업중이어도공인중개사사무소공인중개사이관섭대표전라남도 담양군 대전면 추성1로 5352023-09-18
5758영업중삼성공인중개사사무소공인중개사임동수대표전라남도 담양군 대전면 추성1로 5382023-09-18
58<NA><NA><NA><NA><NA><NA><NA><NA>