Overview

Dataset statistics

Number of variables6
Number of observations25
Missing cells6
Missing cells (%)4.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory54.3 B

Variable types

Categorical2
Text2
DateTime1
Numeric1

Dataset

Description주택법 제11조에 따라 충청남도 내 설립된 주택조합 현황으로 시군별, 조합명, 모집신고일자, 설립인가일자, 사업지 위치, 세대수 등의 정보를 제공
Author충청남도
URLhttps://www.data.go.kr/data/15112634/fileData.do

Alerts

시군 is highly overall correlated with 모집신고일자High correlation
모집신고일자 is highly overall correlated with 시군High correlation
설립인가일자 has 6 (24.0%) missing valuesMissing
조합명 has unique valuesUnique
사업지 위치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:34:00.004942
Analysis finished2023-12-12 07:34:00.617519
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
천안시
아산시
당진시
서산시
공주시
Other values (3)

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique4 ?
Unique (%)16.0%

Sample

1st row천안시
2nd row천안시
3rd row천안시
4th row천안시
5th row천안시

Common Values

ValueCountFrequency (%)
천안시 9
36.0%
아산시 5
20.0%
당진시 4
16.0%
서산시 3
 
12.0%
공주시 1
 
4.0%
계룡시 1
 
4.0%
예산군 1
 
4.0%
태안군 1
 
4.0%

Length

2023-12-12T16:34:00.691295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:34:00.815373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
천안시 9
36.0%
아산시 5
20.0%
당진시 4
16.0%
서산시 3
 
12.0%
공주시 1
 
4.0%
계룡시 1
 
4.0%
예산군 1
 
4.0%
태안군 1
 
4.0%

조합명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T16:34:01.039386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length11.84
Min length8

Characters and Unicode

Total characters296
Distinct characters88
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row직산월드메르디앙지역주택조합
2nd row봉명이편한세상지역주택조합
3rd row청당코오롱하늘채지역주택조합
4th row청수지역주택조합
5th row파크시티이편한세상지역주택조합
ValueCountFrequency (%)
지역주택조합 5
 
15.6%
직산월드메르디앙지역주택조합 1
 
3.1%
온양중심상권지역주택조합 1
 
3.1%
예산지역주택조합 1
 
3.1%
당진푸르지오3차지역주택조합 1
 
3.1%
가칭)당진송산지역주택조합 1
 
3.1%
당진아이파크지역주택조합 1
 
3.1%
당진신평지역주택조합 1
 
3.1%
계룡금암지역주택조합 1
 
3.1%
서산잠홍동지역주택조합 1
 
3.1%
Other values (18) 18
56.2%
2023-12-12T16:34:01.534072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
9.5%
26
 
8.8%
25
 
8.4%
25
 
8.4%
25
 
8.4%
25
 
8.4%
7
 
2.4%
6
 
2.0%
5
 
1.7%
4
 
1.4%
Other values (78) 120
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 283
95.6%
Space Separator 7
 
2.4%
Open Punctuation 2
 
0.7%
Close Punctuation 2
 
0.7%
Decimal Number 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
9.9%
26
 
9.2%
25
 
8.8%
25
 
8.8%
25
 
8.8%
25
 
8.8%
6
 
2.1%
5
 
1.8%
4
 
1.4%
4
 
1.4%
Other values (73) 110
38.9%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
3 1
50.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 283
95.6%
Common 13
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
9.9%
26
 
9.2%
25
 
8.8%
25
 
8.8%
25
 
8.8%
25
 
8.8%
6
 
2.1%
5
 
1.8%
4
 
1.4%
4
 
1.4%
Other values (73) 110
38.9%
Common
ValueCountFrequency (%)
7
53.8%
( 2
 
15.4%
) 2
 
15.4%
2 1
 
7.7%
3 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 283
95.6%
ASCII 13
 
4.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
 
9.9%
26
 
9.2%
25
 
8.8%
25
 
8.8%
25
 
8.8%
25
 
8.8%
6
 
2.1%
5
 
1.8%
4
 
1.4%
4
 
1.4%
Other values (73) 110
38.9%
ASCII
ValueCountFrequency (%)
7
53.8%
( 2
 
15.4%
) 2
 
15.4%
2 1
 
7.7%
3 1
 
7.7%

모집신고일자
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2017-6-3 부칙적용 제외 대상
16 
2018-05-23
 
1
2021-04-02
 
1
2022-10-18
 
1
2017-09-15
 
1
Other values (5)

Length

Max length19
Median length19
Mean length15.76
Min length10

Unique

Unique9 ?
Unique (%)36.0%

Sample

1st row2017-6-3 부칙적용 제외 대상
2nd row2017-6-3 부칙적용 제외 대상
3rd row2017-6-3 부칙적용 제외 대상
4th row2017-6-3 부칙적용 제외 대상
5th row2017-6-3 부칙적용 제외 대상

Common Values

ValueCountFrequency (%)
2017-6-3 부칙적용 제외 대상 16
64.0%
2018-05-23 1
 
4.0%
2021-04-02 1
 
4.0%
2022-10-18 1
 
4.0%
2017-09-15 1
 
4.0%
2021-11-11 1
 
4.0%
2020-06-03 1
 
4.0%
2018-08-06 1
 
4.0%
2020-07-27 1
 
4.0%
2022-11-02 1
 
4.0%

Length

2023-12-12T16:34:01.679177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:34:01.815628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-6-3 16
21.9%
부칙적용 16
21.9%
제외 16
21.9%
대상 16
21.9%
2018-05-23 1
 
1.4%
2021-04-02 1
 
1.4%
2022-10-18 1
 
1.4%
2017-09-15 1
 
1.4%
2021-11-11 1
 
1.4%
2020-06-03 1
 
1.4%
Other values (3) 3
 
4.1%

설립인가일자
Date

MISSING 

Distinct19
Distinct (%)100.0%
Missing6
Missing (%)24.0%
Memory size332.0 B
Minimum2014-11-04 00:00:00
Maximum2021-08-11 00:00:00
2023-12-12T16:34:01.965360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:34:02.067216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)

사업지 위치
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T16:34:02.269651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length17.8
Min length14

Characters and Unicode

Total characters445
Distinct characters75
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row직산읍 삼은리 36-3번지 일원
2nd row봉명동 204-3번지 일원
3rd row청당동 389-51번지 일원
4th row청당동 295-3번지 일원
5th row삼룡동 378-1번지 일원
ValueCountFrequency (%)
일원 24
23.5%
아산시 5
 
4.9%
당진시 4
 
3.9%
서산시 3
 
2.9%
온천동 2
 
2.0%
청당동 2
 
2.0%
북수리 2
 
2.0%
배방읍 2
 
2.0%
287번지 1
 
1.0%
금암동 1
 
1.0%
Other values (56) 56
54.9%
2023-12-12T16:34:02.611758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
17.3%
25
 
5.6%
25
 
5.6%
24
 
5.4%
24
 
5.4%
1 19
 
4.3%
- 16
 
3.6%
15
 
3.4%
3 14
 
3.1%
14
 
3.1%
Other values (65) 192
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 256
57.5%
Decimal Number 96
 
21.6%
Space Separator 77
 
17.3%
Dash Punctuation 16
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
9.8%
25
 
9.8%
24
 
9.4%
24
 
9.4%
15
 
5.9%
14
 
5.5%
12
 
4.7%
11
 
4.3%
10
 
3.9%
6
 
2.3%
Other values (53) 90
35.2%
Decimal Number
ValueCountFrequency (%)
1 19
19.8%
3 14
14.6%
8 10
10.4%
2 10
10.4%
6 9
9.4%
4 9
9.4%
5 7
 
7.3%
0 7
 
7.3%
7 6
 
6.2%
9 5
 
5.2%
Space Separator
ValueCountFrequency (%)
77
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 256
57.5%
Common 189
42.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
9.8%
25
 
9.8%
24
 
9.4%
24
 
9.4%
15
 
5.9%
14
 
5.5%
12
 
4.7%
11
 
4.3%
10
 
3.9%
6
 
2.3%
Other values (53) 90
35.2%
Common
ValueCountFrequency (%)
77
40.7%
1 19
 
10.1%
- 16
 
8.5%
3 14
 
7.4%
8 10
 
5.3%
2 10
 
5.3%
6 9
 
4.8%
4 9
 
4.8%
5 7
 
3.7%
0 7
 
3.7%
Other values (2) 11
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 256
57.5%
ASCII 189
42.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
77
40.7%
1 19
 
10.1%
- 16
 
8.5%
3 14
 
7.4%
8 10
 
5.3%
2 10
 
5.3%
6 9
 
4.8%
4 9
 
4.8%
5 7
 
3.7%
0 7
 
3.7%
Other values (2) 11
 
5.8%
Hangul
ValueCountFrequency (%)
25
 
9.8%
25
 
9.8%
24
 
9.4%
24
 
9.4%
15
 
5.9%
14
 
5.5%
12
 
4.7%
11
 
4.3%
10
 
3.9%
6
 
2.3%
Other values (53) 90
35.2%

세대수
Real number (ℝ)

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean573.2
Minimum199
Maximum1534
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T16:34:02.771821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum199
5-th percentile208
Q1386
median563
Q3657
95-th percentile1118
Maximum1534
Range1335
Interquartile range (IQR)271

Descriptive statistics

Standard deviation307.28299
Coefficient of variation (CV)0.53608337
Kurtosis3.0286969
Mean573.2
Median Absolute Deviation (MAD)157
Skewness1.513938
Sum14330
Variance94422.833
MonotonicityNot monotonic
2023-12-12T16:34:02.944093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
653 2
 
8.0%
374 1
 
4.0%
621 1
 
4.0%
386 1
 
4.0%
657 1
 
4.0%
667 1
 
4.0%
597 1
 
4.0%
426 1
 
4.0%
400 1
 
4.0%
199 1
 
4.0%
Other values (14) 14
56.0%
ValueCountFrequency (%)
199 1
4.0%
200 1
4.0%
240 1
4.0%
266 1
4.0%
340 1
4.0%
374 1
4.0%
386 1
4.0%
400 1
4.0%
415 1
4.0%
426 1
4.0%
ValueCountFrequency (%)
1534 1
4.0%
1135 1
4.0%
1050 1
4.0%
741 1
4.0%
720 1
4.0%
667 1
4.0%
657 1
4.0%
653 2
8.0%
621 1
4.0%
597 1
4.0%

Interactions

2023-12-12T16:34:00.332537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:34:03.119632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군조합명모집신고일자설립인가일자사업지 위치세대수
시군1.0001.0000.8191.0001.0000.000
조합명1.0001.0001.0001.0001.0001.000
모집신고일자0.8191.0001.0001.0001.0000.000
설립인가일자1.0001.0001.0001.0001.0001.000
사업지 위치1.0001.0001.0001.0001.0001.000
세대수0.0001.0000.0001.0001.0001.000
2023-12-12T16:34:03.227892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모집신고일자시군
모집신고일자1.0000.530
시군0.5301.000
2023-12-12T16:34:03.647540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세대수시군모집신고일자
세대수1.0000.0000.000
시군0.0001.0000.530
모집신고일자0.0000.5301.000

Missing values

2023-12-12T16:34:00.461982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:34:00.570604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군조합명모집신고일자설립인가일자사업지 위치세대수
0천안시직산월드메르디앙지역주택조합2017-6-3 부칙적용 제외 대상2014-11-04직산읍 삼은리 36-3번지 일원374
1천안시봉명이편한세상지역주택조합2017-6-3 부칙적용 제외 대상2015-01-30봉명동 204-3번지 일원459
2천안시청당코오롱하늘채지역주택조합2017-6-3 부칙적용 제외 대상2015-07-09청당동 389-51번지 일원1534
3천안시청수지역주택조합2017-6-3 부칙적용 제외 대상2015-10-06청당동 295-3번지 일원741
4천안시파크시티이편한세상지역주택조합2017-6-3 부칙적용 제외 대상2015-12-09삼룡동 378-1번지 일원1050
5천안시천안첨단지역주택조합2017-6-3 부칙적용 제외 대상2016-03-04성거읍 신월리 406-1번지 일원653
6천안시청수행정타운지역주택조합2017-6-3 부칙적용 제외 대상2016-07-13청수동 214-48번지 일원584
7천안시성환지구지역주택조합2018-05-232018-10-24성환읍 성환리 189-8번지 일원340
8천안시천안목천지역주택조합2017-6-3 부칙적용 제외 대상2019-04-24동남구 목천읍 응원리 166-1번지 일원1135
9공주시(가칭)공주금학동지역주택조합2021-04-02<NA>공주시 금학동 302번지 일원200
시군조합명모집신고일자설립인가일자사업지 위치세대수
15서산시예천지역주택조합2017-6-3 부칙적용 제외 대상2015-05-17서산시 예천동 507번지 일원653
16서산시서산석림 지역주택조합2017-6-3 부칙적용 제외 대상2016-04-12서산시 석림동 493-1번지 일원266
17서산시서산잠홍동지역주택조합2017-6-3 부칙적용 제외 대상2019-12-27서산시 잠홍동 541번지 일원450
18계룡시계룡금암지역주택조합2020-06-032021-07-07계룡시 금암동 287번지 일원199
19당진시당진신평지역주택조합2017-6-3 부칙적용 제외 대상2016-06-10당진시 신평면 금천리 460번지 일원400
20당진시당진아이파크지역주택조합2017-6-3 부칙적용 제외 대상2017-04-27당진시 읍내동 1687번지 일원426
21당진시(가칭)당진송산지역주택조합2018-08-06<NA>당진시 송산면 유곡리 1313597
22당진시당진푸르지오3차지역주택조합2020-07-272021-08-11당진시 송악읍 기지시리 512번지 일원667
23예산군예산지역주택조합2017-6-3 부칙적용 제외 대상2016-01-25예산군 대회리 235-4번지 일원657
24태안군가이아지역주택조합2022-11-02<NA>태안군 태안읍 반곡리 1148-50번지 일원386