Overview

Dataset statistics

Number of variables6
Number of observations21
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory57.3 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description경상남도 하동군에 있는 마을회관 (연번, 구분, 주소, 건물면적(제곱미터), 건축연도, 전화번호)의 정보를 제공하고 있습니다
URLhttps://www.data.go.kr/data/15085729/fileData.do

Alerts

연번 is highly overall correlated with 전화번호High correlation
전화번호 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
구분 has unique valuesUnique
주소 has unique valuesUnique
건물면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:11:47.763545
Analysis finished2023-12-12 12:11:49.360348
Duration1.6 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2023-12-12T21:11:49.416794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median11
Q316
95-th percentile20
Maximum21
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2048368
Coefficient of variation (CV)0.56407607
Kurtosis-1.2
Mean11
Median Absolute Deviation (MAD)5
Skewness0
Sum231
Variance38.5
MonotonicityStrictly increasing
2023-12-12T21:11:49.566007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 1
 
4.8%
2 1
 
4.8%
21 1
 
4.8%
20 1
 
4.8%
19 1
 
4.8%
18 1
 
4.8%
17 1
 
4.8%
16 1
 
4.8%
15 1
 
4.8%
14 1
 
4.8%
Other values (11) 11
52.4%
ValueCountFrequency (%)
1 1
4.8%
2 1
4.8%
3 1
4.8%
4 1
4.8%
5 1
4.8%
6 1
4.8%
7 1
4.8%
8 1
4.8%
9 1
4.8%
10 1
4.8%
ValueCountFrequency (%)
21 1
4.8%
20 1
4.8%
19 1
4.8%
18 1
4.8%
17 1
4.8%
16 1
4.8%
15 1
4.8%
14 1
4.8%
13 1
4.8%
12 1
4.8%

구분
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T21:11:49.806698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.047619
Min length6

Characters and Unicode

Total characters127
Distinct characters32
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row궁항마을회관
2nd row목도마을회관
3rd row신지마을회관
4th row고서마을회관
5th row정금마을회관
ValueCountFrequency (%)
궁항마을회관 1
 
4.8%
대촌마을회관 1
 
4.8%
신정마을회관 1
 
4.8%
월운마을회관 1
 
4.8%
술상마을회관 1
 
4.8%
서근마을회관 1
 
4.8%
청도마을회관 1
 
4.8%
영천마을회관 1
 
4.8%
진정마을회관 1
 
4.8%
중평마을회관 1
 
4.8%
Other values (11) 11
52.4%
2023-12-12T21:11:50.173808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
16.5%
21
16.5%
21
16.5%
21
16.5%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (22) 26
20.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 127
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
16.5%
21
16.5%
21
16.5%
21
16.5%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (22) 26
20.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 127
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
16.5%
21
16.5%
21
16.5%
21
16.5%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (22) 26
20.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 127
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
16.5%
21
16.5%
21
16.5%
21
16.5%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (22) 26
20.5%

주소
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T21:11:50.401119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length20.619048
Min length18

Characters and Unicode

Total characters433
Distinct characters66
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row경상남도 하동군 하동읍 신기궁항길 152
2nd row경상남도 하동군 하동읍 목도1길 143
3rd row경상남도 하동군 하동읍 섬진강대로 2487
4th row경상남도 하동군 하동읍 고서길 70
5th row경상남도 하동군 화개면 쌍계로 367
ValueCountFrequency (%)
경상남도 21
20.0%
하동군 21
20.0%
화개면 4
 
3.8%
하동읍 4
 
3.8%
악양면 4
 
3.8%
금남면 3
 
2.9%
진교면 2
 
1.9%
쌍계로 2
 
1.9%
금성면 2
 
1.9%
7 2
 
1.9%
Other values (40) 40
38.1%
2023-12-12T21:11:50.771004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
19.4%
25
 
5.8%
25
 
5.8%
24
 
5.5%
24
 
5.5%
24
 
5.5%
21
 
4.8%
21
 
4.8%
17
 
3.9%
16
 
3.7%
Other values (56) 152
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 285
65.8%
Space Separator 84
 
19.4%
Decimal Number 60
 
13.9%
Dash Punctuation 4
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
8.8%
25
 
8.8%
24
 
8.4%
24
 
8.4%
24
 
8.4%
21
 
7.4%
21
 
7.4%
17
 
6.0%
16
 
5.6%
6
 
2.1%
Other values (44) 82
28.8%
Decimal Number
ValueCountFrequency (%)
1 12
20.0%
2 9
15.0%
3 9
15.0%
7 8
13.3%
4 6
10.0%
6 5
8.3%
8 5
8.3%
9 2
 
3.3%
0 2
 
3.3%
5 2
 
3.3%
Space Separator
ValueCountFrequency (%)
84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 285
65.8%
Common 148
34.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
8.8%
25
 
8.8%
24
 
8.4%
24
 
8.4%
24
 
8.4%
21
 
7.4%
21
 
7.4%
17
 
6.0%
16
 
5.6%
6
 
2.1%
Other values (44) 82
28.8%
Common
ValueCountFrequency (%)
84
56.8%
1 12
 
8.1%
2 9
 
6.1%
3 9
 
6.1%
7 8
 
5.4%
4 6
 
4.1%
6 5
 
3.4%
8 5
 
3.4%
- 4
 
2.7%
9 2
 
1.4%
Other values (2) 4
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 285
65.8%
ASCII 148
34.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
84
56.8%
1 12
 
8.1%
2 9
 
6.1%
3 9
 
6.1%
7 8
 
5.4%
4 6
 
4.1%
6 5
 
3.4%
8 5
 
3.4%
- 4
 
2.7%
9 2
 
1.4%
Other values (2) 4
 
2.7%
Hangul
ValueCountFrequency (%)
25
 
8.8%
25
 
8.8%
24
 
8.4%
24
 
8.4%
24
 
8.4%
21
 
7.4%
21
 
7.4%
17
 
6.0%
16
 
5.6%
6
 
2.1%
Other values (44) 82
28.8%

건물면적(제곱미터)
Real number (ℝ)

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean175.32762
Minimum70.5
Maximum447.43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2023-12-12T21:11:50.928041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum70.5
5-th percentile82.86
Q1100.58
median141.35
Q3200.7
95-th percentile447.09
Maximum447.43
Range376.93
Interquartile range (IQR)100.12

Descriptive statistics

Standard deviation105.92849
Coefficient of variation (CV)0.6041746
Kurtosis2.6287758
Mean175.32762
Median Absolute Deviation (MAD)43.33
Skewness1.6986192
Sum3681.88
Variance11220.846
MonotonicityNot monotonic
2023-12-12T21:11:51.083395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
132.24 1
 
4.8%
141.35 1
 
4.8%
152.82 1
 
4.8%
70.5 1
 
4.8%
98.04 1
 
4.8%
447.09 1
 
4.8%
174.46 1
 
4.8%
128.02 1
 
4.8%
264.4 1
 
4.8%
87.66 1
 
4.8%
Other values (11) 11
52.4%
ValueCountFrequency (%)
70.5 1
4.8%
82.86 1
4.8%
87.66 1
4.8%
98.04 1
4.8%
100.0 1
4.8%
100.58 1
4.8%
102.81 1
4.8%
106.29 1
4.8%
128.02 1
4.8%
132.24 1
4.8%
ValueCountFrequency (%)
447.43 1
4.8%
447.09 1
4.8%
264.4 1
4.8%
241.33 1
4.8%
231.11 1
4.8%
200.7 1
4.8%
187.51 1
4.8%
184.68 1
4.8%
174.46 1
4.8%
152.82 1
4.8%

건축연도
Real number (ℝ)

Distinct14
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1996.2857
Minimum1971
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2023-12-12T21:11:51.277882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1971
5-th percentile1975
Q11993
median1998
Q32002
95-th percentile2020
Maximum2022
Range51
Interquartile range (IQR)9

Descriptive statistics

Standard deviation13.715476
Coefficient of variation (CV)0.0068704976
Kurtosis-0.17316243
Mean1996.2857
Median Absolute Deviation (MAD)5
Skewness-0.20042591
Sum41922
Variance188.11429
MonotonicityNot monotonic
2023-12-12T21:11:51.442397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
1998 5
23.8%
2002 3
14.3%
1993 2
 
9.5%
1981 1
 
4.8%
1997 1
 
4.8%
1976 1
 
4.8%
2005 1
 
4.8%
1971 1
 
4.8%
2022 1
 
4.8%
2006 1
 
4.8%
Other values (4) 4
19.0%
ValueCountFrequency (%)
1971 1
 
4.8%
1975 1
 
4.8%
1976 1
 
4.8%
1978 1
 
4.8%
1981 1
 
4.8%
1993 2
 
9.5%
1997 1
 
4.8%
1998 5
23.8%
2002 3
14.3%
2005 1
 
4.8%
ValueCountFrequency (%)
2022 1
 
4.8%
2020 1
 
4.8%
2009 1
 
4.8%
2006 1
 
4.8%
2005 1
 
4.8%
2002 3
14.3%
1998 5
23.8%
1997 1
 
4.8%
1993 2
 
9.5%
1981 1
 
4.8%

전화번호
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Memory size300.0 B
055-880-6042
055-880-6077
055-880-6107
055-880-6227
055-880-6257
Other values (3)

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique2 ?
Unique (%)9.5%

Sample

1st row055-880-6042
2nd row055-880-6042
3rd row055-880-6042
4th row055-880-6042
5th row055-880-6077

Common Values

ValueCountFrequency (%)
055-880-6042 4
19.0%
055-880-6077 4
19.0%
055-880-6107 4
19.0%
055-880-6227 3
14.3%
055-880-6257 2
9.5%
055-880-6292 2
9.5%
055-880-6327 1
 
4.8%
055-880-6357 1
 
4.8%

Length

2023-12-12T21:11:51.585324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:11:51.736040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
055-880-6042 4
19.0%
055-880-6077 4
19.0%
055-880-6107 4
19.0%
055-880-6227 3
14.3%
055-880-6257 2
9.5%
055-880-6292 2
9.5%
055-880-6327 1
 
4.8%
055-880-6357 1
 
4.8%

Interactions

2023-12-12T21:11:48.669694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.014680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.338472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.761522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.118071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.466246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.854958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.226559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:11:48.556010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:11:51.844609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분주소건물면적(제곱미터)건축연도전화번호
연번1.0001.0001.0000.5490.2390.830
구분1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
건물면적(제곱미터)0.5491.0001.0001.0000.7310.374
건축연도0.2391.0001.0000.7311.0000.483
전화번호0.8301.0001.0000.3740.4831.000
2023-12-12T21:11:51.960994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번건물면적(제곱미터)건축연도전화번호
연번1.0000.0560.2580.673
건물면적(제곱미터)0.0561.0000.1910.135
건축연도0.2580.1911.0000.221
전화번호0.6730.1350.2211.000

Missing values

2023-12-12T21:11:48.955062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:11:49.321710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분주소건물면적(제곱미터)건축연도전화번호
01궁항마을회관경상남도 하동군 하동읍 신기궁항길 152132.241981055-880-6042
12목도마을회관경상남도 하동군 하동읍 목도1길 143141.351998055-880-6042
23신지마을회관경상남도 하동군 하동읍 섬진강대로 2487106.291997055-880-6042
34고서마을회관경상남도 하동군 하동읍 고서길 70102.811976055-880-6042
45정금마을회관경상남도 하동군 화개면 쌍계로 367200.72005055-880-6077
56신촌마을회관경상남도 하동군 화개면 쌍계로 448-282.862002055-880-6077
67석문마을회관경상남도 하동군 화개면 차시배지길 7100.581971055-880-6077
78단천마을회관경상남도 하동군 화개면 단천길 139231.112022055-880-6077
89대축마을회관경상남도 하동군 악양면 대축길 26184.682006055-880-6107
910상중대마을회관경상남도 하동군 악양면 상중대1길 16100.01993055-880-6107
연번구분주소건물면적(제곱미터)건축연도전화번호
1112대촌마을회관경상남도 하동군 악양면 대촌길 7241.331975055-880-6107
1213중평마을회관경상남도 하동군 금남면 중평해안길 82447.431998055-880-6227
1314진정마을회관경상남도 하동군 금남면 진정1길 2887.662002055-880-6227
1415영천마을회관경상남도 하동군 금남면 영천안길 43264.41978055-880-6227
1516청도마을회관경상남도 하동군 금성면 내도청도길 146128.021998055-880-6257
1617서근마을회관경상남도 하동군 금성면 금성로 386174.461998055-880-6257
1718술상마을회관경상남도 하동군 진교면 술상길 57-3447.092009055-880-6292
1819월운마을회관경상남도 하동군 진교면 월운길 213-198.042002055-880-6292
1920신정마을회관경상남도 하동군 양보면 양보로 27070.51993055-880-6327
2021상촌마을회관경상남도 하동군 북천면 상촌길 137152.822020055-880-6357