Overview

Dataset statistics

Number of variables: 5
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 44.9 B

Variable types

Numeric: 1
Text: 2
Categorical: 1
DateTime: 1

Dataset

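Description: Status of buildings in Yecheon-gun (예천군) that are subject to asbestos inspection (번호 [number], 건물명 [building name], 주소 [address], 건축물구분 [building category], 데이터기준일자 [data reference date], etc.).
URL: https://www.data.go.kr/data/3047310/fileData.do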

Alerts

데이터기준일자 has constant value "" (Constant)
구분 is highly imbalanced (71.5%) (Imbalance)
번호 has unique values (Unique)
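
The 71.5% in the imbalance alert is consistent with a score defined as one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition and uses the 구분 counts listed in this report (31, 1, 1, 1). The function name `imbalance_score` is illustrative and is not taken from the profiling library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class distribution and k is the number of classes.

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. This definition is an assumption, chosen because it
    reproduces the percentage shown in the alert.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


# Class counts for the 구분 column as listed in this report.
counts = [31, 1, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # 71.5%
```

A score this high reflects that 31 of the 34 rows fall into a single class, so the column carries little information for grouping or modelling.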

Reproduction

Analysis started: 2023-12-12 02:50:25.920562
Analysis finished: 2023-12-12 02:50:26.372993
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.5
Minimum: 1
Maximum: 34
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.65
Q1: 9.25
Median: 17.5
Q3: 25.75
95-th percentile: 32.35
Maximum: 34
Range: 33
Interquartile range (IQR): 16.5

Descriptive statistics

Standard deviation: 9.9582462
Coefficient of variation (CV): 0.56904264
Kurtosis: -1.2
Mean: 17.5
Median Absolute Deviation (MAD): 8.5
Skewness: 0
Sum: 595
Variance: 99.166667
Monotonicity: Strictly increasing
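
Because 번호 is unique, strictly increasing, and spans 1 to 34 over 34 rows, it must be the sequence 1, 2, ..., 34, so most of the figures above can be checked by hand. The sketch below recomputes them with the Python standard library. It assumes the sample variance (n - 1 denominator) and linearly interpolated quantiles, which is what the reported numbers are consistent with. Kurtosis and skewness are left out.

```python
import statistics

# 번호 is a unique, strictly increasing integer column from 1 to 34.
values = list(range(1, 35))

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median([abs(v - median) for v in values])

# "inclusive" quantiles interpolate linearly between the min and max.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]
iqr = q3 - q1

print(total, mean, round(variance, 6), round(std, 7), round(cv, 8))
print(p5, q1, median, q3, p95, iqr, mad)
```

For an integer sequence 1..n the sample variance reduces to n(n + 1)/12, which gives 34 × 35 / 12 ≈ 99.1667 here.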
Histogram with fixed size bins (bins=34)
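| Value | Count | Frequency (%) |
| 1 | 1 | 2.9% |
| 27 | 1 | 2.9% |
| 21 | 1 | 2.9% |
| 22 | 1 | 2.9% |
| 23 | 1 | 2.9% |
| 24 | 1 | 2.9% |
| 25 | 1 | 2.9% |
| 26 | 1 | 2.9% |
| 28 | 1 | 2.9% |
| 19 | 1 | 2.9% |
| Other values (24) | 24 | 70.6% |

| Value | Count | Frequency (%) |
| 1 | 1 | 2.9% |
| 2 | 1 | 2.9% |
| 3 | 1 | 2.9% |
| 4 | 1 | 2.9% |
| 5 | 1 | 2.9% |
| 6 | 1 | 2.9% |
| 7 | 1 | 2.9% |
| 8 | 1 | 2.9% |
| 9 | 1 | 2.9% |
| 10 | 1 | 2.9% |

| Value | Count | Frequency (%) |
| 34 | 1 | 2.9% |
| 33 | 1 | 2.9% |
| 32 | 1 | 2.9% |
| 31 | 1 | 2.9% |
| 30 | 1 | 2.9% |
| 29 | 1 | 2.9% |
| 28 | 1 | 2.9% |
| 27 | 1 | 2.9% |
| 26 | 1 | 2.9% |
| 25 | 1 | 2.9% |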

건물명
Text

Distinct: 21
Distinct (%): 61.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 16
Median length: 10
Mean length: 7.3529412
Min length: 4

Characters and Unicode

Total characters: 250
Distinct characters: 74
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 14
Unique (%): 41.2%

Sample

1st row: 성락어린이집
2nd row: 예천군산림조합
3rd row: 예천공공하수처리시설
4th row: 예천공설운동장
5th row: 예천문화회관

| Value | Count | Frequency (%) |
| 예천농협 | 7 | 17.1% |
| 예천지보농협 | 4 | 9.8% |
| 예천군농촌지도소 | 3 | 7.3% |
| 예천공공하수처리시설 | 2 | 4.9% |
| 한국전력 | 2 | 4.9% |
| 예천지사 | 2 | 4.9% |
| 남예천농협 | 2 | 4.9% |
| 예천군산림조합 | 2 | 4.9% |
| 본점 | 1 | 2.4% |
| 성락어린이집 | 1 | 2.4% |
| Other values (15) | 15 | 36.6% |

Most occurring characters

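| Value | Count | Frequency (%) |
|  | 32 | 12.8% |
|  | 32 | 12.8% |
|  | 21 | 8.4% |
|  | 17 | 6.8% |
|  | 13 | 5.2% |
| (space) | 8 | 3.2% |
|  | 6 | 2.4% |
|  | 5 | 2.0% |
|  | 5 | 2.0% |
|  | 5 | 2.0% |
| Other values (64) | 106 | 42.4% |

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 240 | 96.0% |
| Space Separator | 8 | 3.2% |
| Close Punctuation | 1 | 0.4% |
| Open Punctuation | 1 | 0.4% |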

Most frequent character per category

Other Letter
| Value | Count | Frequency (%) |
|  | 32 | 13.3% |
|  | 32 | 13.3% |
|  | 21 | 8.8% |
|  | 17 | 7.1% |
|  | 13 | 5.4% |
|  | 6 | 2.5% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 4 | 1.7% |
| Other values (61) | 100 | 41.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 240 | 96.0% |
| Common | 10 | 4.0% |

Most frequent character per script

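Hangul
| Value | Count | Frequency (%) |
|  | 32 | 13.3% |
|  | 32 | 13.3% |
|  | 21 | 8.8% |
|  | 17 | 7.1% |
|  | 13 | 5.4% |
|  | 6 | 2.5% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 4 | 1.7% |
| Other values (61) | 100 | 41.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 8 | 80.0% |
| ) | 1 | 10.0% |
| ( | 1 | 10.0% |

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 240 | 96.0% |
| ASCII | 10 | 4.0% |

Most frequent character per block

Hangul
| Value | Count | Frequency (%) |
|  | 32 | 13.3% |
|  | 32 | 13.3% |
|  | 21 | 8.8% |
|  | 17 | 7.1% |
|  | 13 | 5.4% |
|  | 6 | 2.5% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 5 | 2.1% |
|  | 4 | 1.7% |
| Other values (61) | 100 | 41.7% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 8 | 80.0% |
| ) | 1 | 10.0% |
| ( | 1 | 10.0% |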

주소
Text

Distinct: 28
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 35
Median length: 20
Mean length: 20.5
Min length: 19

Characters and Unicode

Total characters: 697
Distinct characters: 64
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 23
Unique (%): 67.6%

Sample

1st row: 경상북도 예천군 예천읍 효자로 196
2nd row: 경상북도 예천군 예천읍 군청앞길 13
3rd row: 경상북도 예천군 예천읍 상동길 49-50
4th row: 경상북도 예천군 예천읍 충효로 395
5th row: 경상북도 예천군 예천읍 서본리 240 예천군문화회관, 36827

| Value | Count | Frequency (%) |
| 경상북도 | 34 | 19.8% |
| 예천군 | 34 | 19.8% |
| 예천읍 | 20 | 11.6% |
| 충효로 | 5 | 2.9% |
| 효자로 | 4 | 2.3% |
| 지보면 | 4 | 2.3% |
| 433 | 3 | 1.7% |
| 49-50 | 2 | 1.2% |
| 시장로 | 2 | 1.2% |
| 용궁면 | 2 | 1.2% |
| Other values (52) | 62 | 36.0% |

Most occurring characters

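| Value | Count | Frequency (%) |
| (space) | 138 | 19.8% |
|  | 56 | 8.0% |
|  | 56 | 8.0% |
|  | 38 | 5.5% |
|  | 36 | 5.2% |
|  | 35 | 5.0% |
|  | 34 | 4.9% |
|  | 34 | 4.9% |
| 1 | 23 | 3.3% |
|  | 20 | 2.9% |
| Other values (54) | 227 | 32.6% |

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 456 | 65.4% |
| Space Separator | 138 | 19.8% |
| Decimal Number | 99 | 14.2% |
| Dash Punctuation | 3 | 0.4% |
| Other Punctuation | 1 | 0.1% |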

Most frequent character per category

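Other Letter
| Value | Count | Frequency (%) |
|  | 56 | 12.3% |
|  | 56 | 12.3% |
|  | 38 | 8.3% |
|  | 36 | 7.9% |
|  | 35 | 7.7% |
|  | 34 | 7.5% |
|  | 34 | 7.5% |
|  | 20 | 4.4% |
|  | 20 | 4.4% |
|  | 14 | 3.1% |
| Other values (41) | 113 | 24.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 23 | 23.2% |
| 3 | 12 | 12.1% |
| 4 | 12 | 12.1% |
| 9 | 8 | 8.1% |
| 0 | 8 | 8.1% |
| 6 | 8 | 8.1% |
| 5 | 8 | 8.1% |
| 7 | 7 | 7.1% |
| 2 | 7 | 7.1% |
| 8 | 6 | 6.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 138 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 456 | 65.4% |
| Common | 241 | 34.6% |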

Most frequent character per script

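Hangul
| Value | Count | Frequency (%) |
|  | 56 | 12.3% |
|  | 56 | 12.3% |
|  | 38 | 8.3% |
|  | 36 | 7.9% |
|  | 35 | 7.7% |
|  | 34 | 7.5% |
|  | 34 | 7.5% |
|  | 20 | 4.4% |
|  | 20 | 4.4% |
|  | 14 | 3.1% |
| Other values (41) | 113 | 24.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 138 | 57.3% |
| 1 | 23 | 9.5% |
| 3 | 12 | 5.0% |
| 4 | 12 | 5.0% |
| 9 | 8 | 3.3% |
| 0 | 8 | 3.3% |
| 6 | 8 | 3.3% |
| 5 | 8 | 3.3% |
| 7 | 7 | 2.9% |
| 2 | 7 | 2.9% |
| Other values (3) | 10 | 4.1% |

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 456 | 65.4% |
| ASCII | 241 | 34.6% |

Most frequent character per block

ASCII
| Value | Count | Frequency (%) |
| (space) | 138 | 57.3% |
| 1 | 23 | 9.5% |
| 3 | 12 | 5.0% |
| 4 | 12 | 5.0% |
| 9 | 8 | 3.3% |
| 0 | 8 | 3.3% |
| 6 | 8 | 3.3% |
| 5 | 8 | 3.3% |
| 7 | 7 | 2.9% |
| 2 | 7 | 2.9% |
| Other values (3) | 10 | 4.1% |
Hangul
| Value | Count | Frequency (%) |
|  | 56 | 12.3% |
|  | 56 | 12.3% |
|  | 38 | 8.3% |
|  | 36 | 7.9% |
|  | 35 | 7.7% |
|  | 34 | 7.5% |
|  | 34 | 7.5% |
|  | 20 | 4.4% |
|  | 20 | 4.4% |
|  | 14 | 3.1% |
| Other values (41) | 113 | 24.8% |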

구분
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
공공건축물 (public building): 31
어린이집 (daycare center): 1
불특정다수이용 (used by the general public): 1
대학교 (university): 1

Length

Max length: 7
Median length: 5
Mean length: 4.9705882
Min length: 3
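
The length figures follow directly from the four category labels and their counts. As a quick check, the sketch below expands the counts into one length per row and recomputes the summary. It assumes length is measured in Unicode code points (what Python's `len` returns for these precomposed Hangul syllables); the variable names are illustrative.

```python
import statistics

# Category labels and row counts as listed for the 구분 column.
category_counts = {
    "공공건축물": 31,
    "어린이집": 1,
    "불특정다수이용": 1,
    "대학교": 1,
}

# One length per row: repeat each label's length by its row count.
lengths = [
    len(label)
    for label, count in category_counts.items()
    for _ in range(count)
]

max_length = max(lengths)
min_length = min(lengths)
median_length = statistics.median(lengths)
mean_length = statistics.fmean(lengths)  # (31*5 + 4 + 7 + 3) / 34 = 169 / 34

print(max_length, median_length, round(mean_length, 7), min_length)
```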

Unique

Unique: 3
Unique (%): 8.8%

Sample

1st row: 어린이집
2nd row: 공공건축물
3rd row: 공공건축물
4th row: 공공건축물
5th row: 공공건축물

Common Values

| Value | Count | Frequency (%) |
| 공공건축물 | 31 | 91.2% |
| 어린이집 | 1 | 2.9% |
| 불특정다수이용 | 1 | 2.9% |
| 대학교 | 1 | 2.9% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| 공공건축물 | 31 | 91.2% |
| 어린이집 | 1 | 2.9% |
| 불특정다수이용 | 1 | 2.9% |
| 대학교 | 1 | 2.9% |

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
Minimum: 2023-08-11 00:00:00
Maximum: 2023-08-11 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

|  | 번호 | 건물명 | 주소 | 구분 |
| 번호 | 1.000 | 0.894 | 0.895 | 0.000 |
| 건물명 | 0.894 | 1.000 | 0.982 | 1.000 |
| 주소 | 0.895 | 0.982 | 1.000 | 1.000 |
| 구분 | 0.000 | 1.000 | 1.000 | 1.000 |

|  | 번호 | 구분 |
| 번호 | 1.000 | 0.000 |
| 구분 | 0.000 | 1.000 |

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

|  | 번호 | 건물명 | 주소 | 구분 | 데이터기준일자 |
| 0 | 1 | 성락어린이집 | 경상북도 예천군 예천읍 효자로 196 | 어린이집 | 2023-08-11 |
| 1 | 2 | 예천군산림조합 | 경상북도 예천군 예천읍 군청앞길 13 | 공공건축물 | 2023-08-11 |
| 2 | 3 | 예천공공하수처리시설 | 경상북도 예천군 예천읍 상동길 49-50 | 공공건축물 | 2023-08-11 |
| 3 | 4 | 예천공설운동장 | 경상북도 예천군 예천읍 충효로 395 | 공공건축물 | 2023-08-11 |
| 4 | 5 | 예천문화회관 | 경상북도 예천군 예천읍 서본리 240 예천군문화회관, 36827 | 공공건축물 | 2023-08-11 |
| 5 | 6 | 예천군농촌지도소 | 경상북도 예천군 예천읍 충효로 433 | 공공건축물 | 2023-08-11 |
| 6 | 7 | 예천등기소 | 경상북도 예천군 예천읍 충효로 81 | 공공건축물 | 2023-08-11 |
| 7 | 8 | 한국전력 예천지사 | 경상북도 예천군 예천읍 효자로 170 | 공공건축물 | 2023-08-11 |
| 8 | 9 | 한국전력 예천지사 | 경상북도 예천군 예천읍 효자로 170 | 공공건축물 | 2023-08-11 |
| 9 | 10 | 영주전력지사 예천변전소 | 경상북도 예천군 예천읍 이미기길 19 | 공공건축물 | 2023-08-11 |

|  | 번호 | 건물명 | 주소 | 구분 | 데이터기준일자 |
| 24 | 25 | 예천농협 | 경상북도 예천군 용문면 상금시장길 9 | 공공건축물 | 2023-08-11 |
| 25 | 26 | 예천농협 | 경상북도 예천군 예천읍 역전길 16 | 공공건축물 | 2023-08-11 |
| 26 | 27 | 예천농헙 | 경상북도 예천군 예천읍 역전길 16 | 공공건축물 | 2023-08-11 |
| 27 | 28 | 예천농협 본점 | 경상북도 예천군 예천읍 시장로 53 | 공공건축물 | 2023-08-11 |
| 28 | 29 | 예천공공하수처리시설 | 경상북도 예천군 예천읍 상동길 49-50 | 공공건축물 | 2023-08-11 |
| 29 | 30 | 예천군농촌지도소 | 경상북도 예천군 예천읍 충효로 433 | 공공건축물 | 2023-08-11 |
| 30 | 31 | 예천군농촌지도소 | 경상북도 예천군 예천읍 충효로 433 | 공공건축물 | 2023-08-11 |
| 31 | 32 | 예천군산림조합 | 경상북도 예천군 예천읍 양궁로 57 | 공공건축물 | 2023-08-11 |
| 32 | 33 | 예천권병원 부속병원 | 경상북도 예천군 예천읍 효자로 67 | 불특정다수이용 | 2023-08-11 |
| 33 | 34 | 경북도립대학교 | 경상북도 예천군 예천읍 도립대학길 114 | 대학교 | 2023-08-11 |
3334경북도립대학교경상북도 예천군 예천읍 도립대학길 114대학교2023-08-11