Overview

Dataset statistics

Number of variables13
Number of observations51
Missing cells3
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory108.6 B

Variable types

Numeric2
Text6
DateTime1
Categorical4

Dataset

Description부산광역시 사상구 다중이용시설 냉온수기 및 정수기 설치현황 (관리번호, 업종명,업소명,사업장소재지,소재지전화)
Author부산광역시 사상구
URLhttps://www.data.go.kr/data/15025678/fileData.do

Alerts

상호명 is highly overall correlated with 설치구분 and 1 other fieldsHigh correlation
다중이용시설구분 is highly overall correlated with 설치구분High correlation
설치구분 is highly overall correlated with 상호명 and 1 other fieldsHigh correlation
휴_폐업구분 is highly overall correlated with 상호명High correlation
상호명 is highly imbalanced (55.9%)Imbalance
설치구분 is highly imbalanced (76.1%)Imbalance
휴_폐업구분 is highly imbalanced (86.1%)Imbalance
대표자전화번호 has 3 (5.9%) missing valuesMissing
연번 has unique valuesUnique
설치관리번호 has unique valuesUnique
신고번호 has unique valuesUnique
관리주체 has unique valuesUnique
대표자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:31:05.435638
Analysis finished2023-12-12 03:31:07.372927
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T12:31:07.490096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.5
Q113.5
median26
Q338.5
95-th percentile48.5
Maximum51
Range50
Interquartile range (IQR)25

Descriptive statistics

Standard deviation14.866069
Coefficient of variation (CV)0.57177187
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)13
Skewness0
Sum1326
Variance221
MonotonicityStrictly increasing
2023-12-12T12:31:07.703381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
2 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
51 1
2.0%
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%

설치관리번호
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T12:31:07.975603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters1071
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row3390000-33-2014-00001
2nd row3390000-33-2014-00002
3rd row3390000-33-2014-00003
4th row3390000-33-2014-00004
5th row3390000-33-2014-00005
ValueCountFrequency (%)
3390000-33-2014-00001 1
 
2.0%
3390000-33-2014-00027 1
 
2.0%
3390000-33-2014-00029 1
 
2.0%
3390000-33-2014-00030 1
 
2.0%
3390000-33-2014-00031 1
 
2.0%
3390000-33-2014-00032 1
 
2.0%
3390000-33-2014-00033 1
 
2.0%
3390000-33-2014-00034 1
 
2.0%
3390000-33-2014-00035 1
 
2.0%
3390000-33-2014-00036 1
 
2.0%
Other values (41) 41
80.4%
2023-12-12T12:31:08.383463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 429
40.1%
3 219
20.4%
- 153
 
14.3%
2 70
 
6.5%
1 70
 
6.5%
9 56
 
5.2%
4 54
 
5.0%
7 6
 
0.6%
5 5
 
0.5%
6 5
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 918
85.7%
Dash Punctuation 153
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 429
46.7%
3 219
23.9%
2 70
 
7.6%
1 70
 
7.6%
9 56
 
6.1%
4 54
 
5.9%
7 6
 
0.7%
5 5
 
0.5%
6 5
 
0.5%
8 4
 
0.4%
Dash Punctuation
ValueCountFrequency (%)
- 153
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1071
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 429
40.1%
3 219
20.4%
- 153
 
14.3%
2 70
 
6.5%
1 70
 
6.5%
9 56
 
5.2%
4 54
 
5.0%
7 6
 
0.6%
5 5
 
0.5%
6 5
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1071
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 429
40.1%
3 219
20.4%
- 153
 
14.3%
2 70
 
6.5%
1 70
 
6.5%
9 56
 
5.2%
4 54
 
5.0%
7 6
 
0.6%
5 5
 
0.5%
6 5
 
0.5%

신고번호
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T12:31:08.680016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.5882353
Min length2

Characters and Unicode

Total characters387
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row'2014-1
2nd row'2014-2
3rd row'2014-3
4th row'2014-5
5th row'2014-6
ValueCountFrequency (%)
2014-1 1
 
2.0%
2014-29 1
 
2.0%
2014-31 1
 
2.0%
2014-32 1
 
2.0%
2014-33 1
 
2.0%
2014-34 1
 
2.0%
2014-35 1
 
2.0%
2014-36 1
 
2.0%
2014-37 1
 
2.0%
2014-28 1
 
2.0%
Other values (41) 41
80.4%
2023-12-12T12:31:09.129927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 69
17.8%
1 68
17.6%
0 55
14.2%
4 54
14.0%
' 51
13.2%
- 50
12.9%
3 15
 
3.9%
7 6
 
1.6%
6 5
 
1.3%
9 5
 
1.3%
Other values (3) 9
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 285
73.6%
Other Punctuation 51
 
13.2%
Dash Punctuation 50
 
12.9%
Space Separator 1
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 69
24.2%
1 68
23.9%
0 55
19.3%
4 54
18.9%
3 15
 
5.3%
7 6
 
2.1%
6 5
 
1.8%
9 5
 
1.8%
5 4
 
1.4%
8 4
 
1.4%
Other Punctuation
ValueCountFrequency (%)
' 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 387
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 69
17.8%
1 68
17.6%
0 55
14.2%
4 54
14.0%
' 51
13.2%
- 50
12.9%
3 15
 
3.9%
7 6
 
1.6%
6 5
 
1.3%
9 5
 
1.3%
Other values (3) 9
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 387
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 69
17.8%
1 68
17.6%
0 55
14.2%
4 54
14.0%
' 51
13.2%
- 50
12.9%
3 15
 
3.9%
7 6
 
1.6%
6 5
 
1.3%
9 5
 
1.3%
Other values (3) 9
 
2.3%
Distinct28
Distinct (%)54.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
Minimum2014-02-12 00:00:00
Maximum2022-06-24 00:00:00
2023-12-12T12:31:09.348766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:09.547913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)

상호명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)25.5%
Missing0
Missing (%)0.0%
Memory size540.0 B
39 
홈플러스(주)서부산점
 
1
(주)이마트 사상점
 
1
(주)이마트서부산점
 
1
좋은삼선병원
 
1
Other values (8)

Length

Max length16
Median length1
Mean length2.9607843
Min length1

Unique

Unique12 ?
Unique (%)23.5%

Sample

1st row홈플러스(주)서부산점
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
39
76.5%
홈플러스(주)서부산점 1
 
2.0%
(주)이마트 사상점 1
 
2.0%
(주)이마트서부산점 1
 
2.0%
좋은삼선병원 1
 
2.0%
스파캐슬 1
 
2.0%
삼인요양병원 1
 
2.0%
서원의료재단 감로수요양병원 1
 
2.0%
희경의료재단 한국요양병원 1
 
2.0%
좋은주례요양병원 1
 
2.0%
Other values (3) 3
 
5.9%

Length

2023-12-12T12:31:09.732129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
홈플러스(주)서부산점 1
 
6.2%
주)이마트 1
 
6.2%
사상점 1
 
6.2%
주)이마트서부산점 1
 
6.2%
좋은삼선병원 1
 
6.2%
스파캐슬 1
 
6.2%
삼인요양병원 1
 
6.2%
서원의료재단 1
 
6.2%
감로수요양병원 1
 
6.2%
희경의료재단 1
 
6.2%
Other values (6) 6
37.5%

다중이용시설구분
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)21.6%
Missing0
Missing (%)0.0%
Memory size540.0 B
의료기관
19 
노인의료복지시설
대규모점포
어린이집(민간)
인터넷컴퓨터게임시설
Other values (6)

Length

Max length12
Median length10
Mean length6.0196078
Min length3

Unique

Unique4 ?
Unique (%)7.8%

Sample

1st row대규모점포
2nd row어린이집(민간)
3rd row어린이집(국공립)
4th row대규모점포
5th row의료기관

Common Values

ValueCountFrequency (%)
의료기관 19
37.3%
노인의료복지시설 9
17.6%
대규모점포 6
 
11.8%
어린이집(민간) 6
 
11.8%
인터넷컴퓨터게임시설 3
 
5.9%
어린이집(국공립) 2
 
3.9%
목욕장 2
 
3.9%
영화상영관 1
 
2.0%
장례식장 1
 
2.0%
둘 이상의 용도 건축물 1
 
2.0%

Length

2023-12-12T12:31:09.897790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
의료기관 19
35.2%
노인의료복지시설 9
16.7%
대규모점포 6
 
11.1%
어린이집(민간 6
 
11.1%
인터넷컴퓨터게임시설 3
 
5.6%
어린이집(국공립 2
 
3.7%
목욕장 2
 
3.7%
영화상영관 1
 
1.9%
장례식장 1
 
1.9%
1
 
1.9%
Other values (4) 4
 
7.4%

관리주체
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T12:31:10.167499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length8.9215686
Min length4

Characters and Unicode

Total characters455
Distinct characters143
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row홈플러스(주)서부산점
2nd row주례고운어린이집
3rd row근로복지공단어린이집
4th row(주)르네시떼
5th row부산보훈병원
ValueCountFrequency (%)
홈플러스(주)서부산점 1
 
1.6%
스파캐슬 1
 
1.6%
대남병원(강신택 1
 
1.6%
서부산노인건강센터 1
 
1.6%
좋은삼선병원장례식장 1
 
1.6%
정향행복한마을 1
 
1.6%
서부산센텀병원 1
 
1.6%
더락피씨카페(김성미 1
 
1.6%
다락방pc 1
 
1.6%
이미혜 1
 
1.6%
Other values (51) 51
83.6%
2023-12-12T12:31:10.714469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
5.9%
24
 
5.3%
( 17
 
3.7%
) 17
 
3.7%
16
 
3.5%
14
 
3.1%
13
 
2.9%
11
 
2.4%
10
 
2.2%
9
 
2.0%
Other values (133) 297
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 405
89.0%
Open Punctuation 17
 
3.7%
Close Punctuation 17
 
3.7%
Space Separator 10
 
2.2%
Uppercase Letter 4
 
0.9%
Decimal Number 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
6.7%
24
 
5.9%
16
 
4.0%
14
 
3.5%
13
 
3.2%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (126) 264
65.2%
Uppercase Letter
ValueCountFrequency (%)
P 2
50.0%
C 2
50.0%
Decimal Number
ValueCountFrequency (%)
7 1
50.0%
2 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 405
89.0%
Common 46
 
10.1%
Latin 4
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
6.7%
24
 
5.9%
16
 
4.0%
14
 
3.5%
13
 
3.2%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (126) 264
65.2%
Common
ValueCountFrequency (%)
( 17
37.0%
) 17
37.0%
10
21.7%
7 1
 
2.2%
2 1
 
2.2%
Latin
ValueCountFrequency (%)
P 2
50.0%
C 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 405
89.0%
ASCII 50
 
11.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
 
6.7%
24
 
5.9%
16
 
4.0%
14
 
3.5%
13
 
3.2%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
Other values (126) 264
65.2%
ASCII
ValueCountFrequency (%)
( 17
34.0%
) 17
34.0%
10
20.0%
P 2
 
4.0%
C 2
 
4.0%
7 1
 
2.0%
2 1
 
2.0%

설치구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
정수기
49 
냉온수기
 
2

Length

Max length4
Median length3
Mean length3.0392157
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정수기
2nd row정수기
3rd row정수기
4th row정수기
5th row정수기

Common Values

ValueCountFrequency (%)
정수기 49
96.1%
냉온수기 2
 
3.9%

Length

2023-12-12T12:31:10.929592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:31:11.078673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정수기 49
96.1%
냉온수기 2
 
3.9%

총 설치대수
Real number (ℝ)

Distinct21
Distinct (%)41.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.1568627
Minimum1
Maximum61
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T12:31:11.218444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q312.5
95-th percentile30.5
Maximum61
Range60
Interquartile range (IQR)10.5

Descriptive statistics

Standard deviation11.918679
Coefficient of variation (CV)1.3016116
Kurtosis7.3204534
Mean9.1568627
Median Absolute Deviation (MAD)3
Skewness2.5114653
Sum467
Variance142.0549
MonotonicityNot monotonic
2023-12-12T12:31:11.402662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 11
21.6%
4 6
11.8%
3 5
9.8%
5 5
9.8%
2 4
 
7.8%
13 3
 
5.9%
6 2
 
3.9%
9 2
 
3.9%
10 1
 
2.0%
12 1
 
2.0%
Other values (11) 11
21.6%
ValueCountFrequency (%)
1 11
21.6%
2 4
 
7.8%
3 5
9.8%
4 6
11.8%
5 5
9.8%
6 2
 
3.9%
8 1
 
2.0%
9 2
 
3.9%
10 1
 
2.0%
12 1
 
2.0%
ValueCountFrequency (%)
61 1
2.0%
44 1
2.0%
33 1
2.0%
28 1
2.0%
26 1
2.0%
25 1
2.0%
23 1
2.0%
16 1
2.0%
15 1
2.0%
14 1
2.0%

대표자
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T12:31:11.696496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length11.333333
Min length3

Characters and Unicode

Total characters578
Distinct characters159
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row홈플러스(주) 이제훈
2nd row주례고운어린이집
3rd row부산근로복지공단어린이집
4th row(주)르네시떼 전태섭
5th row부산보훈병원
ValueCountFrequency (%)
의료법인 2
 
2.9%
주현의료재단 2
 
2.9%
홈플러스(주 1
 
1.4%
김성순 1
 
1.4%
서부산노인건강센터 1
 
1.4%
좋은삼선병원장례식장 1
 
1.4%
정향행복한마을 1
 
1.4%
의료법인센텀의료재단서부산센텀병원 1
 
1.4%
김성미(더락피씨카페 1
 
1.4%
다락방pc 1
 
1.4%
Other values (57) 57
82.6%
2023-12-12T12:31:12.560323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
4.8%
22
 
3.8%
20
 
3.5%
19
 
3.3%
( 18
 
3.1%
) 18
 
3.1%
18
 
3.1%
16
 
2.8%
14
 
2.4%
13
 
2.2%
Other values (149) 392
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 517
89.4%
Open Punctuation 18
 
3.1%
Close Punctuation 18
 
3.1%
Space Separator 18
 
3.1%
Uppercase Letter 4
 
0.7%
Decimal Number 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
5.4%
22
 
4.3%
20
 
3.9%
19
 
3.7%
16
 
3.1%
14
 
2.7%
13
 
2.5%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (141) 349
67.5%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
7 1
33.3%
2 1
33.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
P 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Space Separator
ValueCountFrequency (%)
18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 517
89.4%
Common 57
 
9.9%
Latin 4
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
5.4%
22
 
4.3%
20
 
3.9%
19
 
3.7%
16
 
3.1%
14
 
2.7%
13
 
2.5%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (141) 349
67.5%
Common
ValueCountFrequency (%)
( 18
31.6%
) 18
31.6%
18
31.6%
1 1
 
1.8%
7 1
 
1.8%
2 1
 
1.8%
Latin
ValueCountFrequency (%)
C 2
50.0%
P 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 517
89.4%
ASCII 61
 
10.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
 
5.4%
22
 
4.3%
20
 
3.9%
19
 
3.7%
16
 
3.1%
14
 
2.7%
13
 
2.5%
13
 
2.5%
12
 
2.3%
11
 
2.1%
Other values (141) 349
67.5%
ASCII
ValueCountFrequency (%)
( 18
29.5%
) 18
29.5%
18
29.5%
C 2
 
3.3%
P 2
 
3.3%
1 1
 
1.6%
7 1
 
1.6%
2 1
 
1.6%

대표자전화번호
Text

MISSING 

Distinct48
Distinct (%)100.0%
Missing3
Missing (%)5.9%
Memory size540.0 B
2023-12-12T12:31:12.904800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12
Min length11

Characters and Unicode

Total characters576
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row051-319-9168
2nd row051-316-2178
3rd row051-311-7204
4th row051-319-8943
5th row051-601-6092
ValueCountFrequency (%)
051-302-6267 1
 
2.1%
051-305-9840 1
 
2.1%
051-303-5100 1
 
2.1%
051-325-9998 1
 
2.1%
051-310-9292 1
 
2.1%
051-303-1258 1
 
2.1%
051-329-3281 1
 
2.1%
070-8233-4477 1
 
2.1%
051-316-1512 1
 
2.1%
051-304-7001 1
 
2.1%
Other values (38) 38
79.2%
2023-12-12T12:31:13.414622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 113
19.6%
1 94
16.3%
- 94
16.3%
5 73
12.7%
3 62
10.8%
2 44
 
7.6%
9 26
 
4.5%
7 22
 
3.8%
6 16
 
2.8%
8 16
 
2.8%
Other values (2) 16
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 481
83.5%
Dash Punctuation 94
 
16.3%
Other Punctuation 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 113
23.5%
1 94
19.5%
5 73
15.2%
3 62
12.9%
2 44
 
9.1%
9 26
 
5.4%
7 22
 
4.6%
6 16
 
3.3%
8 16
 
3.3%
4 15
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 576
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 113
19.6%
1 94
16.3%
- 94
16.3%
5 73
12.7%
3 62
10.8%
2 44
 
7.6%
9 26
 
4.5%
7 22
 
3.8%
6 16
 
2.8%
8 16
 
2.8%
Other values (2) 16
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 576
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 113
19.6%
1 94
16.3%
- 94
16.3%
5 73
12.7%
3 62
10.8%
2 44
 
7.6%
9 26
 
4.5%
7 22
 
3.8%
6 16
 
2.8%
8 16
 
2.8%
Other values (2) 16
 
2.8%
Distinct47
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T12:31:13.732196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length34
Mean length25.196078
Min length21

Characters and Unicode

Total characters1285
Distinct characters68
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)84.3%

Sample

1st row부산광역시 사상구 광장로 7 (괘법동 홈플러스(주)서부산점)
2nd row부산광역시 사상구 동주로 2-11 (주례동)
3rd row부산광역시 사상구 사상로 255 (괘법동)
4th row부산광역시 사상구 광장로 7 (괘법동)
5th row부산광역시 사상구 백양대로 420 (주례동)
ValueCountFrequency (%)
부산광역시 51
19.7%
사상구 51
19.7%
괘법동 13
 
5.0%
주례동 11
 
4.2%
학장동 10
 
3.9%
사상로 7
 
2.7%
대동로 6
 
2.3%
가야대로 5
 
1.9%
학감대로39번길 4
 
1.5%
엄궁동 4
 
1.5%
Other values (73) 97
37.5%
2023-12-12T12:31:14.219709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
211
 
16.4%
61
 
4.7%
61
 
4.7%
60
 
4.7%
55
 
4.3%
53
 
4.1%
( 52
 
4.0%
) 52
 
4.0%
52
 
4.0%
51
 
4.0%
Other values (58) 577
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 787
61.2%
Space Separator 211
 
16.4%
Decimal Number 175
 
13.6%
Open Punctuation 52
 
4.0%
Close Punctuation 52
 
4.0%
Dash Punctuation 8
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
7.8%
61
 
7.8%
60
 
7.6%
55
 
7.0%
53
 
6.7%
52
 
6.6%
51
 
6.5%
51
 
6.5%
51
 
6.5%
51
 
6.5%
Other values (44) 241
30.6%
Decimal Number
ValueCountFrequency (%)
2 31
17.7%
1 30
17.1%
3 23
13.1%
7 16
9.1%
5 15
8.6%
9 15
8.6%
6 14
8.0%
8 12
 
6.9%
0 11
 
6.3%
4 8
 
4.6%
Space Separator
ValueCountFrequency (%)
211
100.0%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 787
61.2%
Common 498
38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
7.8%
61
 
7.8%
60
 
7.6%
55
 
7.0%
53
 
6.7%
52
 
6.6%
51
 
6.5%
51
 
6.5%
51
 
6.5%
51
 
6.5%
Other values (44) 241
30.6%
Common
ValueCountFrequency (%)
211
42.4%
( 52
 
10.4%
) 52
 
10.4%
2 31
 
6.2%
1 30
 
6.0%
3 23
 
4.6%
7 16
 
3.2%
5 15
 
3.0%
9 15
 
3.0%
6 14
 
2.8%
Other values (4) 39
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 787
61.2%
ASCII 498
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
211
42.4%
( 52
 
10.4%
) 52
 
10.4%
2 31
 
6.2%
1 30
 
6.0%
3 23
 
4.6%
7 16
 
3.2%
5 15
 
3.0%
9 15
 
3.0%
6 14
 
2.8%
Other values (4) 39
 
7.8%
Hangul
ValueCountFrequency (%)
61
 
7.8%
61
 
7.8%
60
 
7.6%
55
 
7.0%
53
 
6.7%
52
 
6.6%
51
 
6.5%
51
 
6.5%
51
 
6.5%
51
 
6.5%
Other values (44) 241
30.6%

휴_폐업구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
정상
50 
폐업
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 50
98.0%
폐업 1
 
2.0%

Length

2023-12-12T12:31:14.393575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:31:14.505554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 50
98.0%
폐업 1
 
2.0%

Interactions

2023-12-12T12:31:06.626474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:06.375077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:06.735784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:31:06.506701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:31:14.606013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치관리번호신고번호신고일자상호명다중이용시설구분관리주체설치구분총 설치대수대표자대표자전화번호사업장소재지(도로명)휴_폐업구분
연번1.0001.0001.0000.9560.3390.2151.0000.6340.3041.0001.0000.7440.125
설치관리번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
신고번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
신고일자0.9561.0001.0001.0000.9440.5501.0001.0000.2451.0001.0000.9480.000
상호명0.3391.0001.0000.9441.0000.4871.0001.0000.7441.0001.0000.7781.000
다중이용시설구분0.2151.0001.0000.5500.4871.0001.0000.6410.0001.0001.0000.0000.000
관리주체1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
설치구분0.6341.0001.0001.0001.0000.6411.0001.0000.0001.0001.0001.0000.000
총 설치대수0.3041.0001.0000.2450.7440.0001.0000.0001.0001.0001.0000.6180.000
대표자1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
대표자전화번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
사업장소재지(도로명)0.7441.0001.0000.9480.7780.0001.0001.0000.6181.0001.0001.0001.000
휴_폐업구분0.1251.0001.0000.0001.0000.0001.0000.0000.0001.0001.0001.0001.000
2023-12-12T12:31:14.771246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치구분상호명휴_폐업구분다중이용시설구분
설치구분1.0000.8810.0000.560
상호명0.8811.0000.8810.199
휴_폐업구분0.0000.8811.0000.000
다중이용시설구분0.5600.1990.0001.000
2023-12-12T12:31:14.890106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번총 설치대수상호명다중이용시설구분설치구분휴_폐업구분
연번1.0000.0510.0990.0860.4470.064
총 설치대수0.0511.0000.2900.0000.0000.000
상호명0.0990.2901.0000.1990.8810.881
다중이용시설구분0.0860.0000.1991.0000.5600.000
설치구분0.4470.0000.8810.5601.0000.000
휴_폐업구분0.0640.0000.8810.0000.0001.000

Missing values

2023-12-12T12:31:06.925181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:31:07.264253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번설치관리번호신고번호신고일자상호명다중이용시설구분관리주체설치구분총 설치대수대표자대표자전화번호사업장소재지(도로명)휴_폐업구분
013390000-33-2014-00001'2014-12014-02-12홈플러스(주)서부산점대규모점포홈플러스(주)서부산점정수기4홈플러스(주) 이제훈051-319-9168부산광역시 사상구 광장로 7 (괘법동 홈플러스(주)서부산점)정상
123390000-33-2014-00002'2014-22014-02-28어린이집(민간)주례고운어린이집정수기1주례고운어린이집051-316-2178부산광역시 사상구 동주로 2-11 (주례동)정상
233390000-33-2014-00003'2014-32014-03-03어린이집(국공립)근로복지공단어린이집정수기3부산근로복지공단어린이집051-311-7204부산광역시 사상구 사상로 255 (괘법동)정상
343390000-33-2014-00004'2014-52014-03-03대규모점포(주)르네시떼정수기33(주)르네시떼 전태섭051-319-8943부산광역시 사상구 광장로 7 (괘법동)정상
453390000-33-2014-00005'2014-62014-03-03의료기관부산보훈병원정수기61부산보훈병원051-601-6092부산광역시 사상구 백양대로 420 (주례동)정상
563390000-33-2014-00006'2014-42014-03-03목욕장사상해수온천정수기5사상해수온천051-311-2363부산광역시 사상구 사상로223번길 55 (괘법동)정상
673390000-33-2014-00007'2014-72014-03-04어린이집(민간)양지어린이집정수기1양지어린이집051-325-0674부산광역시 사상구 양지로30번길 88-1 (주례동)정상
783390000-33-2014-00008'2014-82014-03-05영화상영관롯데시네마 사상7정수기1롯데쇼핑(주)롯데시네마사상7051-312-6400부산광역시 사상구 사상로 201 (괘법동)정상
893390000-33-2014-00009'2014-92014-03-06의료기관부산시립정신병원정수기13부산시립정신병원051-312-2288부산광역시 사상구 학감대로39번길 104-36 (학장동)정상
9103390000-33-2014-00010'2014-102014-03-06(주)이마트 사상점대규모점포(주)이마트 사상점정수기16(주)이마트사상점051-329-1004부산광역시 사상구 광장로 17 이마트 (괘법동)정상
연번설치관리번호신고번호신고일자상호명다중이용시설구분관리주체설치구분총 설치대수대표자대표자전화번호사업장소재지(도로명)휴_폐업구분
41423390000-33-2014-00042'2014-432014-04-02노인의료복지시설윤금노인요양원(이영환)정수기4사회복지법인그리스도구원선051-312-0675부산광역시 사상구 가야대로 187 (주례동)정상
42433390000-33-2014-00043'2014-442014-04-16어린이집(민간)하늘나리어린이집(김은숙)정수기1하늘나리어린이집(김은숙)051-332-1519부산광역시 사상구 백양대로934번길 52-29 (모라동)정상
43443390000-33-2014-00044'2014-222014-06-27대규모점포(주)롯데마트사상점정수기28롯데쇼핑(주)롯데마트 사상점 신헌051-329-2500부산광역시 사상구 낙동대로 733 (엄궁동)정상
44453390000-33-2015-00001'2015-10-23삼인요양병원의료기관삼인요양병원정수기9송수진외 1051-327-3333부산광역시 사상구 대동로 95 (학장동)정상
45463390000-33-2017-00001'2017-12017-08-22서원의료재단 감로수요양병원의료기관서원의료재단 감로수요양병원정수기8서원의료재단 감로수요양병원(문영주)051-315-0030부산광역시 사상구 덕상로 116-38 (모라동)정상
46473390000-33-2017-00002'2017-22017-09-04희경의료재단 한국요양병원의료기관한국요양병원정수기5한국요양병원(최순희)051-328-8251부산광역시 사상구 광장로 33 (괘법동)정상
47483390000-33-2019-00001'2019-12019-12-19좋은주례요양병원의료기관좋은주례요양병원정수기5좋은주례요양병원051-325-0300부산광역시 사상구 가야대로 264 (주례동)정상
48493390000-33-2022-00001'2022-12022-06-24한국건강관리협회의료기관예담솔루텍정수기23김인원'0516019700부산광역시 사상구 학감대로 230 (감전동)정상
49503390000-33-2016-00001'2016-12016-11-22시티요양병원노인요양시설시티요양병원냉온수기5강수원051-317-0080부산광역시 사상구 백양대로907번길 11 (모라동)정상
50513390000-33-2020-00001'2020-12020-05-08의료법인영재의료재단 큰솔2병원의료기관의료법인영재의료재단 큰솔2병원냉온수기12배영일(큰솔2병원)051-322-0050부산광역시 사상구 학장로 189 (학장동)정상