Overview

Dataset statistics

Number of variables12
Number of observations28
Missing cells9
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory100.7 B

Variable types

Unsupported1
Categorical3
Text8

Dataset

Description소방시설설계,공사,감리,방염,관리업현황201611
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202700

Alerts

Unnamed: 1 is highly overall correlated with Unnamed: 2 and 1 other fieldsHigh correlation
Unnamed: 10 is highly overall correlated with Unnamed: 1High correlation
Unnamed: 2 is highly overall correlated with Unnamed: 1High correlation
Unnamed: 1 is highly imbalanced (72.1%)Imbalance
Unnamed: 10 is highly imbalanced (56.1%)Imbalance
소방시설 설계업 등록 현황 has 1 (3.6%) missing valuesMissing
Unnamed: 3 has 1 (3.6%) missing valuesMissing
Unnamed: 4 has 1 (3.6%) missing valuesMissing
Unnamed: 5 has 1 (3.6%) missing valuesMissing
Unnamed: 6 has 1 (3.6%) missing valuesMissing
Unnamed: 7 has 1 (3.6%) missing valuesMissing
Unnamed: 8 has 1 (3.6%) missing valuesMissing
Unnamed: 9 has 1 (3.6%) missing valuesMissing
Unnamed: 11 has 1 (3.6%) missing valuesMissing
소방시설 설계업 등록 현황 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 01:22:20.375541
Analysis finished2024-03-14 01:22:21.117979
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소방시설 설계업 등록 현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)3.6%
Memory size356.0 B

Unnamed: 1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size356.0 B
전북
26 
<NA>
 
1
지역
 
1

Length

Max length4
Median length2
Mean length2.0714286
Min length2

Unique

Unique2 ?
Unique (%)7.1%

Sample

1st row<NA>
2nd row지역
3rd row전북
4th row전북
5th row전북

Common Values

ValueCountFrequency (%)
전북 26
92.9%
<NA> 1
 
3.6%
지역 1
 
3.6%

Length

2024-03-14T10:22:21.188648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:22:21.302185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북 26
92.9%
na 1
 
3.6%
지역 1
 
3.6%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size356.0 B
전주시
18 
익산시
군산시
<NA>
 
1
조회지역
 
1
Other values (3)

Length

Max length4
Median length3
Mean length3.0714286
Min length3

Unique

Unique5 ?
Unique (%)17.9%

Sample

1st row<NA>
2nd row조회지역
3rd row전주시
4th row전주시
5th row익산시

Common Values

ValueCountFrequency (%)
전주시 18
64.3%
익산시 3
 
10.7%
군산시 2
 
7.1%
<NA> 1
 
3.6%
조회지역 1
 
3.6%
정읍시 1
 
3.6%
부안군 1
 
3.6%
남원시 1
 
3.6%

Length

2024-03-14T10:22:21.420613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:22:21.542957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전주시 18
64.3%
익산시 3
 
10.7%
군산시 2
 
7.1%
na 1
 
3.6%
조회지역 1
 
3.6%
정읍시 1
 
3.6%
부안군 1
 
3.6%
남원시 1
 
3.6%

Unnamed: 3
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:21.704859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length8.0740741
Min length2

Characters and Unicode

Total characters218
Distinct characters72
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row상호
2nd row티앤제이건설(주)
3rd row(주)주연이엔지
4th row지오방재이엔씨(주)
5th row(주)덕진소방
ValueCountFrequency (%)
주식회사 2
 
6.9%
상호 1
 
3.4%
주)목양종합건축사사무소 1
 
3.4%
주)다우코퍼레이션 1
 
3.4%
주)호원엔지니어링 1
 
3.4%
유)유진엔지니어링 1
 
3.4%
대한엔지니어링 1
 
3.4%
주)대성건축사사무소 1
 
3.4%
광성엔지니어링 1
 
3.4%
주)부윤엔지니어링 1
 
3.4%
Other values (18) 18
62.1%
2024-03-14T10:22:21.963782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 19
 
8.7%
( 19
 
8.7%
16
 
7.3%
13
 
6.0%
12
 
5.5%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
Other values (62) 99
45.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 175
80.3%
Close Punctuation 19
 
8.7%
Open Punctuation 19
 
8.7%
Uppercase Letter 3
 
1.4%
Space Separator 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
9.1%
13
 
7.4%
12
 
6.9%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
7
 
4.0%
4
 
2.3%
Other values (56) 83
47.4%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
S 1
33.3%
C 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 175
80.3%
Common 40
 
18.3%
Latin 3
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
9.1%
13
 
7.4%
12
 
6.9%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
7
 
4.0%
4
 
2.3%
Other values (56) 83
47.4%
Common
ValueCountFrequency (%)
) 19
47.5%
( 19
47.5%
2
 
5.0%
Latin
ValueCountFrequency (%)
E 1
33.3%
S 1
33.3%
C 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 175
80.3%
ASCII 43
 
19.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 19
44.2%
( 19
44.2%
2
 
4.7%
E 1
 
2.3%
S 1
 
2.3%
C 1
 
2.3%
Hangul
ValueCountFrequency (%)
16
 
9.1%
13
 
7.4%
12
 
6.9%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
8
 
4.6%
7
 
4.0%
4
 
2.3%
Other values (56) 83
47.4%

Unnamed: 4
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:22.124259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.962963
Min length2

Characters and Unicode

Total characters80
Distinct characters51
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row대표자
2nd row고태유
3rd row김종배
4th row장영철
5th row박동규
ValueCountFrequency (%)
대표자 1
 
3.7%
김경옥 1
 
3.7%
박식 1
 
3.7%
신장근 1
 
3.7%
김영수 1
 
3.7%
김경미 1
 
3.7%
김창호 1
 
3.7%
정태종 1
 
3.7%
백종호 1
 
3.7%
김한규 1
 
3.7%
Other values (17) 17
63.0%
2024-03-14T10:22:22.424923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
8.8%
6
 
7.5%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (41) 48
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 80
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
8.8%
6
 
7.5%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (41) 48
60.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 80
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
8.8%
6
 
7.5%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (41) 48
60.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 80
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
8.8%
6
 
7.5%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (41) 48
60.0%

Unnamed: 5
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:22.599840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.962963
Min length4

Characters and Unicode

Total characters134
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row우편번호
2nd row55008
3rd row55062
4th row54663
5th row54923
ValueCountFrequency (%)
우편번호 1
 
3.7%
54890 1
 
3.7%
54877 1
 
3.7%
54846 1
 
3.7%
54962 1
 
3.7%
54954 1
 
3.7%
54621 1
 
3.7%
54941 1
 
3.7%
55123 1
 
3.7%
54977 1
 
3.7%
Other values (17) 17
63.0%
2024-03-14T10:22:22.876936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 38
28.4%
4 20
14.9%
6 13
 
9.7%
0 11
 
8.2%
2 10
 
7.5%
8 9
 
6.7%
9 9
 
6.7%
7 8
 
6.0%
3 7
 
5.2%
1 5
 
3.7%
Other values (4) 4
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 130
97.0%
Other Letter 4
 
3.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 38
29.2%
4 20
15.4%
6 13
 
10.0%
0 11
 
8.5%
2 10
 
7.7%
8 9
 
6.9%
9 9
 
6.9%
7 8
 
6.2%
3 7
 
5.4%
1 5
 
3.8%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 130
97.0%
Hangul 4
 
3.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 38
29.2%
4 20
15.4%
6 13
 
10.0%
0 11
 
8.5%
2 10
 
7.7%
8 9
 
6.9%
9 9
 
6.9%
7 8
 
6.2%
3 7
 
5.4%
1 5
 
3.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 130
97.0%
Hangul 4
 
3.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 38
29.2%
4 20
15.4%
6 13
 
10.0%
0 11
 
8.5%
2 10
 
7.7%
8 9
 
6.9%
9 9
 
6.9%
7 8
 
6.2%
3 7
 
5.4%
1 5
 
3.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 6
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:23.082270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length35
Mean length28.37037
Min length4

Characters and Unicode

Total characters766
Distinct characters102
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row본사주소
2nd row전라북도 전주시 덕진구 팽나무4길 15-5 (인후동1가)
3rd row전라북도 전주시 완산구 호암로 82 402호(그린빌딩) (효자동2가)
4th row전라북도 익산시 고현로 24-3 (송학동)
5th row전라북도 전주시 덕진구 조경단로 38 (금암동)
ValueCountFrequency (%)
전라북도 26
 
16.1%
전주시 18
 
11.2%
완산구 13
 
8.1%
덕진구 5
 
3.1%
3층 4
 
2.5%
익산시 3
 
1.9%
효자동2가 3
 
1.9%
서신동 2
 
1.2%
군산시 2
 
1.2%
조촌동 2
 
1.2%
Other values (79) 83
51.6%
2024-03-14T10:22:23.429449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
144
 
18.8%
49
 
6.4%
( 27
 
3.5%
27
 
3.5%
) 27
 
3.5%
26
 
3.4%
2 26
 
3.4%
26
 
3.4%
26
 
3.4%
25
 
3.3%
Other values (92) 363
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 445
58.1%
Space Separator 144
 
18.8%
Decimal Number 112
 
14.6%
Open Punctuation 27
 
3.5%
Close Punctuation 27
 
3.5%
Dash Punctuation 6
 
0.8%
Other Punctuation 4
 
0.5%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
11.0%
27
 
6.1%
26
 
5.8%
26
 
5.8%
26
 
5.8%
25
 
5.6%
25
 
5.6%
20
 
4.5%
19
 
4.3%
18
 
4.0%
Other values (76) 184
41.3%
Decimal Number
ValueCountFrequency (%)
2 26
23.2%
1 19
17.0%
3 17
15.2%
4 13
11.6%
5 10
 
8.9%
0 9
 
8.0%
6 6
 
5.4%
8 5
 
4.5%
7 4
 
3.6%
9 3
 
2.7%
Space Separator
ValueCountFrequency (%)
144
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Uppercase Letter
ValueCountFrequency (%)
F 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 445
58.1%
Common 320
41.8%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
11.0%
27
 
6.1%
26
 
5.8%
26
 
5.8%
26
 
5.8%
25
 
5.6%
25
 
5.6%
20
 
4.5%
19
 
4.3%
18
 
4.0%
Other values (76) 184
41.3%
Common
ValueCountFrequency (%)
144
45.0%
( 27
 
8.4%
) 27
 
8.4%
2 26
 
8.1%
1 19
 
5.9%
3 17
 
5.3%
4 13
 
4.1%
5 10
 
3.1%
0 9
 
2.8%
- 6
 
1.9%
Other values (5) 22
 
6.9%
Latin
ValueCountFrequency (%)
F 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 445
58.1%
ASCII 321
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
144
44.9%
( 27
 
8.4%
) 27
 
8.4%
2 26
 
8.1%
1 19
 
5.9%
3 17
 
5.3%
4 13
 
4.0%
5 10
 
3.1%
0 9
 
2.8%
- 6
 
1.9%
Other values (6) 23
 
7.2%
Hangul
ValueCountFrequency (%)
49
 
11.0%
27
 
6.1%
26
 
5.8%
26
 
5.8%
26
 
5.8%
25
 
5.6%
25
 
5.6%
20
 
4.5%
19
 
4.3%
18
 
4.0%
Other values (76) 184
41.3%

Unnamed: 7
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:23.651591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.740741
Min length4

Characters and Unicode

Total characters317
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row전화번호
2nd row063-242-1071
3rd row063-227-6606
4th row063-843-1119
5th row063-272-6119
ValueCountFrequency (%)
전화번호 1
 
3.7%
070-7733-0700 1
 
3.7%
063-255-8200 1
 
3.7%
063-273-7557 1
 
3.7%
063-224-6794 1
 
3.7%
063-221-2948 1
 
3.7%
063-841-1203 1
 
3.7%
063-252-4119 1
 
3.7%
063-225-2750 1
 
3.7%
063-227-6626 1
 
3.7%
Other values (17) 17
63.0%
2024-03-14T10:22:23.946731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 52
16.4%
0 47
14.8%
2 45
14.2%
6 42
13.2%
3 35
11.0%
1 22
6.9%
7 19
 
6.0%
4 17
 
5.4%
5 16
 
5.0%
8 11
 
3.5%
Other values (5) 11
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 261
82.3%
Dash Punctuation 52
 
16.4%
Other Letter 4
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 47
18.0%
2 45
17.2%
6 42
16.1%
3 35
13.4%
1 22
8.4%
7 19
7.3%
4 17
 
6.5%
5 16
 
6.1%
8 11
 
4.2%
9 7
 
2.7%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 313
98.7%
Hangul 4
 
1.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 52
16.6%
0 47
15.0%
2 45
14.4%
6 42
13.4%
3 35
11.2%
1 22
7.0%
7 19
 
6.1%
4 17
 
5.4%
5 16
 
5.1%
8 11
 
3.5%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 313
98.7%
Hangul 4
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 52
16.6%
0 47
15.0%
2 45
14.4%
6 42
13.4%
3 35
11.2%
1 22
7.0%
7 19
 
6.1%
4 17
 
5.4%
5 16
 
5.1%
8 11
 
3.5%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 8
Text

MISSING 

Distinct26
Distinct (%)96.3%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:24.121155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.703704
Min length4

Characters and Unicode

Total characters316
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)92.6%

Sample

1st row팩스번호
2nd row063-242-1078
3rd row063-227-6607
4th row063-843-1139
5th row063-255-7119
ValueCountFrequency (%)
063-241-5980 2
 
7.4%
팩스번호 1
 
3.7%
063-255-8230 1
 
3.7%
063-211-2181 1
 
3.7%
063-228-6796 1
 
3.7%
063-221-8722 1
 
3.7%
063-842-1204 1
 
3.7%
063-252-4339 1
 
3.7%
063-226-2751 1
 
3.7%
063-227-6696 1
 
3.7%
Other values (16) 16
59.3%
2024-03-14T10:22:24.446035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 52
16.5%
2 42
13.3%
0 41
13.0%
3 40
12.7%
6 39
12.3%
1 23
7.3%
8 18
 
5.7%
5 17
 
5.4%
4 14
 
4.4%
9 13
 
4.1%
Other values (5) 17
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 260
82.3%
Dash Punctuation 52
 
16.5%
Other Letter 4
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 42
16.2%
0 41
15.8%
3 40
15.4%
6 39
15.0%
1 23
8.8%
8 18
6.9%
5 17
6.5%
4 14
 
5.4%
9 13
 
5.0%
7 13
 
5.0%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 312
98.7%
Hangul 4
 
1.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 52
16.7%
2 42
13.5%
0 41
13.1%
3 40
12.8%
6 39
12.5%
1 23
7.4%
8 18
 
5.8%
5 17
 
5.4%
4 14
 
4.5%
9 13
 
4.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 312
98.7%
Hangul 4
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 52
16.7%
2 42
13.5%
0 41
13.1%
3 40
12.8%
6 39
12.5%
1 23
7.4%
8 18
 
5.8%
5 17
 
5.4%
4 14
 
4.5%
9 13
 
4.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 9
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:24.654798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.962963
Min length9

Characters and Unicode

Total characters269
Distinct characters20
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row최초면허(설계업)
2nd row2010-11-10
3rd row2003-01-08
4th row2016-04-11
5th row2012-12-28
ValueCountFrequency (%)
최초면허(설계업 1
 
3.7%
2012-05-29 1
 
3.7%
2006-08-14 1
 
3.7%
2003-07-04 1
 
3.7%
2012-05-08 1
 
3.7%
2009-04-07 1
 
3.7%
1997-06-24 1
 
3.7%
2011-02-10 1
 
3.7%
2009-06-08 1
 
3.7%
2006-09-13 1
 
3.7%
Other values (17) 17
63.0%
2024-03-14T10:22:24.961306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 73
27.1%
- 52
19.3%
2 41
15.2%
1 36
13.4%
6 13
 
4.8%
7 9
 
3.3%
9 9
 
3.3%
3 8
 
3.0%
8 7
 
2.6%
4 6
 
2.2%
Other values (10) 15
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 208
77.3%
Dash Punctuation 52
 
19.3%
Other Letter 7
 
2.6%
Close Punctuation 1
 
0.4%
Open Punctuation 1
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 73
35.1%
2 41
19.7%
1 36
17.3%
6 13
 
6.2%
7 9
 
4.3%
9 9
 
4.3%
3 8
 
3.8%
8 7
 
3.4%
4 6
 
2.9%
5 6
 
2.9%
Other Letter
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 262
97.4%
Hangul 7
 
2.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 73
27.9%
- 52
19.8%
2 41
15.6%
1 36
13.7%
6 13
 
5.0%
7 9
 
3.4%
9 9
 
3.4%
3 8
 
3.1%
8 7
 
2.7%
4 6
 
2.3%
Other values (3) 8
 
3.1%
Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 262
97.4%
Hangul 7
 
2.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 73
27.9%
- 52
19.8%
2 41
15.6%
1 36
13.7%
6 13
 
5.0%
7 9
 
3.4%
9 9
 
3.4%
3 8
 
3.1%
8 7
 
2.7%
4 6
 
2.3%
Other values (3) 8
 
3.1%
Hangul
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Unnamed: 10
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size356.0 B
일반(전기),일반(기계)
23 
일반(기계)
 
2
<NA>
 
1
분야(설계업)
 
1
전문
 
1

Length

Max length13
Median length13
Mean length11.571429
Min length2

Unique

Unique3 ?
Unique (%)10.7%

Sample

1st row<NA>
2nd row분야(설계업)
3rd row일반(전기),일반(기계)
4th row일반(전기),일반(기계)
5th row전문

Common Values

ValueCountFrequency (%)
일반(전기),일반(기계) 23
82.1%
일반(기계) 2
 
7.1%
<NA> 1
 
3.6%
분야(설계업) 1
 
3.6%
전문 1
 
3.6%

Length

2024-03-14T10:22:25.076595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:22:25.169089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반(전기),일반(기계 23
82.1%
일반(기계 2
 
7.1%
na 1
 
3.6%
분야(설계업 1
 
3.6%
전문 1
 
3.6%

Unnamed: 11
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing1
Missing (%)3.6%
Memory size356.0 B
2024-03-14T10:22:25.333888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length12.259259
Min length7

Characters and Unicode

Total characters331
Distinct characters37
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row등록번호(설계업)
2nd row전주덕진 제2010-4호
3rd row전주완산 제2007-4호
4th row2016-01-00057
5th row전주덕진제2012-12호
ValueCountFrequency (%)
전주완산 2
 
6.5%
등록번호(설계업 1
 
3.2%
전주덕진제2006-06호 1
 
3.2%
전주완산제2006-15호 1
 
3.2%
전주덕진2003-3호 1
 
3.2%
전주완산제2012-05호 1
 
3.2%
전주완산제2009-07호 1
 
3.2%
제97-53호 1
 
3.2%
전주완산제2011-3호 1
 
3.2%
전주완산제2009-5호 1
 
3.2%
Other values (20) 20
64.5%
2024-03-14T10:22:25.622313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 55
16.6%
2 31
9.4%
1 29
 
8.8%
- 27
 
8.2%
26
 
7.9%
24
 
7.3%
21
 
6.3%
18
 
5.4%
15
 
4.5%
12
 
3.6%
Other values (27) 73
22.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 152
45.9%
Other Letter 146
44.1%
Dash Punctuation 27
 
8.2%
Space Separator 4
 
1.2%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
17.8%
24
16.4%
21
14.4%
18
12.3%
15
10.3%
12
8.2%
6
 
4.1%
6
 
4.1%
3
 
2.1%
2
 
1.4%
Other values (13) 13
8.9%
Decimal Number
ValueCountFrequency (%)
0 55
36.2%
2 31
20.4%
1 29
19.1%
5 9
 
5.9%
3 7
 
4.6%
6 6
 
3.9%
7 6
 
3.9%
9 5
 
3.3%
4 3
 
2.0%
8 1
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 185
55.9%
Hangul 146
44.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
17.8%
24
16.4%
21
14.4%
18
12.3%
15
10.3%
12
8.2%
6
 
4.1%
6
 
4.1%
3
 
2.1%
2
 
1.4%
Other values (13) 13
8.9%
Common
ValueCountFrequency (%)
0 55
29.7%
2 31
16.8%
1 29
15.7%
- 27
14.6%
5 9
 
4.9%
3 7
 
3.8%
6 6
 
3.2%
7 6
 
3.2%
9 5
 
2.7%
4
 
2.2%
Other values (4) 6
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 185
55.9%
Hangul 146
44.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 55
29.7%
2 31
16.8%
1 29
15.7%
- 27
14.6%
5 9
 
4.9%
3 7
 
3.8%
6 6
 
3.2%
7 6
 
3.2%
9 5
 
2.7%
4
 
2.2%
Other values (4) 6
 
3.2%
Hangul
ValueCountFrequency (%)
26
17.8%
24
16.4%
21
14.4%
18
12.3%
15
10.3%
12
8.2%
6
 
4.1%
6
 
4.1%
3
 
2.1%
2
 
1.4%
Other values (13) 13
8.9%

Correlations

2024-03-14T10:22:25.992849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11
Unnamed: 11.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 21.0001.0001.0001.0001.0001.0001.0001.0001.0000.6731.000
Unnamed: 31.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 41.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 51.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 61.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 71.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 81.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 91.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 101.0000.6731.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 111.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
2024-03-14T10:22:26.104797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 1Unnamed: 10
Unnamed: 21.0000.8940.493
Unnamed: 10.8941.0000.959
Unnamed: 100.4930.9591.000
2024-03-14T10:22:26.186869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 10
Unnamed: 11.0000.8940.959
Unnamed: 20.8941.0000.493
Unnamed: 100.9590.4931.000

Missing values

2024-03-14T10:22:20.770000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T10:22:20.890393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T10:22:21.014019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소방시설 설계업 등록 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11
0NaN<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1순번지역조회지역상호대표자우편번호본사주소전화번호팩스번호최초면허(설계업)분야(설계업)등록번호(설계업)
21전북전주시티앤제이건설(주)고태유55008전라북도 전주시 덕진구 팽나무4길 15-5 (인후동1가)063-242-1071063-242-10782010-11-10일반(전기),일반(기계)전주덕진 제2010-4호
32전북전주시(주)주연이엔지김종배55062전라북도 전주시 완산구 호암로 82 402호(그린빌딩) (효자동2가)063-227-6606063-227-66072003-01-08일반(전기),일반(기계)전주완산 제2007-4호
43전북익산시지오방재이엔씨(주)장영철54663전라북도 익산시 고현로 24-3 (송학동)063-843-1119063-843-11392016-04-11전문2016-01-00057
54전북전주시(주)덕진소방박동규54923전라북도 전주시 덕진구 조경단로 38 (금암동)063-272-6119063-255-71192012-12-28일반(전기),일반(기계)전주덕진제2012-12호
65전북정읍시(유)우석엔지니어링이성준56158전라북도 정읍시 서부산업도로 302 (연지동 343-11, 은규빌딩 3F)063-538-2752063-538-27512011-06-22일반(전기),일반(기계)정읍제2011-01호
76전북전주시(주)대신기술단왕영식55038전라북도 전주시 완산구 전주천동로 210 (다가동2가)063-226-6500063-226-61991998-01-22일반(전기),일반(기계)전주완산 제98-71호
87전북부안군태인엔지니어링(주)시춘근56320전라북도 부안군 백산면 부평로 214-20 ()063-236-6200063-236-62012006-04-10일반(전기),일반(기계)부안 제2015-02호
98전북전주시주식회사 대화박진형55058전라북도 전주시 완산구 우전2길 45 (효자동2가)063-228-2127063-228-21292006-03-31일반(전기),일반(기계)전주완산제2006-10호
소방시설 설계업 등록 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11
1817전북전주시진성ESC김영국55122전라북도 전주시 완산구 평화14길 28-15 3층 (평화동1가)063-221-8547063-225-85492010-07-13일반(전기),일반(기계)전주완산제2010-10호
1918전북전주시(주)이린김한규54977전라북도 전주시 완산구 중산로 20 (중화산동2가)063-227-6626063-227-66962006-09-13일반(전기),일반(기계)전주완산제2006-19호
2019전북전주시(주)부윤엔지니어링백종호55123전라북도 전주시 완산구 모악로 4704 (평화동2가,1층)063-225-2750063-226-27512009-06-08일반(전기),일반(기계)전주완산제2009-5호
2120전북전주시광성엔지니어링정태종54941전라북도 전주시 완산구 전룡로 124 (서신동)063-252-4119063-252-43392011-02-10일반(전기),일반(기계)전주완산제2011-3호
2221전북익산시(주)대성건축사사무소김창호54621전라북도 익산시 선화로 259 (남중동)063-841-1203063-842-12041997-06-24일반(전기),일반(기계)제97-53호
2322전북전주시대한엔지니어링김경미54954전라북도 전주시 완산구 전룡6길 6 (서신동)063-221-2948063-221-87222009-04-07일반(전기),일반(기계)전주완산제2009-07호
2423전북전주시(유)유진엔지니어링김영수54962전라북도 전주시 완산구 마전1길 10 (효자동3가) 가온빌딩 3층063-224-6794063-228-67962012-05-08일반(전기),일반(기계)전주완산제2012-05호
2524전북전주시(주)호원엔지니어링신장근54846전라북도 전주시 덕진구 추천로 293 (팔복동1가)063-273-7557063-211-21812003-07-04일반(기계)전주덕진2003-3호
2625전북전주시(주)다우코퍼레이션박식54877전라북도 전주시 완산구 천잠로 453 (효자동3가)063-255-8200063-255-82302006-08-14일반(전기),일반(기계)전주완산제2006-15호
2726전북전주시원일엔지니어링공유원54980전라북도 전주시 완산구 산월2길 37 (중화산동2가) 3층063-229-1020063-229-10702015-06-26일반(기계)전주완산제2015-12호