Overview

Dataset statistics

Number of variables7
Number of observations28
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory60.7 B

Variable types

Categorical3
Text4

Alerts

비고 is highly overall correlated with 시군 and 1 other fieldsHigh correlation
업종 is highly overall correlated with 비고High correlation
시군 is highly overall correlated with 비고High correlation
상호 has unique valuesUnique
대표자 has unique valuesUnique
소재지 has unique valuesUnique
연락처 has unique valuesUnique

Reproduction

Analysis started2024-03-13 23:46:23.567114
Analysis finished2024-03-13 23:46:24.045929
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
전주시
12 
남원시
군산시
익산시
정읍시
Other values (2)

Length

Max length3
Median length3
Mean length2.9285714
Min length1

Unique

Unique1 ?
Unique (%)3.6%

Sample

1st row
2nd row전주시
3rd row전주시
4th row전주시
5th row전주시

Common Values

ValueCountFrequency (%)
전주시 12
42.9%
남원시 5
17.9%
군산시 3
 
10.7%
익산시 3
 
10.7%
정읍시 2
 
7.1%
고창군 2
 
7.1%
1
 
3.6%

Length

2024-03-14T08:46:24.104301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T08:46:24.205887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전주시 12
42.9%
남원시 5
17.9%
군산시 3
 
10.7%
익산시 3
 
10.7%
정읍시 2
 
7.1%
고창군 2
 
7.1%
1
 
3.6%

업종
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)21.4%
Missing0
Missing (%)0.0%
Memory size356.0 B
보수단청업
18 
실측설계업
보존과학업
문화재감리업
6개업종
 
1

Length

Max length6
Median length5
Mean length5.0357143
Min length4

Unique

Unique2 ?
Unique (%)7.1%

Sample

1st row6개업종
2nd row보수단청업
3rd row보수단청업
4th row보수단청업
5th row보수단청업

Common Values

ValueCountFrequency (%)
보수단청업 18
64.3%
실측설계업 3
 
10.7%
보존과학업 3
 
10.7%
문화재감리업 2
 
7.1%
6개업종 1
 
3.6%
식물보호업 1
 
3.6%

Length

2024-03-14T08:46:24.318494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T08:46:24.416374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보수단청업 18
64.3%
실측설계업 3
 
10.7%
보존과학업 3
 
10.7%
문화재감리업 2
 
7.1%
6개업종 1
 
3.6%
식물보호업 1
 
3.6%

상호
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T08:46:24.590988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8.5
Mean length5.9642857
Min length3

Characters and Unicode

Total characters167
Distinct characters75
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row27개소
2nd row(유)아산종합건설
3rd row㈜조양
4th row㈜토울
5th row㈜혜전건설
ValueCountFrequency (%)
27개소 1
 
3.6%
유)아산종합건설 1
 
3.6%
유)건웅종합건설 1
 
3.6%
보림문화재 1
 
3.6%
유)하늘채 1
 
3.6%
유)동문 1
 
3.6%
㈜동강종합건설 1
 
3.6%
㈜남전종합건설 1
 
3.6%
㈜예원종합건설 1
 
3.6%
㈜상락문화재수리 1
 
3.6%
Other values (18) 18
64.3%
2024-03-14T08:46:24.934207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
9.6%
13
 
7.8%
9
 
5.4%
7
 
4.2%
7
 
4.2%
( 6
 
3.6%
6
 
3.6%
) 6
 
3.6%
5
 
3.0%
4
 
2.4%
Other values (65) 88
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137
82.0%
Other Symbol 16
 
9.6%
Open Punctuation 6
 
3.6%
Close Punctuation 6
 
3.6%
Decimal Number 2
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
9.5%
9
 
6.6%
7
 
5.1%
7
 
5.1%
6
 
4.4%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (60) 76
55.5%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
7 1
50.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 153
91.6%
Common 14
 
8.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
10.5%
13
 
8.5%
9
 
5.9%
7
 
4.6%
7
 
4.6%
6
 
3.9%
5
 
3.3%
4
 
2.6%
4
 
2.6%
3
 
2.0%
Other values (61) 79
51.6%
Common
ValueCountFrequency (%)
( 6
42.9%
) 6
42.9%
2 1
 
7.1%
7 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 137
82.0%
None 16
 
9.6%
ASCII 14
 
8.4%

Most frequent character per block

None
ValueCountFrequency (%)
16
100.0%
Hangul
ValueCountFrequency (%)
13
 
9.5%
9
 
6.6%
7
 
5.1%
7
 
5.1%
6
 
4.4%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (60) 76
55.5%
ASCII
ValueCountFrequency (%)
( 6
42.9%
) 6
42.9%
2 1
 
7.1%
7 1
 
7.1%

대표자
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T08:46:25.201500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9285714
Min length1

Characters and Unicode

Total characters82
Distinct characters54
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row-
2nd row한기수
3rd row이호석
4th row김재문
5th row오무웅
ValueCountFrequency (%)
1
 
3.6%
한기수 1
 
3.6%
정경호 1
 
3.6%
임선기 1
 
3.6%
박은주 1
 
3.6%
서보근 1
 
3.6%
함병진 1
 
3.6%
유도현 1
 
3.6%
송선희 1
 
3.6%
김두용 1
 
3.6%
Other values (18) 18
64.3%
2024-03-14T08:46:25.498742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
7.3%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
2
 
2.4%
2
 
2.4%
Other values (44) 50
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 81
98.8%
Dash Punctuation 1
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
7.4%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
2
 
2.5%
2
 
2.5%
Other values (43) 49
60.5%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 81
98.8%
Common 1
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
7.4%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
2
 
2.5%
2
 
2.5%
Other values (43) 49
60.5%
Common
ValueCountFrequency (%)
- 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 81
98.8%
ASCII 1
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
7.4%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
2
 
2.5%
2
 
2.5%
Other values (43) 49
60.5%
ASCII
ValueCountFrequency (%)
- 1
100.0%

소재지
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T08:46:25.707232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length24
Mean length18.107143
Min length1

Characters and Unicode

Total characters507
Distinct characters99
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row-
2nd row전주시 완산구 태평2길 22 2층 202호
3rd row전주시 덕진구 백제대로 670(금암동)
4th row전주시 완산구 인정3길 1 .2층
5th row전주시 완산구 홍산1길 12(효자동2가 2층)
ValueCountFrequency (%)
전주시 12
 
10.6%
완산구 8
 
7.1%
남원시 5
 
4.4%
덕진구 4
 
3.5%
익산시 3
 
2.7%
군산시 3
 
2.7%
36 3
 
2.7%
2층 3
 
2.7%
서동로 2
 
1.8%
백릉로 2
 
1.8%
Other values (63) 68
60.2%
2024-03-14T08:46:26.084404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
16.8%
26
 
5.1%
2 25
 
4.9%
1 20
 
3.9%
19
 
3.7%
19
 
3.7%
3 17
 
3.4%
) 13
 
2.6%
( 13
 
2.6%
13
 
2.6%
Other values (89) 257
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 286
56.4%
Decimal Number 103
 
20.3%
Space Separator 85
 
16.8%
Close Punctuation 13
 
2.6%
Open Punctuation 13
 
2.6%
Dash Punctuation 5
 
1.0%
Other Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
9.1%
19
 
6.6%
19
 
6.6%
13
 
4.5%
12
 
4.2%
12
 
4.2%
12
 
4.2%
8
 
2.8%
8
 
2.8%
7
 
2.4%
Other values (73) 150
52.4%
Decimal Number
ValueCountFrequency (%)
2 25
24.3%
1 20
19.4%
3 17
16.5%
6 9
 
8.7%
0 9
 
8.7%
7 8
 
7.8%
4 6
 
5.8%
8 4
 
3.9%
5 3
 
2.9%
9 2
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 286
56.4%
Common 221
43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
9.1%
19
 
6.6%
19
 
6.6%
13
 
4.5%
12
 
4.2%
12
 
4.2%
12
 
4.2%
8
 
2.8%
8
 
2.8%
7
 
2.4%
Other values (73) 150
52.4%
Common
ValueCountFrequency (%)
85
38.5%
2 25
 
11.3%
1 20
 
9.0%
3 17
 
7.7%
) 13
 
5.9%
( 13
 
5.9%
6 9
 
4.1%
0 9
 
4.1%
7 8
 
3.6%
4 6
 
2.7%
Other values (6) 16
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 286
56.4%
ASCII 221
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
85
38.5%
2 25
 
11.3%
1 20
 
9.0%
3 17
 
7.7%
) 13
 
5.9%
( 13
 
5.9%
6 9
 
4.1%
0 9
 
4.1%
7 8
 
3.6%
4 6
 
2.7%
Other values (6) 16
 
7.2%
Hangul
ValueCountFrequency (%)
26
 
9.1%
19
 
6.6%
19
 
6.6%
13
 
4.5%
12
 
4.2%
12
 
4.2%
12
 
4.2%
8
 
2.8%
8
 
2.8%
7
 
2.4%
Other values (73) 150
52.4%

연락처
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T08:46:26.272717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.75
Min length1

Characters and Unicode

Total characters217
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row-
2nd row236-6700
3rd row251-3771
4th row228-6730
5th row228-0150
ValueCountFrequency (%)
1
 
3.6%
236-6700 1
 
3.6%
562-7797 1
 
3.6%
625-5050 1
 
3.6%
626-4111 1
 
3.6%
245-7304 1
 
3.6%
625-3103 1
 
3.6%
632-7320 1
 
3.6%
571-0403 1
 
3.6%
535-9913 1
 
3.6%
Other values (18) 18
64.3%
2024-03-14T08:46:26.557117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 30
13.8%
- 28
12.9%
0 26
12.0%
5 25
11.5%
3 24
11.1%
7 21
9.7%
1 20
9.2%
6 16
7.4%
4 11
 
5.1%
8 10
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 189
87.1%
Dash Punctuation 28
 
12.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 30
15.9%
0 26
13.8%
5 25
13.2%
3 24
12.7%
7 21
11.1%
1 20
10.6%
6 16
8.5%
4 11
 
5.8%
8 10
 
5.3%
9 6
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 217
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 30
13.8%
- 28
12.9%
0 26
12.0%
5 25
11.5%
3 24
11.1%
7 21
9.7%
1 20
9.2%
6 16
7.4%
4 11
 
5.1%
8 10
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 217
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 30
13.8%
- 28
12.9%
0 26
12.0%
5 25
11.5%
3 24
11.1%
7 21
9.7%
1 20
9.2%
6 16
7.4%
4 11
 
5.1%
8 10
 
4.6%

비고
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size356.0 B
종합
18 
전문
-
 
1

Length

Max length2
Median length2
Mean length1.9642857
Min length1

Unique

Unique1 ?
Unique (%)3.6%

Sample

1st row-
2nd row종합
3rd row종합
4th row종합
5th row종합

Common Values

ValueCountFrequency (%)
종합 18
64.3%
전문 9
32.1%
- 1
 
3.6%

Length

2024-03-14T08:46:26.665095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T08:46:26.747028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종합 18
64.3%
전문 9
32.1%
1
 
3.6%

Correlations

2024-03-14T08:46:26.804363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군업종상호대표자소재지연락처비고
시군1.0000.4861.0001.0001.0001.0000.786
업종0.4861.0001.0001.0001.0001.0001.000
상호1.0001.0001.0001.0001.0001.0001.000
대표자1.0001.0001.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.0001.000
연락처1.0001.0001.0001.0001.0001.0001.000
비고0.7861.0001.0001.0001.0001.0001.000
2024-03-14T08:46:26.894065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고업종시군
비고1.0000.9380.667
업종0.9381.0000.293
시군0.6670.2931.000
2024-03-14T08:46:27.307404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군업종비고
시군1.0000.2930.667
업종0.2931.0000.938
비고0.6670.9381.000

Missing values

2024-03-14T08:46:23.898626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T08:46:24.006705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군업종상호대표자소재지연락처비고
06개업종27개소----
1전주시보수단청업(유)아산종합건설한기수전주시 완산구 태평2길 22 2층 202호236-6700종합
2전주시보수단청업㈜조양이호석전주시 덕진구 백제대로 670(금암동)251-3771종합
3전주시보수단청업㈜토울김재문전주시 완산구 인정3길 1 .2층228-6730종합
4전주시보수단청업㈜혜전건설오무웅전주시 완산구 홍산1길 12(효자동2가 2층)228-0150종합
5전주시보수단청업㈜한백건설유용식전주시 덕진구 백제대로 782(우아동 3가)211-9339종합
6전주시실측설계업㈜길건축사박태우전주시 완산구 서곡5길 14-3276-7200전문
7전주시실측설계업아리건축사심재경전주시 완산구 감나무3길 776-2253-7402전문
8전주시실측설계업㈜미추홀건축사무소강선중전주시 완산구 화산쳔변 3길 3-4(중화산동 2가)237-0137전문
9전주시보존과학업한림보존테크함성희전주시 완산구 서곡3길 15243-0388전문
시군업종상호대표자소재지연락처비고
18익산시보수단청업㈜청광건설이상원익산시 서동로 9(인화동 2가)855-1310종합
19정읍시보수단청업㈜상락문화재수리김두용정읍시 수성로 20535-9913종합
20정읍시보수단청업㈜예원종합건설송선희정읍시 신태인읍 정신로 1133-1(신흥철물2층)571-0403종합
21남원시보수단청업㈜남전종합건설유도현남원시 남문로 467(죽항동)632-7320종합
22남원시보수단청업㈜동강종합건설함병진남원시 용성로 11625-3103종합
23남원시보수단청업(유)동문서보근남원시 황죽로 37(고죽동)245-7304종합
24남원시보수단청업(유)하늘채박은주남원시 사매면 덕오로 101626-4111종합
25남원시보존과학업보림문화재임선기남원시 시묘길 122625-5050전문
26고창군보수단청업(유)건웅종합건설정경호고창군 고창읍 중앙로 239562-7797종합
27고창군보수단청업(유)성내이용구고창군 고창읍 성산로 36 13호(3층, 케이티고창빌딩)562-7304종합