Overview

Dataset statistics

Number of variables4
Number of observations190
Missing cells52
Missing cells (%)6.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory33.7 B

Variable types

Numeric1
Text3

Dataset

Description전북특별자치도 고창군에 위치한 비영리재단 및 중소기업에 대한 데이터로 명단과 전화번호 및 팩스번호에 대한 항목을 제공합니다.
Author전북특별자치도 고창군
URLhttps://www.data.go.kr/data/15126604/fileData.do

Alerts

전화번호 has 13 (6.8%) missing valuesMissing
팩스번호 has 39 (20.5%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 20:07:16.219261
Analysis finished2024-03-14 20:07:17.517929
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct190
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean95.5
Minimum1
Maximum190
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2024-03-15T05:07:17.752544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.45
Q148.25
median95.5
Q3142.75
95-th percentile180.55
Maximum190
Range189
Interquartile range (IQR)94.5

Descriptive statistics

Standard deviation54.992424
Coefficient of variation (CV)0.5758369
Kurtosis-1.2
Mean95.5
Median Absolute Deviation (MAD)47.5
Skewness0
Sum18145
Variance3024.1667
MonotonicityStrictly increasing
2024-03-15T05:07:18.196741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
132 1
 
0.5%
123 1
 
0.5%
124 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
127 1
 
0.5%
128 1
 
0.5%
129 1
 
0.5%
130 1
 
0.5%
Other values (180) 180
94.7%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
190 1
0.5%
189 1
0.5%
188 1
0.5%
187 1
0.5%
186 1
0.5%
185 1
0.5%
184 1
0.5%
183 1
0.5%
182 1
0.5%
181 1
0.5%
Distinct186
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-03-15T05:07:19.030860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length9.9842105
Min length2

Characters and Unicode

Total characters1897
Distinct characters246
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)95.8%

Sample

1st row(농)국순당고창명주(주)
2nd row(유)건설기업
3rd row(유)기능건설
4th row(유)대명산업
5th row(유)대승산업
ValueCountFrequency (%)
주식회사 36
 
13.0%
농업회사법인 13
 
4.7%
유한회사 13
 
4.7%
고창군농협조합공동사업법인 4
 
1.4%
영농조합법인 3
 
1.1%
제2공장 2
 
0.7%
흥진 2
 
0.7%
재단법인 2
 
0.7%
인플러스 2
 
0.7%
고창공장 2
 
0.7%
Other values (193) 198
71.5%
2024-03-15T05:07:19.969539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
 
4.7%
87
 
4.6%
84
 
4.4%
73
 
3.8%
66
 
3.5%
62
 
3.3%
( 59
 
3.1%
) 59
 
3.1%
58
 
3.1%
57
 
3.0%
Other values (236) 1203
63.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1681
88.6%
Space Separator 87
 
4.6%
Open Punctuation 59
 
3.1%
Close Punctuation 59
 
3.1%
Decimal Number 6
 
0.3%
Uppercase Letter 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
5.3%
84
 
5.0%
73
 
4.3%
66
 
3.9%
62
 
3.7%
58
 
3.5%
57
 
3.4%
53
 
3.2%
50
 
3.0%
45
 
2.7%
Other values (227) 1044
62.1%
Uppercase Letter
ValueCountFrequency (%)
R 1
20.0%
B 1
20.0%
L 1
20.0%
G 1
20.0%
U 1
20.0%
Space Separator
ValueCountFrequency (%)
87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 59
100.0%
Decimal Number
ValueCountFrequency (%)
2 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1681
88.6%
Common 211
 
11.1%
Latin 5
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
5.3%
84
 
5.0%
73
 
4.3%
66
 
3.9%
62
 
3.7%
58
 
3.5%
57
 
3.4%
53
 
3.2%
50
 
3.0%
45
 
2.7%
Other values (227) 1044
62.1%
Latin
ValueCountFrequency (%)
R 1
20.0%
B 1
20.0%
L 1
20.0%
G 1
20.0%
U 1
20.0%
Common
ValueCountFrequency (%)
87
41.2%
( 59
28.0%
) 59
28.0%
2 6
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1681
88.6%
ASCII 216
 
11.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
89
 
5.3%
84
 
5.0%
73
 
4.3%
66
 
3.9%
62
 
3.7%
58
 
3.5%
57
 
3.4%
53
 
3.2%
50
 
3.0%
45
 
2.7%
Other values (227) 1044
62.1%
ASCII
ValueCountFrequency (%)
87
40.3%
( 59
27.3%
) 59
27.3%
2 6
 
2.8%
R 1
 
0.5%
B 1
 
0.5%
L 1
 
0.5%
G 1
 
0.5%
U 1
 
0.5%

전화번호
Text

MISSING 

Distinct167
Distinct (%)94.4%
Missing13
Missing (%)6.8%
Memory size1.6 KiB
2024-03-15T05:07:20.854088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.050847
Min length12

Characters and Unicode

Total characters2133
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique158 ?
Unique (%)89.3%

Sample

1st row063-564-9800
2nd row063-564-9511
3rd row063-564-1476
4th row063-561-0885
5th row063-564-8983
ValueCountFrequency (%)
063-564-0880 3
 
1.7%
063-564-0113 2
 
1.1%
063-564-8511 2
 
1.1%
063-564-5008 2
 
1.1%
063-564-2009 2
 
1.1%
063-561-0008 2
 
1.1%
063-561-0209 2
 
1.1%
063-562-6088 2
 
1.1%
063-562-4176 2
 
1.1%
063-560-5113 1
 
0.6%
Other values (157) 157
88.7%
2024-03-15T05:07:21.948858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 381
17.9%
- 354
16.6%
0 323
15.1%
3 266
12.5%
5 231
10.8%
1 141
 
6.6%
4 111
 
5.2%
2 104
 
4.9%
7 83
 
3.9%
8 78
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1779
83.4%
Dash Punctuation 354
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 381
21.4%
0 323
18.2%
3 266
15.0%
5 231
13.0%
1 141
 
7.9%
4 111
 
6.2%
2 104
 
5.8%
7 83
 
4.7%
8 78
 
4.4%
9 61
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 354
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2133
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 381
17.9%
- 354
16.6%
0 323
15.1%
3 266
12.5%
5 231
10.8%
1 141
 
6.6%
4 111
 
5.2%
2 104
 
4.9%
7 83
 
3.9%
8 78
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2133
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 381
17.9%
- 354
16.6%
0 323
15.1%
3 266
12.5%
5 231
10.8%
1 141
 
6.6%
4 111
 
5.2%
2 104
 
4.9%
7 83
 
3.9%
8 78
 
3.7%

팩스번호
Text

MISSING 

Distinct138
Distinct (%)91.4%
Missing39
Missing (%)20.5%
Memory size1.6 KiB
2024-03-15T05:07:22.830796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.019868
Min length12

Characters and Unicode

Total characters1815
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)83.4%

Sample

1st row063-564-9804
2nd row063-561-2923
3rd row063-564-1476
4th row063-564-8984
5th row063-564-8984
ValueCountFrequency (%)
063-564-7001 3
 
2.0%
063-561-0998 2
 
1.3%
063-900-3508 2
 
1.3%
063-564-8514 2
 
1.3%
063-561-6187 2
 
1.3%
063-561-3576 2
 
1.3%
063-564-8984 2
 
1.3%
063-563-3482 2
 
1.3%
063-563-6680 2
 
1.3%
063-564-9990 2
 
1.3%
Other values (128) 130
86.1%
2024-03-15T05:07:24.685898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 331
18.2%
- 298
16.4%
0 246
13.6%
3 238
13.1%
5 202
11.1%
4 114
 
6.3%
1 104
 
5.7%
2 84
 
4.6%
9 75
 
4.1%
8 68
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1517
83.6%
Dash Punctuation 298
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 331
21.8%
0 246
16.2%
3 238
15.7%
5 202
13.3%
4 114
 
7.5%
1 104
 
6.9%
2 84
 
5.5%
9 75
 
4.9%
8 68
 
4.5%
7 55
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 298
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1815
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 331
18.2%
- 298
16.4%
0 246
13.6%
3 238
13.1%
5 202
11.1%
4 114
 
6.3%
1 104
 
5.7%
2 84
 
4.6%
9 75
 
4.1%
8 68
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1815
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 331
18.2%
- 298
16.4%
0 246
13.6%
3 238
13.1%
5 202
11.1%
4 114
 
6.3%
1 104
 
5.7%
2 84
 
4.6%
9 75
 
4.1%
8 68
 
3.7%

Interactions

2024-03-15T05:07:16.456163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-15T05:07:16.801823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T05:07:17.119854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T05:07:17.387532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번회사명전화번호팩스번호
01(농)국순당고창명주(주)063-564-9800063-564-9804
12(유)건설기업063-564-9511063-561-2923
23(유)기능건설063-564-1476063-564-1476
34(유)대명산업063-561-0885063-564-8984
45(유)대승산업063-564-8983063-564-8984
56(유)대진레미콘063-561-0500063-561-0503
67(유)도건엔지니어링063-564-1773063-561-1773
78(유)목인070-8688-5954063-900-3508
89(유)신도정보통신063-563-1718063-561-1719
910(유)신성레미콘063-561-0488063-561-1732
순번회사명전화번호팩스번호
180181해풍영농조합법인063-564-2590<NA>
181182현대종합금속(주)063-560-6060063-560-6099
182183현우이엔티063-923-9990063-923-9992
183184호암유기질비료063-561-6787063-561-3001
184185환경그린070-4226-778302-3442-4667
185186황토배기오늘김치063-564-6662<NA>
186187효자효부쌀미곡처리장063-564-5446063-564-0473
187188힐링푸드063-563-8787<NA>
188189(재)고창군장학재단063-560-8616063-560-2899
189190(재)고창문화관광재단063-561-1110063-561-5556