Overview

Dataset statistics

Number of variables5
Number of observations129
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory41.0 B

Variable types

Categorical3
Text2

Alerts

지역 is highly overall correlated with 담당부서 연락처High correlation
구분 is highly overall correlated with 담당부서 연락처High correlation
담당부서 연락처 is highly overall correlated with 지역 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 21:28:32.738558
Analysis finished2023-12-10 21:28:33.202865
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)11.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
용인시
28 
연천군
25 
포천시
16 
남양주시
10 
양주시
Other values (10)
41 

Length

Max length4
Median length3
Mean length3.1395349
Min length3

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row의왕시
2nd row의정부시
3rd row연천군
4th row연천군
5th row연천군

Common Values

ValueCountFrequency (%)
용인시 28
21.7%
연천군 25
19.4%
포천시 16
12.4%
남양주시 10
 
7.8%
양주시 9
 
7.0%
동두천시 7
 
5.4%
이천시 6
 
4.7%
김포시 5
 
3.9%
여주시 5
 
3.9%
광주시 5
 
3.9%
Other values (5) 13
10.1%

Length

2023-12-11T06:28:33.281195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용인시 28
21.7%
연천군 25
19.4%
포천시 16
12.4%
남양주시 10
 
7.8%
양주시 9
 
7.0%
동두천시 7
 
5.4%
이천시 6
 
4.7%
김포시 5
 
3.9%
여주시 5
 
3.9%
광주시 5
 
3.9%
Other values (5) 13
10.1%

구분
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
농산물
65 
가공식품
24 
축산물
19 
기타
10 
임산물
Other values (2)
 
3

Length

Max length4
Median length3
Mean length3.1085271
Min length2

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row가공품
2nd row농산물
3rd row농산물
4th row농산물
5th row농산물

Common Values

ValueCountFrequency (%)
농산물 65
50.4%
가공식품 24
 
18.6%
축산물 19
 
14.7%
기타 10
 
7.8%
임산물 8
 
6.2%
수산물 2
 
1.6%
가공품 1
 
0.8%

Length

2023-12-11T06:28:33.409378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:28:33.550431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농산물 65
50.4%
가공식품 24
 
18.6%
축산물 19
 
14.7%
기타 10
 
7.8%
임산물 8
 
6.2%
수산물 2
 
1.6%
가공품 1
 
0.8%
Distinct106
Distinct (%)82.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T06:28:33.888577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length4.9224806
Min length2

Characters and Unicode

Total characters635
Distinct characters190
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)77.5%

Sample

1st row쿠키라인
2nd row송산배
3rd row연천쌀
4th row아희와
5th row아희랑
ValueCountFrequency (%)
소요산 7
 
4.5%
자연다믄 7
 
4.5%
뜰안에 6
 
3.9%
된장 6
 
3.9%
자연채 5
 
3.2%
임금님표 5
 
3.2%
이천 5
 
3.2%
대왕님표 4
 
2.6%
햇살드리 3
 
1.9%
계란 1
 
0.6%
Other values (105) 105
68.2%
2023-12-11T06:28:34.367471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
3.9%
24
 
3.8%
22
 
3.5%
21
 
3.3%
17
 
2.7%
15
 
2.4%
14
 
2.2%
13
 
2.0%
11
 
1.7%
11
 
1.7%
Other values (180) 462
72.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 604
95.1%
Space Separator 25
 
3.9%
Uppercase Letter 3
 
0.5%
Decimal Number 2
 
0.3%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.0%
22
 
3.6%
21
 
3.5%
17
 
2.8%
15
 
2.5%
14
 
2.3%
13
 
2.2%
11
 
1.8%
11
 
1.8%
11
 
1.8%
Other values (173) 445
73.7%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
M 1
33.3%
Z 1
33.3%
Decimal Number
ValueCountFrequency (%)
0 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 604
95.1%
Common 28
 
4.4%
Latin 3
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.0%
22
 
3.6%
21
 
3.5%
17
 
2.8%
15
 
2.5%
14
 
2.3%
13
 
2.2%
11
 
1.8%
11
 
1.8%
11
 
1.8%
Other values (173) 445
73.7%
Common
ValueCountFrequency (%)
25
89.3%
/ 1
 
3.6%
0 1
 
3.6%
4 1
 
3.6%
Latin
ValueCountFrequency (%)
D 1
33.3%
M 1
33.3%
Z 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 604
95.1%
ASCII 31
 
4.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25
80.6%
D 1
 
3.2%
M 1
 
3.2%
/ 1
 
3.2%
Z 1
 
3.2%
0 1
 
3.2%
4 1
 
3.2%
Hangul
ValueCountFrequency (%)
24
 
4.0%
22
 
3.6%
21
 
3.5%
17
 
2.8%
15
 
2.5%
14
 
2.3%
13
 
2.2%
11
 
1.8%
11
 
1.8%
11
 
1.8%
Other values (173) 445
73.7%

품목
Text

Distinct79
Distinct (%)61.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T06:28:34.659721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length6
Mean length2.4883721
Min length1

Characters and Unicode

Total characters321
Distinct characters127
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)45.7%

Sample

1st row쿠기 빵
2nd row
3rd row
4th row현미
5th row현미
ValueCountFrequency (%)
9
 
6.8%
8
 
6.1%
포도 7
 
5.3%
버섯 5
 
3.8%
계란 4
 
3.0%
한우 4
 
3.0%
부추 3
 
2.3%
3
 
2.3%
김치 3
 
2.3%
한과 3
 
2.3%
Other values (72) 83
62.9%
2023-12-11T06:28:35.083289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
3.4%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
8
 
2.5%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (117) 230
71.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 308
96.0%
Decimal Number 4
 
1.2%
Space Separator 3
 
0.9%
Close Punctuation 2
 
0.6%
Open Punctuation 2
 
0.6%
Other Punctuation 1
 
0.3%
Math Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
3.6%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (110) 217
70.5%
Decimal Number
ValueCountFrequency (%)
1 3
75.0%
0 1
 
25.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 308
96.0%
Common 13
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
3.6%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (110) 217
70.5%
Common
ValueCountFrequency (%)
1 3
23.1%
3
23.1%
) 2
15.4%
( 2
15.4%
/ 1
 
7.7%
0 1
 
7.7%
~ 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 308
96.0%
ASCII 13
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
3.6%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
10
 
3.2%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
Other values (110) 217
70.5%
ASCII
ValueCountFrequency (%)
1 3
23.1%
3
23.1%
) 2
15.4%
( 2
15.4%
/ 1
 
7.7%
0 1
 
7.7%
~ 1
 
7.7%

담당부서 연락처
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)38.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
031-839-2317
25 
031-538-3723
10 
031-590-2315
031-8082-6112
031-860-2313
Other values (45)
69 

Length

Max length13
Median length12
Mean length12.077519
Min length12

Unique

Unique34 ?
Unique (%)26.4%

Sample

1st row031-345-2382
2nd row031-828-2312
3rd row031-839-2317
4th row031-839-2317
5th row031-839-2317

Common Values

ValueCountFrequency (%)
031-839-2317 25
19.4%
031-538-3723 10
 
7.8%
031-590-2315 9
 
7.0%
031-8082-6112 9
 
7.0%
031-860-2313 7
 
5.4%
031-760-2877 5
 
3.9%
031-980-2813 5
 
3.9%
031-310-2323 4
 
3.1%
031-538-3881 4
 
3.1%
031-644-2628 4
 
3.1%
Other values (40) 47
36.4%

Length

2023-12-11T06:28:35.217435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
031-839-2317 25
19.4%
031-538-3723 10
 
7.8%
031-590-2315 9
 
7.0%
031-8082-6112 9
 
7.0%
031-860-2313 7
 
5.4%
031-760-2877 5
 
3.9%
031-980-2813 5
 
3.9%
031-310-2323 4
 
3.1%
031-538-3881 4
 
3.1%
031-644-2628 4
 
3.1%
Other values (40) 47
36.4%

Correlations

2023-12-11T06:28:35.314388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분품목담당부서 연락처
지역1.0000.6910.0001.000
구분0.6911.0000.9970.934
품목0.0000.9971.0000.000
담당부서 연락처1.0000.9340.0001.000
2023-12-11T06:28:35.414152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
담당부서 연락처지역구분
담당부서 연락처1.0000.8320.571
지역0.8321.0000.387
구분0.5710.3871.000
2023-12-11T06:28:35.500911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분담당부서 연락처
지역1.0000.3870.832
구분0.3871.0000.571
담당부서 연락처0.8320.5711.000

Missing values

2023-12-11T06:28:33.057311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:28:33.154247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역구분브랜드명품목담당부서 연락처
0의왕시가공품쿠키라인쿠기 빵031-345-2382
1의정부시농산물송산배031-828-2312
2연천군농산물연천쌀031-839-2317
3연천군농산물아희와현미031-839-2317
4연천군농산물아희랑현미031-839-2317
5연천군가공식품선한이웃장류031-839-2317
6연천군축산물유환유정란달걀031-839-2317
7연천군농산물가람가온김치김치031-839-2317
8연천군농산물황진이031-839-2317
9연천군가공식품개성홍삼홍삼031-839-2317
지역구분브랜드명품목담당부서 연락처
119광주시농산물자연채느타리버섯031-760-2877
120연천군축산물연천토종꿀토종꿀031-839-2317
121구리시농산물구리먹골배031-550-2321
122구리시농산물백교부추부추031-550-2321
123연천군축산물연천대광꿀031-839-2317
124연천군임산물연천DMZ밤031-839-2317
125의왕시농산물의왕우렁쌀부추031-345-2392
126연천군가공식품고궁한과한과031-839-2317
127연천군농산물연천포도포도031-839-2317
128연천군가공식품율무막걸리막걸리031-839-2317