Overview

Dataset statistics

Number of variables4
Number of observations28
Missing cells16
Missing cells (%)14.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 KiB
Average record size in memory36.7 B

Variable types

Text3
Categorical1

Alerts

도내 정기시장(5일장) has 16 (57.1%) missing valuesMissing
Unnamed: 1 has unique valuesUnique
Unnamed: 2 has unique valuesUnique

Reproduction

Analysis started2024-03-14 01:30:35.499455
Analysis finished2024-03-14 01:30:35.840649
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct12
Distinct (%)100.0%
Missing16
Missing (%)57.1%
Memory size356.0 B
2024-03-14T10:30:35.924691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length8.1666667
Min length3

Characters and Unicode

Total characters98
Distinct characters30
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)100.0%

Sample

1st row시군명
2nd row10 시군
3rd row익 산 시 (1)
4th row정 읍 시 (1)
5th row남 원 시 (2)
ValueCountFrequency (%)
6
 
14.0%
4
 
9.3%
1 3
 
7.0%
2
 
4.7%
3 2
 
4.7%
2 2
 
4.7%
4 2
 
4.7%
2
 
4.7%
시군명 1
 
2.3%
1
 
2.3%
Other values (18) 18
41.9%
2024-03-14T10:30:36.182328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
31.6%
) 10
 
10.2%
( 10
 
10.2%
8
 
8.2%
6
 
6.1%
1 4
 
4.1%
2
 
2.0%
2
 
2.0%
3 2
 
2.0%
2 2
 
2.0%
Other values (20) 21
21.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35
35.7%
Space Separator 31
31.6%
Decimal Number 12
 
12.2%
Close Punctuation 10
 
10.2%
Open Punctuation 10
 
10.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
22.9%
6
17.1%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (11) 11
31.4%
Decimal Number
ValueCountFrequency (%)
1 4
33.3%
3 2
16.7%
2 2
16.7%
4 2
16.7%
0 1
 
8.3%
5 1
 
8.3%
Space Separator
ValueCountFrequency (%)
31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 63
64.3%
Hangul 35
35.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
22.9%
6
17.1%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (11) 11
31.4%
Common
ValueCountFrequency (%)
31
49.2%
) 10
 
15.9%
( 10
 
15.9%
1 4
 
6.3%
3 2
 
3.2%
2 2
 
3.2%
4 2
 
3.2%
0 1
 
1.6%
5 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63
64.3%
Hangul 35
35.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31
49.2%
) 10
 
15.9%
( 10
 
15.9%
1 4
 
6.3%
3 2
 
3.2%
2 2
 
3.2%
4 2
 
3.2%
0 1
 
1.6%
5 1
 
1.6%
Hangul
ValueCountFrequency (%)
8
22.9%
6
17.1%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (11) 11
31.4%

Unnamed: 1
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T10:30:36.371576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length3.9642857
Min length3

Characters and Unicode

Total characters111
Distinct characters42
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row시장명
2nd row합 계
3rd row여산시장
4th row신태인시장
5th row인월시장
ValueCountFrequency (%)
시장명 1
 
3.4%
산서시장 1
 
3.4%
무장시장 1
 
3.4%
대산시장 1
 
3.4%
해리시장 1
 
3.4%
흥덕시장 1
 
3.4%
복흥시장 1
 
3.4%
동계시장 1
 
3.4%
순창시장 1
 
3.4%
강진시장 1
 
3.4%
Other values (19) 19
65.5%
2024-03-14T10:30:36.648075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
27.0%
27
24.3%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (32) 35
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 110
99.1%
Space Separator 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
27.3%
27
24.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (31) 34
30.9%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 110
99.1%
Common 1
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
27.3%
27
24.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (31) 34
30.9%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 110
99.1%
ASCII 1
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
27.3%
27
24.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (31) 34
30.9%
ASCII
ValueCountFrequency (%)
1
100.0%

Unnamed: 2
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-03-14T10:30:36.854094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length11.071429
Min length5

Characters and Unicode

Total characters310
Distinct characters83
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row시장소재지
2nd row26개 시장
3rd row여산면 서촌2길 21
4th row신태인읍 시장2길 14
5th row인월면 인월로65-3
ValueCountFrequency (%)
2 2
 
2.6%
14 2
 
2.6%
3 2
 
2.6%
시장소재지 1
 
1.3%
사선1길 1
 
1.3%
순창읍 1
 
1.3%
14-12 1
 
1.3%
호국로 1
 
1.3%
강진면 1
 
1.3%
70-3 1
 
1.3%
Other values (65) 65
83.3%
2024-03-14T10:30:37.187370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
 
16.1%
1 19
 
6.1%
19
 
6.1%
17
 
5.5%
2 15
 
4.8%
3 11
 
3.5%
10
 
3.2%
9
 
2.9%
- 8
 
2.6%
8
 
2.6%
Other values (73) 144
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 177
57.1%
Decimal Number 75
24.2%
Space Separator 50
 
16.1%
Dash Punctuation 8
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
10.7%
17
 
9.6%
10
 
5.6%
9
 
5.1%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
4
 
2.3%
Other values (61) 85
48.0%
Decimal Number
ValueCountFrequency (%)
1 19
25.3%
2 15
20.0%
3 11
14.7%
4 7
 
9.3%
6 6
 
8.0%
8 5
 
6.7%
5 5
 
6.7%
0 4
 
5.3%
7 2
 
2.7%
9 1
 
1.3%
Space Separator
ValueCountFrequency (%)
50
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 177
57.1%
Common 133
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
10.7%
17
 
9.6%
10
 
5.6%
9
 
5.1%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
4
 
2.3%
Other values (61) 85
48.0%
Common
ValueCountFrequency (%)
50
37.6%
1 19
 
14.3%
2 15
 
11.3%
3 11
 
8.3%
- 8
 
6.0%
4 7
 
5.3%
6 6
 
4.5%
8 5
 
3.8%
5 5
 
3.8%
0 4
 
3.0%
Other values (2) 3
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 177
57.1%
ASCII 133
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
37.6%
1 19
 
14.3%
2 15
 
11.3%
3 11
 
8.3%
- 8
 
6.0%
4 7
 
5.3%
6 6
 
4.5%
8 5
 
3.8%
5 5
 
3.8%
0 4
 
3.0%
Other values (2) 3
 
2.3%
Hangul
ValueCountFrequency (%)
19
 
10.7%
17
 
9.6%
10
 
5.6%
9
 
5.1%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
4
 
2.3%
Other values (61) 85
48.0%

Unnamed: 3
Categorical

Distinct7
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
1,6
3,8
5,10
2,7
4,9
Other values (2)

Length

Max length4
Median length3
Mean length3.2142857
Min length3

Unique

Unique2 ?
Unique (%)7.1%

Sample

1st row장 날
2nd row<NA>
3rd row1,6
4th row3,8
5th row3,8

Common Values

ValueCountFrequency (%)
1,6 7
25.0%
3,8 6
21.4%
5,10 5
17.9%
2,7 5
17.9%
4,9 3
10.7%
장 날 1
 
3.6%
<NA> 1
 
3.6%

Length

2024-03-14T10:30:37.299603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:30:37.401109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1,6 7
24.1%
3,8 6
20.7%
5,10 5
17.2%
2,7 5
17.2%
4,9 3
10.3%
1
 
3.4%
1
 
3.4%
na 1
 
3.4%

Correlations

2024-03-14T10:30:37.467371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도내 정기시장(5일장)Unnamed: 1Unnamed: 2Unnamed: 3
도내 정기시장(5일장)1.0001.0001.0001.000
Unnamed: 11.0001.0001.0001.000
Unnamed: 21.0001.0001.0001.000
Unnamed: 31.0001.0001.0001.000

Missing values

2024-03-14T10:30:35.727942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T10:30:35.808300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도내 정기시장(5일장)Unnamed: 1Unnamed: 2Unnamed: 3
0시군명시장명시장소재지장 날
110 시군합 계26개 시장<NA>
2익 산 시 (1)여산시장여산면 서촌2길 211,6
3정 읍 시 (1)신태인시장신태인읍 시장2길 143,8
4남 원 시 (2)인월시장인월면 인월로65-33,8
5<NA>운봉시장운봉읍 운성로 201,6
6김 제 시 (1)원평시장금산면 원평6길 3-104,9
7완 주 군 (2)봉동시장봉동읍 봉동동서로134-55,10
8<NA>운주시장운주면 운주로 134-181,6
9무 주 군 (4)무주시장무주읍 장터로 21,6
도내 정기시장(5일장)Unnamed: 1Unnamed: 2Unnamed: 3
18<NA>관촌시장관촌면 사선1길 70-35,10
19<NA>강진시장강진면 호국로 14-122,7
20순 창 군 (3)순창시장순창읍 남계로 581,6
21<NA>동계시장동계면 동계로 222,7
22<NA>복흥시장복흥면 정산2길 23,8
23고 창 군 (5)흥덕시장흥덕면 흥덕시장길 34,9
24<NA>해리시장해리면 남시길 144,9
25<NA>대산시장대산면 공음대산로 9352,7
26<NA>무장시장무장면 왕제산로 7255,10
27<NA>상하시장상하면 명동1길 31,6