Overview

Dataset statistics

Number of variables: 4
Number of observations: 26
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 964.0 B
Average record size in memory: 37.1 B
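
The headline figures above can be recomputed from raw rows. The following is a minimal standard-library sketch, not the profiler's own implementation: the helper name `dataset_statistics` is hypothetical, its definitions of "missing" and "duplicate" are assumptions, and the five rows are the first rows shown in the Sample section of this report.

```python
from collections import Counter


def dataset_statistics(rows):
    """Summarise a list of equal-length tuples (hypothetical helper)."""
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    total_cells = n_rows * n_cols
    # Assumption: a cell is missing when it is None or an empty string.
    missing = sum(1 for row in rows for cell in row if cell is None or cell == "")
    # Assumption: every repeat of an already-seen row counts as one duplicate.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing,
        "missing_pct": 100.0 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_pct": 100.0 * duplicates / n_rows if n_rows else 0.0,
    }


# First five rows of the dataset, as listed in the Sample section.
rows = [
    ("익 산 시", "여산시장", "여산면 서촌2길 21", "1,6"),
    ("정 읍 시", "신태인시장", "신태인읍 시장2길 14", "3,8"),
    ("남 원 시", "인월시장", "인월면 인월로65-3", "3,8"),
    ("남 원 시", "운봉시장", "운봉읍 운성로 20", "1,6"),
    ("김 제 시", "원평시장", "금산면 원평6길 3-10", "4,9"),
]
stats = dataset_statistics(rows)

# Average record size = total size in memory / number of observations.
avg_record_size = 964.0 / 26
print(stats, round(avg_record_size, 1))
```

The last line reproduces the reported average record size from the two figures in the table (964.0 B over 26 observations).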

Variable types

Categorical: 2
Text: 2

Alerts

시장명 has unique values (Unique)
시장소재지 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 01:30:32.770792
Analysis finished: 2024-03-14 01:30:33.012713
Duration: 0.24 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명 (city/county name)
Categorical

Distinct: 10
Distinct (%): 38.5%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B
고 창 군
무 주 군
임 실 군
장 수 군
순 창 군
Other values (5)

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 3
Unique (%): 11.5%
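
The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). The sketch below illustrates that reading with the standard library only; the counts are copied from the Common Values table for 시군명 in this report, and the variable names are illustrative.

```python
from collections import Counter

# Counts per 시군명 value, copied from the Common Values table of this report.
value_counts = Counter({
    "고 창 군": 5, "무 주 군": 4, "임 실 군": 4, "장 수 군": 3, "순 창 군": 3,
    "남 원 시": 2, "완 주 군": 2, "익 산 시": 1, "정 읍 시": 1, "김 제 시": 1,
})

n_rows = sum(value_counts.values())                         # number of observations
distinct = len(value_counts)                                # different values
unique = sum(1 for c in value_counts.values() if c == 1)    # values seen exactly once

distinct_pct = round(100 * distinct / n_rows, 1)
unique_pct = round(100 * unique / n_rows, 1)
print(n_rows, distinct, distinct_pct, unique, unique_pct)
```

With these counts the computed figures agree with the Distinct and Unique values reported for this variable.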

Sample

1st row: 익 산 시
2nd row: 정 읍 시
3rd row: 남 원 시
4th row: 남 원 시
5th row: 김 제 시

Common Values

Value: Count (Frequency %)
고 창 군: 5 (19.2%)
무 주 군: 4 (15.4%)
임 실 군: 4 (15.4%)
장 수 군: 3 (11.5%)
순 창 군: 3 (11.5%)
남 원 시: 2 (7.7%)
완 주 군: 2 (7.7%)
익 산 시: 1 (3.8%)
정 읍 시: 1 (3.8%)
김 제 시: 1 (3.8%)

Length

Histogram of lengths of the category

Common Values (Plot)

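Value: Count (Frequency %)
(not preserved): 21 (26.9%)
(not preserved): 8 (10.3%)
(not preserved): 6 (7.7%)
(not preserved): 5 (6.4%)
(not preserved): 5 (6.4%)
(not preserved): 4 (5.1%)
(not preserved): 4 (5.1%)
(not preserved): 4 (5.1%)
(not preserved): 3 (3.8%)
(not preserved): 3 (3.8%)
Other values (10): 15 (19.2%)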
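
The counts in the table above are consistent with splitting each 시군명 value on whitespace and counting the resulting tokens. That reading is an assumption, not something the report states; the sketch below checks it against the value counts from the Common Values table, using only the standard library.

```python
from collections import Counter

# Counts per 시군명 value, copied from the Common Values table of this report.
value_counts = {
    "고 창 군": 5, "무 주 군": 4, "임 실 군": 4, "장 수 군": 3, "순 창 군": 3,
    "남 원 시": 2, "완 주 군": 2, "익 산 시": 1, "정 읍 시": 1, "김 제 시": 1,
}

# Assumption: tokens are whitespace-separated pieces of each value.
tokens = Counter()
for value, count in value_counts.items():
    for token in value.split():
        tokens[token] += count

total = sum(tokens.values())
top3 = tokens.most_common(3)
top3_pct = [round(100 * count / total, 1) for _, count in top3]
print(total, top3, top3_pct)
```

Under this assumption the three most frequent tokens and their percentages match the first three rows of the table, and the number of distinct tokens minus the ten listed rows matches the "Other values (10)" row.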

시장명 (market name)
Text

UNIQUE

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 5
Median length: 4
Mean length: 4.0384615
Min length: 4

Characters and Unicode

Total characters: 105
Distinct characters: 39
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row: 여산시장
2nd row: 신태인시장
3rd row: 인월시장
4th row: 운봉시장
5th row: 원평시장

Value: Count (Frequency %)
여산시장: 1 (3.8%)
신태인시장: 1 (3.8%)
무장시장: 1 (3.8%)
대산시장: 1 (3.8%)
해리시장: 1 (3.8%)
흥덕시장: 1 (3.8%)
복흥시장: 1 (3.8%)
동계시장: 1 (3.8%)
순창시장: 1 (3.8%)
강진시장: 1 (3.8%)
Other values (16): 16 (61.5%)

Most occurring characters

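Value: Count (Frequency %)
(not preserved): 29 (27.6%)
(not preserved): 26 (24.8%)
(not preserved): 3 (2.9%)
(not preserved): 3 (2.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
Other values (29): 32 (30.5%)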

Most occurring categories

Value: Count (Frequency %)
Other Letter: 105 (100.0%)

Most frequent character per category

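Other Letter
Value: Count (Frequency %)
(not preserved): 29 (27.6%)
(not preserved): 26 (24.8%)
(not preserved): 3 (2.9%)
(not preserved): 3 (2.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
Other values (29): 32 (30.5%)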

Most occurring scripts

Value: Count (Frequency %)
Hangul: 105 (100.0%)

Most frequent character per script

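Hangul
Value: Count (Frequency %)
(not preserved): 29 (27.6%)
(not preserved): 26 (24.8%)
(not preserved): 3 (2.9%)
(not preserved): 3 (2.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
Other values (29): 32 (30.5%)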

Most occurring blocks

Value: Count (Frequency %)
Hangul: 105 (100.0%)

Most frequent character per block

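Hangul
Value: Count (Frequency %)
(not preserved): 29 (27.6%)
(not preserved): 26 (24.8%)
(not preserved): 3 (2.9%)
(not preserved): 3 (2.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
(not preserved): 2 (1.9%)
Other values (29): 32 (30.5%)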

시장소재지 (market location)
Text

UNIQUE

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Length

Max length: 15
Median length: 13.5
Mean length: 11.5
Min length: 9

Characters and Unicode

Total characters: 299
Distinct characters: 79
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 100.0%

Sample

1st row: 여산면 서촌2길 21
2nd row: 신태인읍 시장2길 14
3rd row: 인월면 인월로65-3
4th row: 운봉읍 운성로 20
5th row: 금산면 원평6길 3-10

Value: Count (Frequency %)
2: 2 (2.7%)
14: 2 (2.7%)
3: 2 (2.7%)
사선1길: 1 (1.3%)
남계로: 1 (1.3%)
순창읍: 1 (1.3%)
14-12: 1 (1.3%)
호국로: 1 (1.3%)
강진면: 1 (1.3%)
70-3: 1 (1.3%)
Other values (62): 62 (82.7%)

Most occurring characters

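Value: Count (Frequency %)
(space): 49 (16.4%)
(not preserved): 19 (6.4%)
1: 19 (6.4%)
(not preserved): 17 (5.7%)
2: 14 (4.7%)
3: 11 (3.7%)
(not preserved): 9 (3.0%)
(not preserved): 8 (2.7%)
-: 8 (2.7%)
(not preserved): 8 (2.7%)
Other values (69): 137 (45.8%)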

Most occurring categories

Value: Count (Frequency %)
Other Letter: 169 (56.5%)
Decimal Number: 73 (24.4%)
Space Separator: 49 (16.4%)
Dash Punctuation: 8 (2.7%)
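
The category names above are Unicode general categories. As a minimal sketch of how such a breakdown can be computed, the example below classifies each character with Python's standard `unicodedata` module. The helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative names, not part of the profiler, and the input is limited to the five sample addresses listed for 시장소재지 in this report, so the totals differ from the full-column figures.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category (hypothetical helper)."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows shown for 시장소재지.
sample = [
    "여산면 서촌2길 21",
    "신태인읍 시장2길 14",
    "인월면 인월로65-3",
    "운봉읍 운성로 20",
    "금산면 원평6길 3-10",
]
counts = category_counts(sample)
print(counts.most_common())
```

Hangul syllables fall under "Other Letter", ASCII digits under "Decimal Number", the space under "Space Separator", and the hyphen-minus under "Dash Punctuation", which is the same four-way split the report shows for the full column.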

Most frequent character per category

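Other Letter
Value: Count (Frequency %)
(not preserved): 19 (11.2%)
(not preserved): 17 (10.1%)
(not preserved): 9 (5.3%)
(not preserved): 8 (4.7%)
(not preserved): 8 (4.7%)
(not preserved): 7 (4.1%)
(not preserved): 6 (3.6%)
(not preserved): 5 (3.0%)
(not preserved): 5 (3.0%)
(not preserved): 4 (2.4%)
Other values (57): 81 (47.9%)

Decimal Number
Value: Count (Frequency %)
1: 19 (26.0%)
2: 14 (19.2%)
3: 11 (15.1%)
4: 7 (9.6%)
5: 5 (6.8%)
6: 5 (6.8%)
8: 5 (6.8%)
0: 4 (5.5%)
7: 2 (2.7%)
9: 1 (1.4%)

Space Separator
Value: Count (Frequency %)
(space): 49 (100.0%)

Dash Punctuation
Value: Count (Frequency %)
-: 8 (100.0%)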

Most occurring scripts

Value: Count (Frequency %)
Hangul: 169 (56.5%)
Common: 130 (43.5%)

Most frequent character per script

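Hangul
Value: Count (Frequency %)
(not preserved): 19 (11.2%)
(not preserved): 17 (10.1%)
(not preserved): 9 (5.3%)
(not preserved): 8 (4.7%)
(not preserved): 8 (4.7%)
(not preserved): 7 (4.1%)
(not preserved): 6 (3.6%)
(not preserved): 5 (3.0%)
(not preserved): 5 (3.0%)
(not preserved): 4 (2.4%)
Other values (57): 81 (47.9%)

Common
Value: Count (Frequency %)
(space): 49 (37.7%)
1: 19 (14.6%)
2: 14 (10.8%)
3: 11 (8.5%)
-: 8 (6.2%)
4: 7 (5.4%)
5: 5 (3.8%)
6: 5 (3.8%)
8: 5 (3.8%)
0: 4 (3.1%)
Other values (2): 3 (2.3%)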

Most occurring blocks

Value: Count (Frequency %)
Hangul: 169 (56.5%)
ASCII: 130 (43.5%)

Most frequent character per block

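ASCII
Value: Count (Frequency %)
(space): 49 (37.7%)
1: 19 (14.6%)
2: 14 (10.8%)
3: 11 (8.5%)
-: 8 (6.2%)
4: 7 (5.4%)
5: 5 (3.8%)
6: 5 (3.8%)
8: 5 (3.8%)
0: 4 (3.1%)
Other values (2): 3 (2.3%)

Hangul
Value: Count (Frequency %)
(not preserved): 19 (11.2%)
(not preserved): 17 (10.1%)
(not preserved): 9 (5.3%)
(not preserved): 8 (4.7%)
(not preserved): 8 (4.7%)
(not preserved): 7 (4.1%)
(not preserved): 6 (3.6%)
(not preserved): 5 (3.0%)
(not preserved): 5 (3.0%)
(not preserved): 4 (2.4%)
Other values (57): 81 (47.9%)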

장 날 (market days)
Categorical

Distinct: 5
Distinct (%): 19.2%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B
1,6
3,8
5,10
2,7
4,9

Length

Max length: 4
Median length: 3
Mean length: 3.1923077
Min length: 3
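
The length statistics above follow directly from the value counts. As a small worked check, the sketch below rebuilds the list of string lengths from the counts in the Common Values table for 장 날 and summarises it with the standard `statistics` module; the variable names are illustrative.

```python
from statistics import mean, median

# Counts per 장 날 value, copied from the Common Values table of this report.
value_counts = {"1,6": 7, "3,8": 6, "5,10": 5, "2,7": 5, "4,9": 3}

# One length entry per observation: 21 strings of length 3 and 5 of length 4.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

Only the value "5,10" has four characters, which is why the mean sits slightly above 3 while the median stays at 3.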

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1,6
2nd row: 3,8
3rd row: 3,8
4th row: 1,6
5th row: 4,9

Common Values

Value: Count (Frequency %)
1,6: 7 (26.9%)
3,8: 6 (23.1%)
5,10: 5 (19.2%)
2,7: 5 (19.2%)
4,9: 3 (11.5%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
1,6: 7 (26.9%)
3,8: 6 (23.1%)
5,10: 5 (19.2%)
2,7: 5 (19.2%)
4,9: 3 (11.5%)

Correlations

Variable | 시군명 | 시장명 | 시장소재지 | 장 날
시군명 | 1.000 | 1.000 | 1.000 | 0.000
시장명 | 1.000 | 1.000 | 1.000 | 1.000
시장소재지 | 1.000 | 1.000 | 1.000 | 1.000
장 날 | 0.000 | 1.000 | 1.000 | 1.000

Variable | 시군명 | 장 날
시군명 | 1.000 | 0.000
장 날 | 0.000 | 1.000

Variable | 시군명 | 장 날
시군명 | 1.000 | 0.000
장 날 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

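First rows

Index | 시군명 | 시장명 | 시장소재지 | 장 날
0 | 익 산 시 | 여산시장 | 여산면 서촌2길 21 | 1,6
1 | 정 읍 시 | 신태인시장 | 신태인읍 시장2길 14 | 3,8
2 | 남 원 시 | 인월시장 | 인월면 인월로65-3 | 3,8
3 | 남 원 시 | 운봉시장 | 운봉읍 운성로 20 | 1,6
4 | 김 제 시 | 원평시장 | 금산면 원평6길 3-10 | 4,9
5 | 완 주 군 | 봉동시장 | 봉동읍 봉동동서로134-5 | 5,10
6 | 완 주 군 | 운주시장 | 운주면 운주로 134-18 | 1,6
7 | 무 주 군 | 무주시장 | 무주읍 장터로 2 | 1,6
8 | 무 주 군 | 무풍시장 | 무풍면 현내로 213 | 3,8
9 | 무 주 군 | 설천시장 | 설천면 삼도봉로 11 | 2,7

Last rows

Index | 시군명 | 시장명 | 시장소재지 | 장 날
16 | 임 실 군 | 관촌시장 | 관촌면 사선1길 70-3 | 5,10
17 | 임 실 군 | 강진시장 | 강진면 호국로 14-12 | 2,7
18 | 순 창 군 | 순창시장 | 순창읍 남계로 58 | 1,6
19 | 순 창 군 | 동계시장 | 동계면 동계로 22 | 2,7
20 | 순 창 군 | 복흥시장 | 복흥면 정산2길 2 | 3,8
21 | 고 창 군 | 흥덕시장 | 흥덕면 흥덕시장길 3 | 4,9
22 | 고 창 군 | 해리시장 | 해리면 남시길 14 | 4,9
23 | 고 창 군 | 대산시장 | 대산면 공음대산로 935 | 2,7
24 | 고 창 군 | 무장시장 | 무장면 왕제산로 725 | 5,10
25 | 고 창 군 | 상하시장 | 상하면 명동1길 3 | 1,6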