Overview

Dataset statistics

Number of variables6
Number of observations355
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.1 KiB
Average record size in memory49.4 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description경상북도 김천시 담배 소매인 지정 현황 목록으로 소매인 구분, 업소명, 업소도로명 주소, 지정일자 정보를 제공합니다.
Author경상북도 김천시
URLhttps://www.data.go.kr/data/15083398/fileData.do

Alerts

기준일자 has constant value ""Constant
연번 is highly overall correlated with 소매인구분High correlation
소매인구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:14:29.597564
Analysis finished2023-12-12 17:14:31.033393
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct355
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean178
Minimum1
Maximum355
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-13T02:14:31.111147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile18.7
Q189.5
median178
Q3266.5
95-th percentile337.3
Maximum355
Range354
Interquartile range (IQR)177

Descriptive statistics

Standard deviation102.62391
Coefficient of variation (CV)0.57653881
Kurtosis-1.2
Mean178
Median Absolute Deviation (MAD)89
Skewness0
Sum63190
Variance10531.667
MonotonicityStrictly increasing
2023-12-13T02:14:31.256872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
224 1
 
0.3%
244 1
 
0.3%
243 1
 
0.3%
242 1
 
0.3%
241 1
 
0.3%
240 1
 
0.3%
239 1
 
0.3%
238 1
 
0.3%
237 1
 
0.3%
Other values (345) 345
97.2%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
355 1
0.3%
354 1
0.3%
353 1
0.3%
352 1
0.3%
351 1
0.3%
350 1
0.3%
349 1
0.3%
348 1
0.3%
347 1
0.3%
346 1
0.3%

소매인구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
제7조의3제2항에따른경우
314 
제7조의3제3항에따른경우
41 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제7조의3제2항에따른경우
2nd row제7조의3제2항에따른경우
3rd row제7조의3제2항에따른경우
4th row제7조의3제2항에따른경우
5th row제7조의3제2항에따른경우

Common Values

ValueCountFrequency (%)
제7조의3제2항에따른경우 314
88.5%
제7조의3제3항에따른경우 41
 
11.5%

Length

2023-12-13T02:14:31.394343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:14:31.490121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제7조의3제2항에따른경우 314
88.5%
제7조의3제3항에따른경우 41
 
11.5%
Distinct352
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-13T02:14:31.753552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length8.6084507
Min length2

Characters and Unicode

Total characters3056
Distinct characters363
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique350 ?
Unique (%)98.6%

Sample

1st row새김천농업협동조합
2nd row형제슈퍼
3rd row지에스25아포중앙점
4th row봉이칼국수슈퍼
5th row대구슈퍼
ValueCountFrequency (%)
지에스(gs)25 21
 
3.9%
씨유 20
 
3.8%
세븐일레븐 15
 
2.8%
주식회사 9
 
1.7%
미니스톱 8
 
1.5%
이마트24 8
 
1.5%
김천 7
 
1.3%
전자담배 7
 
1.3%
cu 5
 
0.9%
gs25 5
 
0.9%
Other values (391) 428
80.3%
2023-12-13T02:14:32.152069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
178
 
5.8%
164
 
5.4%
153
 
5.0%
150
 
4.9%
65
 
2.1%
61
 
2.0%
58
 
1.9%
58
 
1.9%
2 52
 
1.7%
( 51
 
1.7%
Other values (353) 2066
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2526
82.7%
Space Separator 178
 
5.8%
Uppercase Letter 126
 
4.1%
Decimal Number 105
 
3.4%
Open Punctuation 51
 
1.7%
Close Punctuation 51
 
1.7%
Lowercase Letter 12
 
0.4%
Other Punctuation 6
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
 
6.5%
153
 
6.1%
150
 
5.9%
65
 
2.6%
61
 
2.4%
58
 
2.3%
58
 
2.3%
50
 
2.0%
39
 
1.5%
39
 
1.5%
Other values (316) 1689
66.9%
Uppercase Letter
ValueCountFrequency (%)
S 36
28.6%
G 34
27.0%
C 19
15.1%
U 12
 
9.5%
D 6
 
4.8%
H 4
 
3.2%
K 3
 
2.4%
W 2
 
1.6%
I 2
 
1.6%
R 1
 
0.8%
Other values (7) 7
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
25.0%
l 3
25.0%
h 1
 
8.3%
c 1
 
8.3%
f 1
 
8.3%
s 1
 
8.3%
w 1
 
8.3%
t 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 52
49.5%
5 37
35.2%
4 13
 
12.4%
1 2
 
1.9%
0 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 3
50.0%
& 2
33.3%
. 1
 
16.7%
Space Separator
ValueCountFrequency (%)
178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2526
82.7%
Common 392
 
12.8%
Latin 138
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
 
6.5%
153
 
6.1%
150
 
5.9%
65
 
2.6%
61
 
2.4%
58
 
2.3%
58
 
2.3%
50
 
2.0%
39
 
1.5%
39
 
1.5%
Other values (316) 1689
66.9%
Latin
ValueCountFrequency (%)
S 36
26.1%
G 34
24.6%
C 19
13.8%
U 12
 
8.7%
D 6
 
4.3%
H 4
 
2.9%
K 3
 
2.2%
e 3
 
2.2%
l 3
 
2.2%
W 2
 
1.4%
Other values (15) 16
11.6%
Common
ValueCountFrequency (%)
178
45.4%
2 52
 
13.3%
( 51
 
13.0%
) 51
 
13.0%
5 37
 
9.4%
4 13
 
3.3%
, 3
 
0.8%
& 2
 
0.5%
1 2
 
0.5%
0 1
 
0.3%
Other values (2) 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2526
82.7%
ASCII 530
 
17.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
178
33.6%
2 52
 
9.8%
( 51
 
9.6%
) 51
 
9.6%
5 37
 
7.0%
S 36
 
6.8%
G 34
 
6.4%
C 19
 
3.6%
4 13
 
2.5%
U 12
 
2.3%
Other values (27) 47
 
8.9%
Hangul
ValueCountFrequency (%)
164
 
6.5%
153
 
6.1%
150
 
5.9%
65
 
2.6%
61
 
2.4%
58
 
2.3%
58
 
2.3%
50
 
2.0%
39
 
1.5%
39
 
1.5%
Other values (316) 1689
66.9%
Distinct353
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-13T02:14:32.559118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length53
Mean length25.642254
Min length16

Characters and Unicode

Total characters9103
Distinct characters248
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)98.9%

Sample

1st row경상북도 김천시 어모면 어모문화3길 2
2nd row경상북도 김천시 자산로 180 (성내동)
3rd row경상북도 김천시 아포읍 한지1길 71
4th row경상북도 김천시 배나무골길 22 (양천동)
5th row경상북도 김천시 대항면 황악로 1441-1
ValueCountFrequency (%)
경상북도 355
 
17.9%
김천시 355
 
17.9%
율곡동 51
 
2.6%
1층 44
 
2.2%
부곡동 29
 
1.5%
신음동 24
 
1.2%
평화동 22
 
1.1%
영남대로 20
 
1.0%
덕곡동 19
 
1.0%
남면 18
 
0.9%
Other values (532) 1041
52.6%
2023-12-13T02:14:33.078141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1663
18.3%
408
 
4.5%
1 404
 
4.4%
394
 
4.3%
388
 
4.3%
376
 
4.1%
366
 
4.0%
362
 
4.0%
358
 
3.9%
284
 
3.1%
Other values (238) 4100
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5368
59.0%
Space Separator 1663
 
18.3%
Decimal Number 1367
 
15.0%
Close Punctuation 245
 
2.7%
Open Punctuation 245
 
2.7%
Other Punctuation 120
 
1.3%
Dash Punctuation 65
 
0.7%
Uppercase Letter 18
 
0.2%
Math Symbol 11
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
408
 
7.6%
394
 
7.3%
388
 
7.2%
376
 
7.0%
366
 
6.8%
362
 
6.7%
358
 
6.7%
284
 
5.3%
200
 
3.7%
143
 
2.7%
Other values (208) 2089
38.9%
Uppercase Letter
ValueCountFrequency (%)
L 4
22.2%
H 2
11.1%
A 2
11.1%
W 2
11.1%
S 1
 
5.6%
K 1
 
5.6%
X 1
 
5.6%
T 1
 
5.6%
G 1
 
5.6%
V 1
 
5.6%
Other values (2) 2
11.1%
Decimal Number
ValueCountFrequency (%)
1 404
29.6%
2 150
 
11.0%
0 148
 
10.8%
4 142
 
10.4%
3 134
 
9.8%
5 101
 
7.4%
7 84
 
6.1%
6 75
 
5.5%
8 68
 
5.0%
9 61
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 119
99.2%
/ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
1663
100.0%
Close Punctuation
ValueCountFrequency (%)
) 245
100.0%
Open Punctuation
ValueCountFrequency (%)
( 245
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5368
59.0%
Common 3716
40.8%
Latin 19
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
408
 
7.6%
394
 
7.3%
388
 
7.2%
376
 
7.0%
366
 
6.8%
362
 
6.7%
358
 
6.7%
284
 
5.3%
200
 
3.7%
143
 
2.7%
Other values (208) 2089
38.9%
Common
ValueCountFrequency (%)
1663
44.8%
1 404
 
10.9%
) 245
 
6.6%
( 245
 
6.6%
2 150
 
4.0%
0 148
 
4.0%
4 142
 
3.8%
3 134
 
3.6%
, 119
 
3.2%
5 101
 
2.7%
Other values (7) 365
 
9.8%
Latin
ValueCountFrequency (%)
L 4
21.1%
H 2
10.5%
A 2
10.5%
W 2
10.5%
S 1
 
5.3%
K 1
 
5.3%
X 1
 
5.3%
T 1
 
5.3%
1
 
5.3%
G 1
 
5.3%
Other values (3) 3
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5368
59.0%
ASCII 3734
41.0%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1663
44.5%
1 404
 
10.8%
) 245
 
6.6%
( 245
 
6.6%
2 150
 
4.0%
0 148
 
4.0%
4 142
 
3.8%
3 134
 
3.6%
, 119
 
3.2%
5 101
 
2.7%
Other values (19) 383
 
10.3%
Hangul
ValueCountFrequency (%)
408
 
7.6%
394
 
7.3%
388
 
7.2%
376
 
7.0%
366
 
6.8%
362
 
6.7%
358
 
6.7%
284
 
5.3%
200
 
3.7%
143
 
2.7%
Other values (208) 2089
38.9%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct320
Distinct (%)90.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum1998-03-17 00:00:00
Maximum2022-03-17 00:00:00
2023-12-13T02:14:33.239199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:14:33.377409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2022-03-28
355 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-03-28
2nd row2022-03-28
3rd row2022-03-28
4th row2022-03-28
5th row2022-03-28

Common Values

ValueCountFrequency (%)
2022-03-28 355
100.0%

Length

2023-12-13T02:14:33.546172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:14:33.657165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-03-28 355
100.0%

Interactions

2023-12-13T02:14:30.776881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:14:33.724377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소매인구분
연번1.0000.995
소매인구분0.9951.000
2023-12-13T02:14:33.812243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소매인구분
연번1.0000.927
소매인구분0.9271.000

Missing values

2023-12-13T02:14:30.890775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:14:30.988451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번소매인구분업소명업소도로명주소지정일자기준일자
01제7조의3제2항에따른경우새김천농업협동조합경상북도 김천시 어모면 어모문화3길 21998-03-172022-03-28
12제7조의3제2항에따른경우형제슈퍼경상북도 김천시 자산로 180 (성내동)2002-05-182022-03-28
23제7조의3제2항에따른경우지에스25아포중앙점경상북도 김천시 아포읍 한지1길 712006-07-262022-03-28
34제7조의3제2항에따른경우봉이칼국수슈퍼경상북도 김천시 배나무골길 22 (양천동)2006-11-302022-03-28
45제7조의3제2항에따른경우대구슈퍼경상북도 김천시 대항면 황악로 1441-12009-11-062022-03-28
56제7조의3제2항에따른경우희망대반점경상북도 김천시 구성면 남김천대로 2397-102009-11-062022-03-28
67제7조의3제2항에따른경우감문슈퍼경상북도 김천시 감문면 배시내길 40-22009-11-192022-03-28
78제7조의3제2항에따른경우선경낚시경상북도 김천시 자산로 47-1 (모암동)2009-12-082022-03-28
89제7조의3제2항에따른경우웰텍(welltech)경상북도 김천시 아랫장터5길 16 (용두동)2009-12-112022-03-28
910제7조의3제2항에따른경우교동슈퍼경상북도 김천시 교동 8112009-12-282022-03-28
연번소매인구분업소명업소도로명주소지정일자기준일자
345346제7조의3제3항에따른경우미니스톱 김천대학점경상북도 김천시 거문들1길 10-39, 1층 (삼락동)2018-02-272022-03-28
346347제7조의3제3항에따른경우24시 행복마트경상북도 김천시 혁신1로 50, 율곡스퀘어 KTX 1층 110~113호 (율곡동)2018-04-172022-03-28
347348제7조의3제3항에따른경우지에스25 김천지례점경상북도 김천시 지례면 장터길 992018-08-142022-03-28
348349제7조의3제3항에따른경우휘진마트경상북도 김천시 김천로 111, 스토리웨이 (평화동)2018-09-182022-03-28
349350제7조의3제3항에따른경우대박할인마트(평화점)경상북도 김천시 평화장미길 40 (평화동)2019-11-142022-03-28
350351제7조의3제3항에따른경우김천농협경제서부간이지점경상북도 김천시 영남대로 1315 (백옥동)2020-04-292022-03-28
351352제7조의3제3항에따른경우지에스(GS)25 김천으뜸점경상북도 김천시 시청3길 16 (신음동)2021-05-242022-03-28
352353제7조의3제3항에따른경우세븐일레븐 김천성남교점경상북도 김천시 자산로 118 (성내동)2021-07-092022-03-28
353354제7조의3제3항에따른경우씨유 김천팔달점경상북도 김천시 황산로 111 (지좌동)2021-07-232022-03-28
354355제7조의3제3항에따른경우21세기할인마트경상북도 김천시 체육공원길 17 (지좌동)2021-12-202022-03-28