Amino acid dipepetide frequency for Nanhai ghost shark arterivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.353AlaAla: 6.353 ± 2.207
1.412AlaCys: 1.412 ± 0.582
1.176AlaAsp: 1.176 ± 0.752
3.059AlaGlu: 3.059 ± 1.04
2.353AlaPhe: 2.353 ± 0.885
4.706AlaGly: 4.706 ± 1.202
1.882AlaHis: 1.882 ± 0.958
2.588AlaIle: 2.588 ± 1.064
2.353AlaLys: 2.353 ± 0.885
7.059AlaLeu: 7.059 ± 3.014
1.412AlaMet: 1.412 ± 0.718
1.412AlaAsn: 1.412 ± 1.87
4.941AlaPro: 4.941 ± 0.494
1.412AlaGln: 1.412 ± 0.56
2.353AlaArg: 2.353 ± 1.197
2.824AlaSer: 2.824 ± 1.11
4.471AlaThr: 4.471 ± 1.358
5.647AlaVal: 5.647 ± 2.191
1.647AlaTrp: 1.647 ± 1.441
2.824AlaTyr: 2.824 ± 0.932
0.0AlaXaa: 0.0 ± 0.0
Cys
0.706CysAla: 0.706 ± 0.359
0.471CysCys: 0.471 ± 0.493
0.471CysAsp: 0.471 ± 0.239
0.941CysGlu: 0.941 ± 0.85
0.471CysPhe: 0.471 ± 0.493
1.647CysGly: 1.647 ± 0.838
0.235CysHis: 0.235 ± 0.12
0.941CysIle: 0.941 ± 0.479
0.471CysLys: 0.471 ± 0.563
3.529CysLeu: 3.529 ± 1.53
0.235CysMet: 0.235 ± 0.12
0.941CysAsn: 0.941 ± 0.454
0.941CysPro: 0.941 ± 1.126
0.706CysGln: 0.706 ± 0.359
2.824CysArg: 2.824 ± 1.168
0.941CysSer: 0.941 ± 0.445
1.647CysThr: 1.647 ± 1.052
2.118CysVal: 2.118 ± 0.645
0.471CysTrp: 0.471 ± 0.239
0.471CysTyr: 0.471 ± 0.239
0.0CysXaa: 0.0 ± 0.0
Asp
1.882AspAla: 1.882 ± 0.737
1.412AspCys: 1.412 ± 0.718
2.118AspAsp: 2.118 ± 1.928
2.353AspGlu: 2.353 ± 1.197
0.471AspPhe: 0.471 ± 1.059
1.412AspGly: 1.412 ± 0.917
1.412AspHis: 1.412 ± 0.917
2.118AspIle: 2.118 ± 0.634
1.176AspLys: 1.176 ± 0.58
5.647AspLeu: 5.647 ± 3.016
0.941AspMet: 0.941 ± 0.445
1.412AspAsn: 1.412 ± 0.533
2.118AspPro: 2.118 ± 0.927
0.706AspGln: 0.706 ± 0.359
0.235AspArg: 0.235 ± 0.12
2.824AspSer: 2.824 ± 1.353
1.882AspThr: 1.882 ± 0.551
4.0AspVal: 4.0 ± 1.489
0.941AspTrp: 0.941 ± 0.445
1.647AspTyr: 1.647 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
2.353GluAla: 2.353 ± 0.934
1.176GluCys: 1.176 ± 0.426
1.882GluAsp: 1.882 ± 1.753
3.529GluGlu: 3.529 ± 0.802
1.882GluPhe: 1.882 ± 0.551
3.294GluGly: 3.294 ± 0.775
0.471GluHis: 0.471 ± 0.911
3.059GluIle: 3.059 ± 3.106
4.235GluLys: 4.235 ± 1.508
4.471GluLeu: 4.471 ± 1.668
1.412GluMet: 1.412 ± 0.583
0.706GluAsn: 0.706 ± 0.459
2.118GluPro: 2.118 ± 0.477
2.353GluGln: 2.353 ± 0.506
1.882GluArg: 1.882 ± 0.958
4.0GluSer: 4.0 ± 0.844
4.0GluThr: 4.0 ± 1.661
4.235GluVal: 4.235 ± 1.334
0.471GluTrp: 0.471 ± 0.239
0.941GluTyr: 0.941 ± 0.85
0.0GluXaa: 0.0 ± 0.0
Phe
2.353PheAla: 2.353 ± 0.713
0.471PheCys: 0.471 ± 0.239
1.882PheAsp: 1.882 ± 1.401
2.353PheGlu: 2.353 ± 0.463
1.412PhePhe: 1.412 ± 0.891
2.824PheGly: 2.824 ± 0.524
0.471PheHis: 0.471 ± 0.911
1.647PheIle: 1.647 ± 1.303
1.412PheLys: 1.412 ± 1.369
2.824PheLeu: 2.824 ± 0.833
0.706PheMet: 0.706 ± 0.359
0.941PheAsn: 0.941 ± 0.818
2.824PhePro: 2.824 ± 1.772
1.647PheGln: 1.647 ± 0.524
0.706PheArg: 0.706 ± 1.04
2.588PheSer: 2.588 ± 1.641
2.118PheThr: 2.118 ± 1.197
2.588PheVal: 2.588 ± 1.133
0.706PheTrp: 0.706 ± 0.493
1.647PheTyr: 1.647 ± 1.684
0.0PheXaa: 0.0 ± 0.0
Gly
4.0GlyAla: 4.0 ± 1.102
1.412GlyCys: 1.412 ± 0.662
4.0GlyAsp: 4.0 ± 1.472
3.765GlyGlu: 3.765 ± 1.603
2.588GlyPhe: 2.588 ± 0.805
5.412GlyGly: 5.412 ± 3.228
0.941GlyHis: 0.941 ± 0.479
4.235GlyIle: 4.235 ± 0.542
3.529GlyLys: 3.529 ± 1.19
9.176GlyLeu: 9.176 ± 1.521
2.118GlyMet: 2.118 ± 0.645
2.588GlyAsn: 2.588 ± 0.805
7.529GlyPro: 7.529 ± 1.101
1.882GlyGln: 1.882 ± 0.958
4.0GlyArg: 4.0 ± 3.813
3.765GlySer: 3.765 ± 1.114
4.706GlyThr: 4.706 ± 2.009
6.353GlyVal: 6.353 ± 1.939
0.941GlyTrp: 0.941 ± 0.623
2.588GlyTyr: 2.588 ± 2.72
0.0GlyXaa: 0.0 ± 0.0
His
1.412HisAla: 1.412 ± 0.718
1.412HisCys: 1.412 ± 0.582
0.706HisAsp: 0.706 ± 0.684
0.706HisGlu: 0.706 ± 0.359
0.471HisPhe: 0.471 ± 0.239
0.706HisGly: 0.706 ± 0.359
0.706HisHis: 0.706 ± 0.459
0.706HisIle: 0.706 ± 0.359
0.706HisLys: 0.706 ± 0.857
3.059HisLeu: 3.059 ± 0.703
0.706HisMet: 0.706 ± 1.04
1.882HisAsn: 1.882 ± 2.423
1.412HisPro: 1.412 ± 0.917
1.412HisGln: 1.412 ± 0.533
0.941HisArg: 0.941 ± 0.454
2.118HisSer: 2.118 ± 1.078
1.882HisThr: 1.882 ± 0.958
1.647HisVal: 1.647 ± 0.838
0.471HisTrp: 0.471 ± 0.239
0.706HisTyr: 0.706 ± 0.459
0.0HisXaa: 0.0 ± 0.0
Ile
3.059IleAla: 3.059 ± 1.556
1.176IleCys: 1.176 ± 0.426
1.412IleAsp: 1.412 ± 0.718
0.941IleGlu: 0.941 ± 0.479
1.176IlePhe: 1.176 ± 0.945
2.824IleGly: 2.824 ± 0.62
0.941IleHis: 0.941 ± 0.479
1.412IleIle: 1.412 ± 1.777
1.647IleLys: 1.647 ± 0.931
5.412IleLeu: 5.412 ± 2.297
1.176IleMet: 1.176 ± 0.599
1.412IleAsn: 1.412 ± 1.164
4.471IlePro: 4.471 ± 0.955
1.882IleGln: 1.882 ± 0.988
3.059IleArg: 3.059 ± 1.301
2.353IleSer: 2.353 ± 0.727
5.176IleThr: 5.176 ± 1.716
3.294IleVal: 3.294 ± 1.181
1.647IleTrp: 1.647 ± 1.441
0.941IleTyr: 0.941 ± 0.454
0.0IleXaa: 0.0 ± 0.0
Lys
1.882LysAla: 1.882 ± 0.948
0.235LysCys: 0.235 ± 0.12
1.412LysAsp: 1.412 ± 0.44
3.059LysGlu: 3.059 ± 0.59
1.176LysPhe: 1.176 ± 0.752
4.0LysGly: 4.0 ± 2.035
1.412LysHis: 1.412 ± 0.582
2.824LysIle: 2.824 ± 1.065
1.412LysLys: 1.412 ± 0.718
5.882LysLeu: 5.882 ± 1.362
0.235LysMet: 0.235 ± 0.12
1.412LysAsn: 1.412 ± 0.56
3.059LysPro: 3.059 ± 0.977
1.882LysGln: 1.882 ± 0.948
1.412LysArg: 1.412 ± 0.56
2.118LysSer: 2.118 ± 0.678
3.294LysThr: 3.294 ± 1.129
2.588LysVal: 2.588 ± 1.317
0.706LysTrp: 0.706 ± 0.857
0.941LysTyr: 0.941 ± 0.623
0.0LysXaa: 0.0 ± 0.0
Leu
8.235LeuAla: 8.235 ± 1.368
2.118LeuCys: 2.118 ± 0.634
4.471LeuAsp: 4.471 ± 1.884
6.118LeuGlu: 6.118 ± 1.986
4.0LeuPhe: 4.0 ± 0.572
11.765LeuGly: 11.765 ± 1.912
2.588LeuHis: 2.588 ± 1.317
3.529LeuIle: 3.529 ± 0.933
4.706LeuLys: 4.706 ± 1.289
15.294LeuLeu: 15.294 ± 1.991
3.529LeuMet: 3.529 ± 0.693
2.353LeuAsn: 2.353 ± 0.727
9.647LeuPro: 9.647 ± 2.607
3.529LeuGln: 3.529 ± 1.441
4.706LeuArg: 4.706 ± 1.837
8.706LeuSer: 8.706 ± 1.677
8.471LeuThr: 8.471 ± 1.282
12.471LeuVal: 12.471 ± 1.46
1.176LeuTrp: 1.176 ± 0.945
2.824LeuTyr: 2.824 ± 1.161
0.0LeuXaa: 0.0 ± 0.0
Met
2.353MetAla: 2.353 ± 0.506
0.706MetCys: 0.706 ± 1.04
0.941MetAsp: 0.941 ± 0.454
0.941MetGlu: 0.941 ± 0.445
0.941MetPhe: 0.941 ± 1.077
0.941MetGly: 0.941 ± 0.479
0.706MetHis: 0.706 ± 0.857
0.706MetIle: 0.706 ± 0.684
1.176MetLys: 1.176 ± 0.599
1.882MetLeu: 1.882 ± 0.69
0.941MetMet: 0.941 ± 0.427
0.0MetAsn: 0.0 ± 0.0
0.941MetPro: 0.941 ± 1.303
0.471MetGln: 0.471 ± 0.239
0.706MetArg: 0.706 ± 0.359
2.353MetSer: 2.353 ± 0.637
2.588MetThr: 2.588 ± 1.868
3.294MetVal: 3.294 ± 1.129
0.471MetTrp: 0.471 ± 0.239
1.176MetTyr: 1.176 ± 0.599
0.0MetXaa: 0.0 ± 0.0
Asn
1.176AsnAla: 1.176 ± 0.794
0.471AsnCys: 0.471 ± 1.104
0.706AsnAsp: 0.706 ± 1.296
2.588AsnGlu: 2.588 ± 1.387
0.941AsnPhe: 0.941 ± 0.818
1.412AsnGly: 1.412 ± 1.091
0.235AsnHis: 0.235 ± 0.12
0.941AsnIle: 0.941 ± 0.479
0.941AsnLys: 0.941 ± 0.445
3.059AsnLeu: 3.059 ± 2.179
0.706AsnMet: 0.706 ± 2.043
1.412AsnAsn: 1.412 ± 2.597
1.882AsnPro: 1.882 ± 0.49
1.412AsnGln: 1.412 ± 0.886
1.176AsnArg: 1.176 ± 0.794
3.765AsnSer: 3.765 ± 0.589
1.412AsnThr: 1.412 ± 0.718
2.118AsnVal: 2.118 ± 1.078
1.176AsnTrp: 1.176 ± 1.685
1.176AsnTyr: 1.176 ± 0.48
0.0AsnXaa: 0.0 ± 0.0
Pro
3.529ProAla: 3.529 ± 0.753
1.176ProCys: 1.176 ± 0.794
1.882ProAsp: 1.882 ± 0.983
4.706ProGlu: 4.706 ± 0.959
3.765ProPhe: 3.765 ± 1.807
3.765ProGly: 3.765 ± 1.629
1.647ProHis: 1.647 ± 0.838
3.765ProIle: 3.765 ± 1.546
3.765ProLys: 3.765 ± 1.108
7.765ProLeu: 7.765 ± 2.192
0.471ProMet: 0.471 ± 0.239
3.059ProAsn: 3.059 ± 1.742
5.176ProPro: 5.176 ± 4.266
4.0ProGln: 4.0 ± 0.726
3.765ProArg: 3.765 ± 0.832
3.765ProSer: 3.765 ± 2.777
4.706ProThr: 4.706 ± 0.636
6.118ProVal: 6.118 ± 1.849
1.176ProTrp: 1.176 ± 0.981
2.118ProTyr: 2.118 ± 0.784
0.0ProXaa: 0.0 ± 0.0
Gln
3.529GlnAla: 3.529 ± 0.856
0.235GlnCys: 0.235 ± 0.12
1.412GlnAsp: 1.412 ± 0.718
1.647GlnGlu: 1.647 ± 0.565
1.647GlnPhe: 1.647 ± 1.303
2.824GlnGly: 2.824 ± 1.097
1.176GlnHis: 1.176 ± 0.48
0.706GlnIle: 0.706 ± 0.359
0.706GlnLys: 0.706 ± 0.359
5.882GlnLeu: 5.882 ± 0.945
0.471GlnMet: 0.471 ± 0.239
0.706GlnAsn: 0.706 ± 0.359
1.647GlnPro: 1.647 ± 0.905
1.176GlnGln: 1.176 ± 2.814
3.059GlnArg: 3.059 ± 1.865
2.118GlnSer: 2.118 ± 0.891
3.059GlnThr: 3.059 ± 0.729
3.294GlnVal: 3.294 ± 1.06
1.412GlnTrp: 1.412 ± 0.718
0.235GlnTyr: 0.235 ± 0.12
0.0GlnXaa: 0.0 ± 0.0
Arg
2.588ArgAla: 2.588 ± 0.601
0.941ArgCys: 0.941 ± 0.445
1.412ArgAsp: 1.412 ± 0.533
2.588ArgGlu: 2.588 ± 0.48
2.118ArgPhe: 2.118 ± 0.645
4.471ArgGly: 4.471 ± 5.098
1.882ArgHis: 1.882 ± 0.519
2.588ArgIle: 2.588 ± 0.48
1.647ArgLys: 1.647 ± 0.605
4.706ArgLeu: 4.706 ± 1.083
1.412ArgMet: 1.412 ± 0.54
1.412ArgAsn: 1.412 ± 0.917
2.353ArgPro: 2.353 ± 1.354
1.412ArgGln: 1.412 ± 0.56
2.353ArgArg: 2.353 ± 2.426
3.529ArgSer: 3.529 ± 1.026
3.765ArgThr: 3.765 ± 1.078
4.706ArgVal: 4.706 ± 1.419
1.882ArgTrp: 1.882 ± 1.246
0.941ArgTyr: 0.941 ± 0.445
0.0ArgXaa: 0.0 ± 0.0
Ser
2.824SerAla: 2.824 ± 1.065
1.412SerCys: 1.412 ± 1.369
2.353SerAsp: 2.353 ± 0.506
3.294SerGlu: 3.294 ± 2.11
2.588SerPhe: 2.588 ± 1.673
6.588SerGly: 6.588 ± 1.44
2.118SerHis: 2.118 ± 0.483
2.824SerIle: 2.824 ± 1.119
2.353SerLys: 2.353 ± 1.197
11.765SerLeu: 11.765 ± 3.723
2.118SerMet: 2.118 ± 0.634
1.176SerAsn: 1.176 ± 2.323
4.706SerPro: 4.706 ± 1.945
2.824SerGln: 2.824 ± 0.524
4.235SerArg: 4.235 ± 1.8
6.588SerSer: 6.588 ± 2.233
4.941SerThr: 4.941 ± 2.573
4.0SerVal: 4.0 ± 1.661
1.882SerTrp: 1.882 ± 1.52
1.647SerTyr: 1.647 ± 0.838
0.0SerXaa: 0.0 ± 0.0
Thr
4.706ThrAla: 4.706 ± 2.009
1.882ThrCys: 1.882 ± 1.051
1.882ThrAsp: 1.882 ± 0.69
2.353ThrGlu: 2.353 ± 1.169
1.647ThrPhe: 1.647 ± 0.484
4.471ThrGly: 4.471 ± 1.178
2.588ThrHis: 2.588 ± 1.235
4.0ThrIle: 4.0 ± 1.153
3.529ThrLys: 3.529 ± 1.182
8.471ThrLeu: 8.471 ± 1.499
1.412ThrMet: 1.412 ± 1.369
2.824ThrAsn: 2.824 ± 1.362
7.059ThrPro: 7.059 ± 1.607
3.294ThrGln: 3.294 ± 1.078
3.765ThrArg: 3.765 ± 1.501
7.059ThrSer: 7.059 ± 1.658
6.353ThrThr: 6.353 ± 1.969
4.706ThrVal: 4.706 ± 2.319
2.118ThrTrp: 2.118 ± 1.078
1.647ThrTyr: 1.647 ± 0.583
0.0ThrXaa: 0.0 ± 0.0
Val
6.353ValAla: 6.353 ± 1.969
1.882ValCys: 1.882 ± 0.551
4.471ValAsp: 4.471 ± 1.619
1.647ValGlu: 1.647 ± 0.484
3.529ValPhe: 3.529 ± 1.739
8.941ValGly: 8.941 ± 1.3
1.647ValHis: 1.647 ± 0.605
4.941ValIle: 4.941 ± 1.306
2.824ValLys: 2.824 ± 1.868
8.706ValLeu: 8.706 ± 2.6
2.588ValMet: 2.588 ± 1.129
1.882ValAsn: 1.882 ± 0.958
4.0ValPro: 4.0 ± 2.035
3.529ValGln: 3.529 ± 1.032
5.176ValArg: 5.176 ± 0.93
5.647ValSer: 5.647 ± 1.745
6.824ValThr: 6.824 ± 0.858
7.765ValVal: 7.765 ± 1.651
0.706ValTrp: 0.706 ± 0.684
2.118ValTyr: 2.118 ± 0.785
0.0ValXaa: 0.0 ± 0.0
Trp
0.706TrpAla: 0.706 ± 0.684
0.471TrpCys: 0.471 ± 0.239
0.706TrpAsp: 0.706 ± 0.684
0.941TrpGlu: 0.941 ± 0.479
0.471TrpPhe: 0.471 ± 0.76
0.941TrpGly: 0.941 ± 0.623
0.0TrpHis: 0.0 ± 0.0
0.471TrpIle: 0.471 ± 0.239
1.412TrpLys: 1.412 ± 0.56
3.529TrpLeu: 3.529 ± 1.441
0.235TrpMet: 0.235 ± 0.12
0.706TrpAsn: 0.706 ± 0.684
0.706TrpPro: 0.706 ± 0.684
0.706TrpGln: 0.706 ± 0.493
1.176TrpArg: 1.176 ± 0.794
2.353TrpSer: 2.353 ± 0.701
2.118TrpThr: 2.118 ± 1.121
1.647TrpVal: 1.647 ± 1.684
0.0TrpTrp: 0.0 ± 0.0
1.176TrpTyr: 1.176 ± 2.323
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.882TyrAla: 1.882 ± 0.958
0.706TyrCys: 0.706 ± 0.359
1.412TyrAsp: 1.412 ± 0.44
0.941TyrGlu: 0.941 ± 0.445
0.471TyrPhe: 0.471 ± 0.493
2.588TyrGly: 2.588 ± 1.45
0.706TyrHis: 0.706 ± 0.359
1.412TyrIle: 1.412 ± 0.891
0.941TyrLys: 0.941 ± 0.445
2.118TyrLeu: 2.118 ± 1.276
1.176TyrMet: 1.176 ± 0.48
0.706TyrAsn: 0.706 ± 1.296
2.588TyrPro: 2.588 ± 1.004
0.706TyrGln: 0.706 ± 0.459
1.176TyrArg: 1.176 ± 0.599
2.824TyrSer: 2.824 ± 1.097
2.353TyrThr: 2.353 ± 2.009
2.588TyrVal: 2.588 ± 1.567
0.471TyrTrp: 0.471 ± 0.563
1.647TyrTyr: 1.647 ± 0.809
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (4251 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski