Amino acid dipepetide frequency for Beihai tombus-like virus 19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.765AlaAla: 11.765 ± 3.825
0.471AlaCys: 0.471 ± 0.381
3.765AlaAsp: 3.765 ± 1.296
3.765AlaGlu: 3.765 ± 0.762
2.824AlaPhe: 2.824 ± 0.546
6.588AlaGly: 6.588 ± 1.228
1.882AlaHis: 1.882 ± 0.637
3.294AlaIle: 3.294 ± 0.841
3.294AlaLys: 3.294 ± 1.321
10.353AlaLeu: 10.353 ± 2.036
1.882AlaMet: 1.882 ± 1.061
3.765AlaAsn: 3.765 ± 0.451
7.059AlaPro: 7.059 ± 3.543
3.294AlaGln: 3.294 ± 1.626
10.824AlaArg: 10.824 ± 2.686
6.118AlaSer: 6.118 ± 1.894
4.235AlaThr: 4.235 ± 1.85
10.353AlaVal: 10.353 ± 1.253
1.412AlaTrp: 1.412 ± 0.491
1.412AlaTyr: 1.412 ± 1.143
0.0AlaXaa: 0.0 ± 0.0
Cys
0.471CysAla: 0.471 ± 0.62
0.471CysCys: 0.471 ± 0.381
1.412CysAsp: 1.412 ± 0.634
0.471CysGlu: 0.471 ± 0.303
0.941CysPhe: 0.941 ± 0.535
0.471CysGly: 0.471 ± 0.381
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.471CysLys: 0.471 ± 0.303
4.235CysLeu: 4.235 ± 2.365
0.941CysMet: 0.941 ± 0.319
0.0CysAsn: 0.0 ± 0.0
1.412CysPro: 1.412 ± 0.634
0.941CysGln: 0.941 ± 0.584
2.353CysArg: 2.353 ± 1.907
0.0CysSer: 0.0 ± 0.0
0.471CysThr: 0.471 ± 0.62
0.471CysVal: 0.471 ± 0.303
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.176AspAla: 5.176 ± 0.728
0.941AspCys: 0.941 ± 0.319
3.294AspAsp: 3.294 ± 0.925
3.294AspGlu: 3.294 ± 0.925
3.294AspPhe: 3.294 ± 0.482
3.294AspGly: 3.294 ± 0.71
2.353AspHis: 2.353 ± 0.434
2.353AspIle: 2.353 ± 1.034
2.353AspLys: 2.353 ± 0.434
6.588AspLeu: 6.588 ± 1.414
0.941AspMet: 0.941 ± 0.584
1.412AspAsn: 1.412 ± 0.491
3.294AspPro: 3.294 ± 1.627
2.353AspGln: 2.353 ± 0.896
3.765AspArg: 3.765 ± 1.008
2.824AspSer: 2.824 ± 1.751
1.412AspThr: 1.412 ± 0.326
5.647AspVal: 5.647 ± 0.529
2.824AspTrp: 2.824 ± 0.982
1.882AspTyr: 1.882 ± 1.07
0.0AspXaa: 0.0 ± 0.0
Glu
3.765GluAla: 3.765 ± 1.274
0.941GluCys: 0.941 ± 0.763
6.118GluAsp: 6.118 ± 1.667
2.353GluGlu: 2.353 ± 0.928
1.412GluPhe: 1.412 ± 0.767
3.294GluGly: 3.294 ± 1.072
2.824GluHis: 2.824 ± 1.753
2.353GluIle: 2.353 ± 0.177
2.353GluLys: 2.353 ± 0.177
5.647GluLeu: 5.647 ± 1.079
0.471GluMet: 0.471 ± 0.381
0.941GluAsn: 0.941 ± 0.319
2.353GluPro: 2.353 ± 0.77
0.471GluGln: 0.471 ± 0.303
2.353GluArg: 2.353 ± 0.928
1.412GluSer: 1.412 ± 0.326
1.412GluThr: 1.412 ± 0.326
2.824GluVal: 2.824 ± 0.956
0.0GluTrp: 0.0 ± 0.0
1.412GluTyr: 1.412 ± 0.609
0.0GluXaa: 0.0 ± 0.0
Phe
2.353PheAla: 2.353 ± 0.8
0.941PheCys: 0.941 ± 0.763
0.941PheAsp: 0.941 ± 0.763
0.941PheGlu: 0.941 ± 0.763
0.471PhePhe: 0.471 ± 0.62
0.471PheGly: 0.471 ± 0.303
1.412PheHis: 1.412 ± 0.326
0.471PheIle: 0.471 ± 0.62
0.471PheLys: 0.471 ± 0.381
3.765PheLeu: 3.765 ± 1.305
0.471PheMet: 0.471 ± 0.381
1.412PheAsn: 1.412 ± 1.861
1.412PhePro: 1.412 ± 0.491
0.941PheGln: 0.941 ± 0.319
2.353PheArg: 2.353 ± 0.77
4.706PheSer: 4.706 ± 0.354
3.294PheThr: 3.294 ± 1.321
0.941PheVal: 0.941 ± 0.535
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
10.824GlyAla: 10.824 ± 3.011
0.941GlyCys: 0.941 ± 0.606
4.235GlyAsp: 4.235 ± 0.7
3.294GlyGlu: 3.294 ± 0.456
0.941GlyPhe: 0.941 ± 0.584
7.529GlyGly: 7.529 ± 3.766
1.882GlyHis: 1.882 ± 0.637
2.353GlyIle: 2.353 ± 0.788
1.412GlyLys: 1.412 ± 1.118
4.706GlyLeu: 4.706 ± 1.898
1.882GlyMet: 1.882 ± 1.168
1.412GlyAsn: 1.412 ± 1.118
6.118GlyPro: 6.118 ± 0.618
3.294GlyGln: 3.294 ± 1.002
4.706GlyArg: 4.706 ± 0.918
5.176GlySer: 5.176 ± 1.137
6.588GlyThr: 6.588 ± 1.862
3.294GlyVal: 3.294 ± 1.627
1.412GlyTrp: 1.412 ± 1.118
2.353GlyTyr: 2.353 ± 0.8
0.0GlyXaa: 0.0 ± 0.0
His
3.765HisAla: 3.765 ± 1.274
0.941HisCys: 0.941 ± 0.584
0.471HisAsp: 0.471 ± 0.303
1.412HisGlu: 1.412 ± 0.491
0.0HisPhe: 0.0 ± 0.0
0.941HisGly: 0.941 ± 0.535
0.941HisHis: 0.941 ± 0.319
0.471HisIle: 0.471 ± 0.303
0.0HisLys: 0.0 ± 0.0
2.353HisLeu: 2.353 ± 0.896
0.471HisMet: 0.471 ± 0.303
0.941HisAsn: 0.941 ± 0.535
2.824HisPro: 2.824 ± 0.982
0.941HisGln: 0.941 ± 0.584
3.765HisArg: 3.765 ± 1.252
0.941HisSer: 0.941 ± 0.319
2.824HisThr: 2.824 ± 1.746
0.471HisVal: 0.471 ± 0.303
0.471HisTrp: 0.471 ± 0.62
0.471HisTyr: 0.471 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
2.824IleAla: 2.824 ± 0.982
0.471IleCys: 0.471 ± 0.381
1.882IleAsp: 1.882 ± 0.939
2.353IleGlu: 2.353 ± 1.37
0.0IlePhe: 0.0 ± 0.0
3.294IleGly: 3.294 ± 1.595
0.0IleHis: 0.0 ± 0.0
1.412IleIle: 1.412 ± 0.326
0.471IleLys: 0.471 ± 0.303
2.353IleLeu: 2.353 ± 0.788
0.941IleMet: 0.941 ± 0.763
0.471IleAsn: 0.471 ± 0.303
0.941IlePro: 0.941 ± 0.535
0.941IleGln: 0.941 ± 0.606
1.882IleArg: 1.882 ± 0.939
3.765IleSer: 3.765 ± 2.029
1.882IleThr: 1.882 ± 0.799
3.294IleVal: 3.294 ± 0.841
0.0IleTrp: 0.0 ± 0.0
0.941IleTyr: 0.941 ± 0.584
0.0IleXaa: 0.0 ± 0.0
Lys
3.765LysAla: 3.765 ± 0.451
0.471LysCys: 0.471 ± 0.62
0.941LysAsp: 0.941 ± 0.763
0.0LysGlu: 0.0 ± 0.0
0.471LysPhe: 0.471 ± 0.381
3.294LysGly: 3.294 ± 1.072
0.0LysHis: 0.0 ± 0.0
1.412LysIle: 1.412 ± 0.326
2.824LysLys: 2.824 ± 0.71
2.824LysLeu: 2.824 ± 0.546
0.471LysMet: 0.471 ± 0.381
0.0LysAsn: 0.0 ± 0.0
5.647LysPro: 5.647 ± 2.117
1.412LysGln: 1.412 ± 0.609
5.176LysArg: 5.176 ± 1.409
0.471LysSer: 0.471 ± 0.303
2.353LysThr: 2.353 ± 0.8
2.353LysVal: 2.353 ± 0.177
0.471LysTrp: 0.471 ± 0.303
1.412LysTyr: 1.412 ± 0.609
0.0LysXaa: 0.0 ± 0.0
Leu
11.765LeuAla: 11.765 ± 0.339
1.882LeuCys: 1.882 ± 0.637
5.647LeuAsp: 5.647 ± 0.336
5.647LeuGlu: 5.647 ± 0.529
0.471LeuPhe: 0.471 ± 0.381
7.059LeuGly: 7.059 ± 0.083
2.353LeuHis: 2.353 ± 0.434
3.294LeuIle: 3.294 ± 1.232
1.882LeuLys: 1.882 ± 0.236
9.882LeuLeu: 9.882 ± 2.861
1.412LeuMet: 1.412 ± 0.491
0.471LeuAsn: 0.471 ± 0.381
8.0LeuPro: 8.0 ± 3.359
3.294LeuGln: 3.294 ± 1.261
8.471LeuArg: 8.471 ± 3.018
9.412LeuSer: 9.412 ± 0.709
6.588LeuThr: 6.588 ± 1.412
7.059LeuVal: 7.059 ± 1.142
0.941LeuTrp: 0.941 ± 0.319
0.941LeuTyr: 0.941 ± 0.606
0.0LeuXaa: 0.0 ± 0.0
Met
2.353MetAla: 2.353 ± 1.557
0.0MetCys: 0.0 ± 0.0
0.471MetAsp: 0.471 ± 0.381
0.941MetGlu: 0.941 ± 0.319
0.941MetPhe: 0.941 ± 0.319
1.412MetGly: 1.412 ± 0.326
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.941MetLys: 0.941 ± 0.763
2.353MetLeu: 2.353 ± 1.557
0.471MetMet: 0.471 ± 0.303
0.941MetAsn: 0.941 ± 0.584
0.941MetPro: 0.941 ± 0.584
0.0MetGln: 0.0 ± 0.0
0.471MetArg: 0.471 ± 0.381
1.882MetSer: 1.882 ± 0.637
1.412MetThr: 1.412 ± 0.908
0.941MetVal: 0.941 ± 0.319
1.412MetTrp: 1.412 ± 0.609
0.471MetTyr: 0.471 ± 0.381
0.0MetXaa: 0.0 ± 0.0
Asn
0.941AsnAla: 0.941 ± 0.535
0.0AsnCys: 0.0 ± 0.0
1.882AsnAsp: 1.882 ± 0.236
0.471AsnGlu: 0.471 ± 0.303
0.941AsnPhe: 0.941 ± 0.535
2.824AsnGly: 2.824 ± 1.008
0.0AsnHis: 0.0 ± 0.0
0.941AsnIle: 0.941 ± 0.584
0.0AsnLys: 0.0 ± 0.0
3.294AsnLeu: 3.294 ± 1.232
0.471AsnMet: 0.471 ± 0.454
0.941AsnAsn: 0.941 ± 0.535
0.471AsnPro: 0.471 ± 0.303
1.882AsnGln: 1.882 ± 1.743
1.882AsnArg: 1.882 ± 0.637
0.941AsnSer: 0.941 ± 0.763
0.941AsnThr: 0.941 ± 0.584
0.0AsnVal: 0.0 ± 0.0
1.412AsnTrp: 1.412 ± 1.861
0.471AsnTyr: 0.471 ± 0.303
0.0AsnXaa: 0.0 ± 0.0
Pro
8.471ProAla: 8.471 ± 2.13
1.412ProCys: 1.412 ± 1.144
4.706ProAsp: 4.706 ± 0.994
1.882ProGlu: 1.882 ± 0.997
1.882ProPhe: 1.882 ± 0.637
5.176ProGly: 5.176 ± 0.685
2.353ProHis: 2.353 ± 0.788
0.941ProIle: 0.941 ± 0.319
5.647ProLys: 5.647 ± 0.336
3.765ProLeu: 3.765 ± 1.274
1.412ProMet: 1.412 ± 0.609
0.941ProAsn: 0.941 ± 0.535
6.118ProPro: 6.118 ± 1.219
1.412ProGln: 1.412 ± 0.491
7.059ProArg: 7.059 ± 1.956
5.647ProSer: 5.647 ± 0.492
6.118ProThr: 6.118 ± 2.026
6.588ProVal: 6.588 ± 1.669
0.941ProTrp: 0.941 ± 0.763
1.412ProTyr: 1.412 ± 0.634
0.0ProXaa: 0.0 ± 0.0
Gln
4.235GlnAla: 4.235 ± 1.717
0.941GlnCys: 0.941 ± 0.584
1.882GlnAsp: 1.882 ± 0.637
2.824GlnGlu: 2.824 ± 0.956
0.941GlnPhe: 0.941 ± 0.584
1.882GlnGly: 1.882 ± 1.07
0.471GlnHis: 0.471 ± 0.62
1.412GlnIle: 1.412 ± 0.767
0.941GlnLys: 0.941 ± 1.24
3.765GlnLeu: 3.765 ± 0.472
0.471GlnMet: 0.471 ± 0.303
1.412GlnAsn: 1.412 ± 1.118
2.824GlnPro: 2.824 ± 0.71
0.471GlnGln: 0.471 ± 0.62
3.294GlnArg: 3.294 ± 1.072
4.235GlnSer: 4.235 ± 1.228
1.882GlnThr: 1.882 ± 0.799
1.412GlnVal: 1.412 ± 1.118
0.941GlnTrp: 0.941 ± 0.535
0.941GlnTyr: 0.941 ± 0.319
0.0GlnXaa: 0.0 ± 0.0
Arg
8.941ArgAla: 8.941 ± 2.555
1.412ArgCys: 1.412 ± 0.767
5.647ArgAsp: 5.647 ± 0.854
3.765ArgGlu: 3.765 ± 1.89
4.235ArgPhe: 4.235 ± 0.169
7.529ArgGly: 7.529 ± 1.886
1.412ArgHis: 1.412 ± 0.491
2.353ArgIle: 2.353 ± 1.105
2.824ArgLys: 2.824 ± 0.546
8.471ArgLeu: 8.471 ± 2.232
1.412ArgMet: 1.412 ± 0.602
1.882ArgAsn: 1.882 ± 0.997
6.588ArgPro: 6.588 ± 0.913
2.353ArgGln: 2.353 ± 0.928
8.0ArgArg: 8.0 ± 2.64
6.588ArgSer: 6.588 ± 2.079
4.706ArgThr: 4.706 ± 1.516
5.647ArgVal: 5.647 ± 1.148
3.294ArgTrp: 3.294 ± 1.388
2.353ArgTyr: 2.353 ± 1.034
0.0ArgXaa: 0.0 ± 0.0
Ser
4.235SerAla: 4.235 ± 1.85
0.0SerCys: 0.0 ± 0.0
5.176SerAsp: 5.176 ± 1.075
1.882SerGlu: 1.882 ± 0.751
0.941SerPhe: 0.941 ± 0.535
10.824SerGly: 10.824 ± 3.011
0.941SerHis: 0.941 ± 0.319
1.882SerIle: 1.882 ± 0.637
2.824SerLys: 2.824 ± 0.653
3.294SerLeu: 3.294 ± 0.399
2.824SerMet: 2.824 ± 1.604
0.941SerAsn: 0.941 ± 0.319
5.176SerPro: 5.176 ± 0.717
4.235SerGln: 4.235 ± 1.003
10.353SerArg: 10.353 ± 1.552
2.824SerSer: 2.824 ± 2.285
3.765SerThr: 3.765 ± 1.305
6.588SerVal: 6.588 ± 1.054
1.882SerTrp: 1.882 ± 0.637
2.353SerTyr: 2.353 ± 0.928
0.0SerXaa: 0.0 ± 0.0
Thr
5.647ThrAla: 5.647 ± 0.831
0.941ThrCys: 0.941 ± 0.763
5.176ThrAsp: 5.176 ± 1.321
1.412ThrGlu: 1.412 ± 0.609
1.412ThrPhe: 1.412 ± 0.634
3.294ThrGly: 3.294 ± 0.482
1.412ThrHis: 1.412 ± 0.908
2.824ThrIle: 2.824 ± 0.653
2.353ThrLys: 2.353 ± 0.177
5.647ThrLeu: 5.647 ± 1.227
0.941ThrMet: 0.941 ± 1.24
0.471ThrAsn: 0.471 ± 0.62
4.235ThrPro: 4.235 ± 1.003
1.412ThrGln: 1.412 ± 0.491
5.647ThrArg: 5.647 ± 0.707
5.647ThrSer: 5.647 ± 0.492
7.059ThrThr: 7.059 ± 3.264
6.118ThrVal: 6.118 ± 1.709
0.941ThrTrp: 0.941 ± 0.535
0.941ThrTyr: 0.941 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
4.706ValAla: 4.706 ± 0.763
1.412ValCys: 1.412 ± 0.491
2.824ValAsp: 2.824 ± 0.546
4.706ValGlu: 4.706 ± 1.516
4.235ValPhe: 4.235 ± 1.237
2.824ValGly: 2.824 ± 0.546
1.882ValHis: 1.882 ± 0.467
0.941ValIle: 0.941 ± 0.319
3.765ValLys: 3.765 ± 0.472
8.941ValLeu: 8.941 ± 0.782
0.0ValMet: 0.0 ± 0.0
0.471ValAsn: 0.471 ± 0.381
7.059ValPro: 7.059 ± 1.948
4.235ValGln: 4.235 ± 0.629
4.235ValArg: 4.235 ± 1.058
8.471ValSer: 8.471 ± 2.42
3.765ValThr: 3.765 ± 1.298
5.176ValVal: 5.176 ± 0.717
1.882ValTrp: 1.882 ± 1.07
1.412ValTyr: 1.412 ± 0.609
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.606
0.471TrpCys: 0.471 ± 0.303
1.882TrpAsp: 1.882 ± 2.481
1.412TrpGlu: 1.412 ± 0.634
1.412TrpPhe: 1.412 ± 0.609
1.882TrpGly: 1.882 ± 0.939
0.471TrpHis: 0.471 ± 0.303
1.412TrpIle: 1.412 ± 1.861
0.0TrpLys: 0.0 ± 0.0
2.353TrpLeu: 2.353 ± 1.034
0.0TrpMet: 0.0 ± 0.0
0.941TrpAsn: 0.941 ± 0.319
0.0TrpPro: 0.0 ± 0.0
2.353TrpGln: 2.353 ± 1.105
0.941TrpArg: 0.941 ± 0.584
0.941TrpSer: 0.941 ± 0.606
1.412TrpThr: 1.412 ± 0.908
1.412TrpVal: 1.412 ± 0.634
0.471TrpTrp: 0.471 ± 0.62
0.471TrpTyr: 0.471 ± 0.303
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.412TyrAla: 1.412 ± 0.491
0.471TyrCys: 0.471 ± 0.303
0.941TyrAsp: 0.941 ± 0.606
1.882TyrGlu: 1.882 ± 0.799
0.0TyrPhe: 0.0 ± 0.0
0.471TyrGly: 0.471 ± 0.62
2.824TyrHis: 2.824 ± 0.982
0.0TyrIle: 0.0 ± 0.0
0.941TyrLys: 0.941 ± 0.319
1.882TyrLeu: 1.882 ± 0.236
0.0TyrMet: 0.0 ± 0.0
0.941TyrAsn: 0.941 ± 1.24
1.412TyrPro: 1.412 ± 0.634
0.941TyrGln: 0.941 ± 0.584
2.353TyrArg: 2.353 ± 0.788
0.941TyrSer: 0.941 ± 1.24
1.412TyrThr: 1.412 ± 0.609
2.353TyrVal: 2.353 ± 0.77
0.471TyrTrp: 0.471 ± 0.303
0.471TyrTyr: 0.471 ± 0.62
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2126 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski