Amino acid dipepetide frequency for Huangpi Tick Virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.743AlaAla: 2.743 ± 1.405
1.92AlaCys: 1.92 ± 0.937
3.018AlaAsp: 3.018 ± 0.9
1.646AlaGlu: 1.646 ± 0.426
0.823AlaPhe: 0.823 ± 0.231
3.567AlaGly: 3.567 ± 0.687
0.823AlaHis: 0.823 ± 0.529
3.018AlaIle: 3.018 ± 1.588
3.841AlaLys: 3.841 ± 1.121
4.938AlaLeu: 4.938 ± 0.931
1.372AlaMet: 1.372 ± 0.433
1.097AlaAsn: 1.097 ± 0.391
2.743AlaPro: 2.743 ± 1.893
0.823AlaGln: 0.823 ± 0.865
3.841AlaArg: 3.841 ± 0.63
4.115AlaSer: 4.115 ± 1.695
4.115AlaThr: 4.115 ± 0.329
3.841AlaVal: 3.841 ± 1.102
0.274AlaTrp: 0.274 ± 0.16
0.823AlaTyr: 0.823 ± 0.865
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.549CysCys: 0.549 ± 0.203
0.549CysAsp: 0.549 ± 0.567
1.097CysGlu: 1.097 ± 0.603
2.743CysPhe: 2.743 ± 1.674
1.646CysGly: 1.646 ± 0.608
0.274CysHis: 0.274 ± 0.283
2.743CysIle: 2.743 ± 0.96
1.372CysLys: 1.372 ± 1.025
1.92CysLeu: 1.92 ± 0.556
0.549CysMet: 0.549 ± 0.203
1.097CysAsn: 1.097 ± 0.405
1.372CysPro: 1.372 ± 1.025
1.646CysGln: 1.646 ± 1.074
1.92CysArg: 1.92 ± 0.283
4.938CysSer: 4.938 ± 1.989
1.097CysThr: 1.097 ± 0.97
0.823CysVal: 0.823 ± 0.231
0.549CysTrp: 0.549 ± 0.465
1.372CysTyr: 1.372 ± 1.039
0.0CysXaa: 0.0 ± 0.0
Asp
3.841AspAla: 3.841 ± 1.192
2.743AspCys: 2.743 ± 1.352
3.567AspAsp: 3.567 ± 1.731
4.115AspGlu: 4.115 ± 1.688
3.018AspPhe: 3.018 ± 0.826
1.372AspGly: 1.372 ± 0.403
2.195AspHis: 2.195 ± 0.564
1.097AspIle: 1.097 ± 0.391
2.195AspLys: 2.195 ± 0.895
6.31AspLeu: 6.31 ± 1.246
1.372AspMet: 1.372 ± 0.467
1.097AspAsn: 1.097 ± 0.391
2.743AspPro: 2.743 ± 0.753
1.92AspGln: 1.92 ± 0.283
1.92AspArg: 1.92 ± 0.56
3.567AspSer: 3.567 ± 1.053
2.469AspThr: 2.469 ± 0.656
3.567AspVal: 3.567 ± 0.674
1.097AspTrp: 1.097 ± 0.816
2.743AspTyr: 2.743 ± 0.848
0.274AspXaa: 0.274 ± 0.283
Glu
4.115GluAla: 4.115 ± 1.239
1.372GluCys: 1.372 ± 1.417
3.018GluAsp: 3.018 ± 1.413
4.938GluGlu: 4.938 ± 2.194
3.018GluPhe: 3.018 ± 0.807
2.195GluGly: 2.195 ± 0.683
0.549GluHis: 0.549 ± 0.32
4.938GluIle: 4.938 ± 0.418
3.841GluLys: 3.841 ± 0.61
4.115GluLeu: 4.115 ± 1.39
1.372GluMet: 1.372 ± 0.403
2.469GluAsn: 2.469 ± 0.991
1.097GluPro: 1.097 ± 0.577
1.646GluGln: 1.646 ± 0.806
4.664GluArg: 4.664 ± 2.28
5.761GluSer: 5.761 ± 0.493
6.31GluThr: 6.31 ± 1.817
3.567GluVal: 3.567 ± 1.192
1.372GluTrp: 1.372 ± 0.481
1.646GluTyr: 1.646 ± 0.961
0.0GluXaa: 0.0 ± 0.0
Phe
2.195PheAla: 2.195 ± 0.564
1.646PheCys: 1.646 ± 1.074
2.195PheAsp: 2.195 ± 0.335
3.018PheGlu: 3.018 ± 0.429
3.841PhePhe: 3.841 ± 1.08
2.743PheGly: 2.743 ± 1.223
1.646PheHis: 1.646 ± 0.63
2.743PheIle: 2.743 ± 0.371
3.018PheLys: 3.018 ± 1.413
7.682PheLeu: 7.682 ± 1.046
1.097PheMet: 1.097 ± 0.641
1.92PheAsn: 1.92 ± 0.595
1.646PhePro: 1.646 ± 0.961
2.195PheGln: 2.195 ± 0.448
3.841PheArg: 3.841 ± 1.186
5.213PheSer: 5.213 ± 0.938
2.743PheThr: 2.743 ± 0.962
1.646PheVal: 1.646 ± 0.426
0.274PheTrp: 0.274 ± 0.16
1.646PheTyr: 1.646 ± 0.461
0.0PheXaa: 0.0 ± 0.0
Gly
1.92GlyAla: 1.92 ± 1.605
1.646GlyCys: 1.646 ± 0.608
0.823GlyAsp: 0.823 ± 0.466
3.018GlyGlu: 3.018 ± 1.322
4.664GlyPhe: 4.664 ± 0.814
3.018GlyGly: 3.018 ± 0.652
1.097GlyHis: 1.097 ± 0.405
4.938GlyIle: 4.938 ± 0.427
3.018GlyLys: 3.018 ± 1.345
5.213GlyLeu: 5.213 ± 0.89
3.292GlyMet: 3.292 ± 1.131
2.469GlyAsn: 2.469 ± 0.409
1.92GlyPro: 1.92 ± 0.56
1.097GlyGln: 1.097 ± 0.341
3.567GlyArg: 3.567 ± 0.567
3.018GlySer: 3.018 ± 1.073
2.469GlyThr: 2.469 ± 0.818
3.292GlyVal: 3.292 ± 1.572
1.372GlyTrp: 1.372 ± 0.467
1.097GlyTyr: 1.097 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
0.549HisAla: 0.549 ± 0.203
1.372HisCys: 1.372 ± 0.403
0.823HisAsp: 0.823 ± 0.48
1.097HisGlu: 1.097 ± 0.341
1.646HisPhe: 1.646 ± 0.461
1.372HisGly: 1.372 ± 0.662
2.195HisHis: 2.195 ± 0.625
1.372HisIle: 1.372 ± 0.376
2.195HisLys: 2.195 ± 1.029
1.372HisLeu: 1.372 ± 0.467
0.823HisMet: 0.823 ± 0.464
0.549HisAsn: 0.549 ± 0.203
1.646HisPro: 1.646 ± 0.347
1.097HisGln: 1.097 ± 0.341
1.372HisArg: 1.372 ± 0.801
2.743HisSer: 2.743 ± 0.354
0.823HisThr: 0.823 ± 0.529
3.018HisVal: 3.018 ± 0.851
0.274HisTrp: 0.274 ± 0.16
0.274HisTyr: 0.274 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
3.292IleAla: 3.292 ± 0.695
2.743IleCys: 2.743 ± 0.786
4.115IleAsp: 4.115 ± 0.739
2.195IleGlu: 2.195 ± 0.564
1.646IlePhe: 1.646 ± 0.352
4.115IleGly: 4.115 ± 1.151
2.195IleHis: 2.195 ± 0.81
4.39IleIle: 4.39 ± 0.946
2.469IleLys: 2.469 ± 0.694
7.407IleLeu: 7.407 ± 0.665
2.195IleMet: 2.195 ± 0.683
3.018IleAsn: 3.018 ± 1.065
2.195IlePro: 2.195 ± 0.81
3.018IleGln: 3.018 ± 0.256
1.646IleArg: 1.646 ± 0.793
6.31IleSer: 6.31 ± 2.197
2.743IleThr: 2.743 ± 0.707
3.018IleVal: 3.018 ± 1.835
1.372IleTrp: 1.372 ± 1.217
2.195IleTyr: 2.195 ± 1.573
0.0IleXaa: 0.0 ± 0.0
Lys
4.664LysAla: 4.664 ± 0.964
1.097LysCys: 1.097 ± 0.973
4.115LysAsp: 4.115 ± 2.552
4.39LysGlu: 4.39 ± 0.594
1.372LysPhe: 1.372 ± 0.315
2.469LysGly: 2.469 ± 1.058
1.097LysHis: 1.097 ± 0.641
4.115LysIle: 4.115 ± 1.961
4.115LysLys: 4.115 ± 0.938
5.213LysLeu: 5.213 ± 0.996
3.567LysMet: 3.567 ± 0.885
2.195LysAsn: 2.195 ± 0.875
1.646LysPro: 1.646 ± 0.426
1.646LysGln: 1.646 ± 0.461
3.292LysArg: 3.292 ± 0.79
4.664LysSer: 4.664 ± 0.813
3.567LysThr: 3.567 ± 1.026
3.567LysVal: 3.567 ± 0.687
1.372LysTrp: 1.372 ± 0.467
1.372LysTyr: 1.372 ± 0.403
0.0LysXaa: 0.0 ± 0.0
Leu
4.115LeuAla: 4.115 ± 1.254
1.372LeuCys: 1.372 ± 0.898
3.841LeuAsp: 3.841 ± 0.815
6.859LeuGlu: 6.859 ± 1.329
5.213LeuPhe: 5.213 ± 0.897
7.682LeuGly: 7.682 ± 0.728
3.018LeuHis: 3.018 ± 1.11
6.584LeuIle: 6.584 ± 2.143
8.505LeuLys: 8.505 ± 1.127
8.505LeuLeu: 8.505 ± 2.049
1.92LeuMet: 1.92 ± 0.805
3.841LeuAsn: 3.841 ± 0.364
3.841LeuPro: 3.841 ± 0.916
2.743LeuGln: 2.743 ± 0.848
6.859LeuArg: 6.859 ± 0.766
9.328LeuSer: 9.328 ± 1.147
6.036LeuThr: 6.036 ± 0.9
6.036LeuVal: 6.036 ± 1.3
0.274LeuTrp: 0.274 ± 0.16
2.469LeuTyr: 2.469 ± 0.751
0.0LeuXaa: 0.0 ± 0.0
Met
2.469MetAla: 2.469 ± 1.763
0.549MetCys: 0.549 ± 0.203
2.195MetAsp: 2.195 ± 0.94
1.372MetGlu: 1.372 ± 0.403
1.92MetPhe: 1.92 ± 0.556
0.549MetGly: 0.549 ± 0.203
1.097MetHis: 1.097 ± 1.053
1.097MetIle: 1.097 ± 0.816
2.469MetLys: 2.469 ± 0.694
3.292MetLeu: 3.292 ± 0.855
2.469MetMet: 2.469 ± 1.374
1.646MetAsn: 1.646 ± 0.793
0.274MetPro: 0.274 ± 0.533
0.549MetGln: 0.549 ± 0.203
1.372MetArg: 1.372 ± 0.403
2.469MetSer: 2.469 ± 0.793
1.92MetThr: 1.92 ± 0.895
0.549MetVal: 0.549 ± 0.32
0.274MetTrp: 0.274 ± 0.16
0.274MetTyr: 0.274 ± 0.16
0.0MetXaa: 0.0 ± 0.0
Asn
1.372AsnAla: 1.372 ± 1.33
0.823AsnCys: 0.823 ± 0.85
1.92AsnAsp: 1.92 ± 0.808
2.195AsnGlu: 2.195 ± 0.625
2.743AsnPhe: 2.743 ± 0.962
1.372AsnGly: 1.372 ± 0.801
0.549AsnHis: 0.549 ± 0.32
1.372AsnIle: 1.372 ± 0.481
1.92AsnLys: 1.92 ± 1.255
4.39AsnLeu: 4.39 ± 0.854
0.823AsnMet: 0.823 ± 1.087
0.823AsnAsn: 0.823 ± 0.48
4.664AsnPro: 4.664 ± 0.217
1.097AsnGln: 1.097 ± 0.641
2.195AsnArg: 2.195 ± 0.729
3.292AsnSer: 3.292 ± 1.024
2.743AsnThr: 2.743 ± 0.382
2.469AsnVal: 2.469 ± 1.664
0.274AsnTrp: 0.274 ± 0.283
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.92ProAla: 1.92 ± 0.56
0.549ProCys: 0.549 ± 0.585
1.92ProAsp: 1.92 ± 0.76
3.018ProGlu: 3.018 ± 0.433
3.567ProPhe: 3.567 ± 0.947
3.292ProGly: 3.292 ± 0.852
1.372ProHis: 1.372 ± 0.662
2.195ProIle: 2.195 ± 1.119
1.646ProLys: 1.646 ± 0.608
2.469ProLeu: 2.469 ± 0.997
0.823ProMet: 0.823 ± 0.466
1.372ProAsn: 1.372 ± 0.758
2.469ProPro: 2.469 ± 0.818
1.92ProGln: 1.92 ± 0.491
1.372ProArg: 1.372 ± 0.66
3.567ProSer: 3.567 ± 0.719
3.567ProThr: 3.567 ± 1.284
2.743ProVal: 2.743 ± 0.698
1.097ProTrp: 1.097 ± 0.561
0.274ProTyr: 0.274 ± 0.16
0.0ProXaa: 0.0 ± 0.0
Gln
1.92GlnAla: 1.92 ± 0.595
1.646GlnCys: 1.646 ± 1.477
0.823GlnAsp: 0.823 ± 0.529
3.018GlnGlu: 3.018 ± 0.979
1.92GlnPhe: 1.92 ± 0.491
2.743GlnGly: 2.743 ± 1.065
1.372GlnHis: 1.372 ± 0.481
2.195GlnIle: 2.195 ± 0.448
2.743GlnLys: 2.743 ± 0.382
1.646GlnLeu: 1.646 ± 0.352
1.372GlnMet: 1.372 ± 0.446
1.92GlnAsn: 1.92 ± 0.805
1.372GlnPro: 1.372 ± 0.467
1.372GlnGln: 1.372 ± 0.376
2.743GlnArg: 2.743 ± 1.263
1.646GlnSer: 1.646 ± 0.916
2.469GlnThr: 2.469 ± 0.818
1.097GlnVal: 1.097 ± 0.359
0.549GlnTrp: 0.549 ± 0.203
0.823GlnTyr: 0.823 ± 0.48
0.0GlnXaa: 0.0 ± 0.0
Arg
2.195ArgAla: 2.195 ± 1.154
1.92ArgCys: 1.92 ± 0.556
3.292ArgAsp: 3.292 ± 0.79
3.567ArgGlu: 3.567 ± 0.886
3.292ArgPhe: 3.292 ± 0.695
2.743ArgGly: 2.743 ± 0.924
1.372ArgHis: 1.372 ± 0.66
5.487ArgIle: 5.487 ± 0.627
3.292ArgLys: 3.292 ± 0.191
4.664ArgLeu: 4.664 ± 0.217
1.646ArgMet: 1.646 ± 1.318
1.646ArgAsn: 1.646 ± 0.333
1.646ArgPro: 1.646 ± 0.678
3.292ArgGln: 3.292 ± 2.256
2.195ArgArg: 2.195 ± 0.875
5.487ArgSer: 5.487 ± 1.052
2.743ArgThr: 2.743 ± 0.354
3.841ArgVal: 3.841 ± 1.045
0.823ArgTrp: 0.823 ± 0.48
1.646ArgTyr: 1.646 ± 0.961
0.0ArgXaa: 0.0 ± 0.0
Ser
4.115SerAla: 4.115 ± 1.101
2.469SerCys: 2.469 ± 2.155
6.859SerAsp: 6.859 ± 1.475
5.487SerGlu: 5.487 ± 0.652
4.39SerPhe: 4.39 ± 1.435
4.938SerGly: 4.938 ± 0.758
2.195SerHis: 2.195 ± 0.335
3.841SerIle: 3.841 ± 0.621
4.938SerLys: 4.938 ± 0.427
10.7SerLeu: 10.7 ± 0.887
0.549SerMet: 0.549 ± 0.527
3.567SerAsn: 3.567 ± 0.655
3.018SerPro: 3.018 ± 1.11
3.567SerGln: 3.567 ± 1.469
5.487SerArg: 5.487 ± 1.029
9.328SerSer: 9.328 ± 2.104
5.761SerThr: 5.761 ± 1.191
6.036SerVal: 6.036 ± 1.21
2.195SerTrp: 2.195 ± 0.809
0.823SerTyr: 0.823 ± 0.447
0.0SerXaa: 0.0 ± 0.0
Thr
3.567ThrAla: 3.567 ± 0.886
1.646ThrCys: 1.646 ± 0.997
4.115ThrAsp: 4.115 ± 0.598
3.841ThrGlu: 3.841 ± 1.568
3.018ThrPhe: 3.018 ± 0.47
4.115ThrGly: 4.115 ± 0.996
0.823ThrHis: 0.823 ± 0.466
4.39ThrIle: 4.39 ± 0.638
2.469ThrLys: 2.469 ± 0.384
7.682ThrLeu: 7.682 ± 2.31
0.823ThrMet: 0.823 ± 0.259
3.292ThrAsn: 3.292 ± 1.572
2.743ThrPro: 2.743 ± 1.076
1.097ThrGln: 1.097 ± 0.485
2.743ThrArg: 2.743 ± 1.205
5.213ThrSer: 5.213 ± 1.046
4.39ThrThr: 4.39 ± 1.153
3.567ThrVal: 3.567 ± 1.069
0.823ThrTrp: 0.823 ± 0.48
2.195ThrTyr: 2.195 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
3.018ValAla: 3.018 ± 1.263
1.372ValCys: 1.372 ± 0.66
3.567ValAsp: 3.567 ± 1.013
5.487ValGlu: 5.487 ± 0.949
3.018ValPhe: 3.018 ± 1.11
1.92ValGly: 1.92 ± 0.491
0.823ValHis: 0.823 ± 0.464
3.018ValIle: 3.018 ± 1.345
3.018ValLys: 3.018 ± 0.897
6.31ValLeu: 6.31 ± 0.39
1.097ValMet: 1.097 ± 0.744
1.646ValAsn: 1.646 ± 0.608
1.646ValPro: 1.646 ± 0.929
3.841ValGln: 3.841 ± 0.695
3.567ValArg: 3.567 ± 0.674
5.213ValSer: 5.213 ± 0.733
3.292ValThr: 3.292 ± 0.644
3.841ValVal: 3.841 ± 0.874
1.097ValTrp: 1.097 ± 0.485
2.195ValTyr: 2.195 ± 0.94
0.274ValXaa: 0.274 ± 0.571
Trp
0.274TrpAla: 0.274 ± 0.16
0.823TrpCys: 0.823 ± 0.48
1.372TrpAsp: 1.372 ± 1.266
0.274TrpGlu: 0.274 ± 0.16
0.823TrpPhe: 0.823 ± 0.466
0.823TrpGly: 0.823 ± 0.466
0.0TrpHis: 0.0 ± 0.0
1.646TrpIle: 1.646 ± 0.932
0.549TrpLys: 0.549 ± 0.527
1.92TrpLeu: 1.92 ± 0.808
0.823TrpMet: 0.823 ± 0.48
0.274TrpAsn: 0.274 ± 0.16
0.549TrpPro: 0.549 ± 1.065
0.0TrpGln: 0.0 ± 0.0
1.372TrpArg: 1.372 ± 0.481
1.372TrpSer: 1.372 ± 0.403
1.646TrpThr: 1.646 ± 0.352
1.646TrpVal: 1.646 ± 0.932
0.0TrpTrp: 0.0 ± 0.0
0.549TrpTyr: 0.549 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.097TyrAla: 1.097 ± 0.391
0.0TyrCys: 0.0 ± 0.0
1.097TyrAsp: 1.097 ± 0.391
1.372TyrGlu: 1.372 ± 0.786
0.274TyrPhe: 0.274 ± 0.16
0.549TyrGly: 0.549 ± 0.32
1.92TyrHis: 1.92 ± 0.805
1.097TyrIle: 1.097 ± 0.359
1.92TyrLys: 1.92 ± 0.44
3.292TyrLeu: 3.292 ± 0.524
0.274TyrMet: 0.274 ± 0.16
1.097TyrAsn: 1.097 ± 0.341
1.92TyrPro: 1.92 ± 0.593
0.823TyrGln: 0.823 ± 0.934
0.823TyrArg: 0.823 ± 0.48
3.018TyrSer: 3.018 ± 0.648
1.646TyrThr: 1.646 ± 0.461
0.823TyrVal: 0.823 ± 0.48
1.372TyrTrp: 1.372 ± 0.481
0.823TyrTyr: 0.823 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.274XaaMet: 0.274 ± 0.571
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.274XaaVal: 0.274 ± 0.283
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3646 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski