Amino acid dipeptide frequency for Beihai picorna-like virus 54

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
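The counting and the per mille scaling described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `representation` are hypothetical, one-letter codes are used instead of the three-letter codes shown in the table, and it is assumed that pairs are counted only within a protein (never across two proteins) and that every non-standard letter is mapped to `X` (Xaa).

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")
# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): pairs never span two proteins,
    and any non-standard letter is replaced by 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real viral sequences.
toy = ["MAAL", "AAB"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], representation(freqs["AA"]))
```

To turn a table value into a plain fraction, multiply by 10^-3; for example, the AlaGlu entry of 5.967‰ corresponds to a fraction of about 0.005967, which is above the uniform expectation of 2.5‰.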

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰); rows give the first residue, columns the second residue.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 1.755 ± 0.06 | 1.053 ± 0.374 | 3.51 ± 0.393 | 5.967 ± 0.411 | 2.106 ± 0.278 | 5.265 ± 1.359 | 1.404 ± 0.157 | 3.159 ± 0.61 | 1.755 ± 0.574 | 6.669 ± 1.05 | 0.702 ± 0.079 | 4.212 ± 2.524 | 4.212 ± 0.985 | 1.755 ± 0.966 | 5.265 ± 0.694 | 6.669 ± 0.49 | 2.106 ± 0.749 | 5.616 ± 0.911 | 2.106 ± 0.236 | 1.404 ± 0.157 | 0.0 ± 0.0 |
| Cys | 0.351 ± 0.296 | 0.351 ± 0.217 | 1.053 ± 0.652 | 1.404 ± 0.157 | 1.053 ± 0.374 | 1.053 ± 0.139 | 0.702 ± 0.435 | 1.053 ± 0.652 | 1.053 ± 0.652 | 1.053 ± 0.374 | 0.351 ± 0.296 | 0.702 ± 0.592 | 0.351 ± 0.217 | 0.351 ± 0.217 | 1.404 ± 0.157 | 2.457 ± 0.018 | 1.053 ± 0.652 | 0.702 ± 0.435 | 0.0 ± 0.0 | 0.351 ± 0.217 | 0.0 ± 0.0 |
| Asp | 5.616 ± 1.142 | 1.404 ± 0.869 | 4.914 ± 0.55 | 4.563 ± 0.254 | 3.861 ± 0.175 | 3.861 ± 2.228 | 1.053 ± 0.652 | 3.51 ± 1.933 | 1.404 ± 0.869 | 4.563 ± 0.254 | 0.702 ± 0.592 | 3.861 ± 2.228 | 3.51 ± 0.906 | 2.457 ± 1.521 | 2.808 ± 0.712 | 4.914 ± 0.477 | 3.861 ± 0.175 | 3.51 ± 0.906 | 1.755 ± 0.06 | 2.808 ± 0.314 | 0.0 ± 0.0 |
| Glu | 2.106 ± 0.278 | 1.404 ± 0.356 | 3.861 ± 0.175 | 4.212 ± 0.555 | 3.159 ± 0.416 | 3.159 ± 1.443 | 0.702 ± 0.435 | 4.212 ± 0.555 | 2.457 ± 1.521 | 2.457 ± 1.008 | 3.861 ± 0.175 | 2.106 ± 0.278 | 5.967 ± 0.924 | 3.159 ± 0.61 | 3.51 ± 0.634 | 2.106 ± 1.262 | 2.106 ± 0.278 | 4.212 ± 1.498 | 2.457 ± 0.495 | 2.808 ± 0.314 | 0.0 ± 0.0 |
| Phe | 3.861 ± 0.338 | 0.351 ± 0.217 | 3.159 ± 0.416 | 2.106 ± 0.749 | 3.159 ± 0.61 | 3.159 ± 0.61 | 1.404 ± 0.356 | 4.563 ± 1.286 | 1.755 ± 0.06 | 4.212 ± 0.471 | 1.053 ± 0.652 | 2.457 ± 1.558 | 3.51 ± 0.906 | 1.404 ± 0.67 | 1.755 ± 0.06 | 4.212 ± 0.471 | 2.457 ± 0.532 | 2.457 ± 1.008 | 0.0 ± 0.0 | 2.106 ± 1.262 | 0.0 ± 0.0 |
| Gly | 2.457 ± 1.045 | 1.053 ± 0.652 | 4.914 ± 0.55 | 4.914 ± 0.037 | 2.808 ± 0.199 | 4.212 ± 0.985 | 1.053 ± 0.139 | 1.755 ± 0.453 | 3.51 ± 1.66 | 4.212 ± 1.582 | 2.106 ± 0.236 | 4.212 ± 0.042 | 2.106 ± 0.749 | 2.457 ± 0.495 | 4.212 ± 0.471 | 5.967 ± 0.102 | 3.861 ± 0.175 | 2.808 ± 0.314 | 0.351 ± 0.296 | 1.404 ± 0.356 | 0.0 ± 0.0 |
| His | 2.106 ± 0.236 | 0.351 ± 0.217 | 1.755 ± 1.087 | 3.159 ± 0.93 | 3.159 ± 0.416 | 1.053 ± 0.652 | 0.702 ± 0.435 | 0.351 ± 0.217 | 1.053 ± 0.652 | 2.457 ± 0.495 | 2.106 ± 1.304 | 0.702 ± 0.435 | 1.404 ± 0.356 | 0.702 ± 0.079 | 1.404 ± 0.356 | 0.0 ± 0.0 | 0.351 ± 0.217 | 1.755 ± 0.574 | 0.351 ± 0.217 | 1.053 ± 0.652 | 0.0 ± 0.0 |
| Ile | 4.563 ± 1.28 | 0.0 ± 0.0 | 3.159 ± 0.61 | 4.212 ± 0.555 | 2.457 ± 0.532 | 5.967 ± 0.102 | 1.404 ± 0.869 | 2.808 ± 0.712 | 3.159 ± 1.443 | 2.457 ± 0.018 | 2.808 ± 0.712 | 4.212 ± 0.471 | 2.106 ± 0.278 | 2.106 ± 0.278 | 4.914 ± 1.063 | 4.212 ± 0.985 | 4.563 ± 2.307 | 4.563 ± 0.254 | 0.702 ± 0.079 | 2.457 ± 0.018 | 0.0 ± 0.0 |
| Lys | 2.457 ± 1.008 | 0.351 ± 0.217 | 2.457 ± 1.008 | 2.106 ± 0.278 | 1.053 ± 0.374 | 2.106 ± 0.791 | 1.404 ± 0.356 | 3.861 ± 0.851 | 3.51 ± 1.66 | 2.808 ± 1.226 | 2.457 ± 0.495 | 1.053 ± 0.139 | 2.106 ± 0.791 | 1.404 ± 0.356 | 1.755 ± 0.574 | 3.159 ± 1.956 | 4.212 ± 2.095 | 1.404 ± 0.157 | 1.755 ± 1.087 | 1.755 ± 1.087 | 0.0 ± 0.0 |
| Leu | 5.616 ± 0.398 | 1.755 ± 1.087 | 6.669 ± 0.49 | 3.51 ± 0.634 | 2.808 ± 0.712 | 1.053 ± 0.139 | 2.106 ± 1.304 | 5.616 ± 1.425 | 4.212 ± 1.068 | 5.265 ± 2.234 | 1.404 ± 0.133 | 3.159 ± 0.61 | 4.914 ± 1.063 | 2.808 ± 0.199 | 4.212 ± 0.471 | 4.914 ± 0.477 | 7.371 ± 0.568 | 5.265 ± 0.846 | 1.053 ± 0.139 | 1.404 ± 0.157 | 0.0 ± 0.0 |
| Met | 3.159 ± 0.61 | 0.702 ± 0.592 | 2.106 ± 0.278 | 0.702 ± 0.592 | 1.404 ± 0.356 | 1.755 ± 0.06 | 1.755 ± 1.087 | 2.106 ± 0.278 | 1.053 ± 0.139 | 2.106 ± 0.791 | 1.755 ± 0.06 | 2.457 ± 0.495 | 2.457 ± 0.495 | 0.351 ± 0.217 | 1.404 ± 0.157 | 2.106 ± 0.791 | 2.457 ± 0.018 | 3.159 ± 0.93 | 0.702 ± 0.435 | 1.053 ± 0.374 | 0.0 ± 0.0 |
| Asn | 5.616 ± 0.628 | 0.351 ± 0.217 | 3.159 ± 2.15 | 0.702 ± 0.079 | 1.053 ± 0.652 | 3.861 ± 0.851 | 1.053 ± 0.139 | 2.106 ± 0.749 | 1.755 ± 0.574 | 5.616 ± 1.655 | 2.808 ± 0.712 | 2.457 ± 0.018 | 4.212 ± 2.524 | 2.457 ± 0.018 | 1.755 ± 1.087 | 1.755 ± 0.574 | 3.861 ± 2.228 | 4.563 ± 1.794 | 0.702 ± 0.435 | 1.755 ± 0.574 | 0.0 ± 0.0 |
| Pro | 4.914 ± 1.063 | 1.053 ± 0.374 | 3.51 ± 0.393 | 4.212 ± 1.068 | 4.914 ± 2.09 | 3.159 ± 0.097 | 1.755 ± 0.574 | 5.616 ± 2.168 | 1.053 ± 0.652 | 4.914 ± 0.477 | 1.404 ± 0.519 | 1.404 ± 0.67 | 4.914 ± 1.576 | 1.404 ± 0.67 | 2.457 ± 1.045 | 3.51 ± 0.393 | 3.51 ± 0.393 | 3.159 ± 1.123 | 1.053 ± 0.139 | 2.457 ± 1.045 | 0.0 ± 0.0 |
| Gln | 2.808 ± 0.827 | 0.0 ± 0.0 | 2.808 ± 1.341 | 1.404 ± 0.356 | 0.702 ± 0.435 | 0.351 ± 0.217 | 0.351 ± 0.217 | 2.106 ± 0.791 | 0.351 ± 0.217 | 3.159 ± 0.097 | 2.106 ± 0.236 | 1.053 ± 0.139 | 3.159 ± 0.416 | 0.702 ± 0.592 | 1.404 ± 0.67 | 1.755 ± 0.453 | 1.404 ± 0.157 | 2.106 ± 0.791 | 0.702 ± 0.435 | 1.053 ± 0.139 | 0.0 ± 0.0 |
| Arg | 3.861 ± 0.175 | 2.106 ± 0.236 | 3.159 ± 1.637 | 2.808 ± 1.226 | 3.51 ± 0.121 | 3.159 ± 0.416 | 2.106 ± 0.791 | 4.212 ± 1.498 | 2.808 ± 1.739 | 3.861 ± 0.851 | 1.755 ± 0.574 | 2.106 ± 0.278 | 2.808 ± 0.314 | 1.755 ± 0.06 | 2.457 ± 1.008 | 2.808 ± 0.199 | 1.404 ± 0.356 | 5.616 ± 1.142 | 0.702 ± 0.079 | 1.755 ± 0.06 | 0.0 ± 0.0 |
| Ser | 4.212 ± 0.471 | 1.755 ± 0.574 | 6.669 ± 2.029 | 4.914 ± 0.037 | 2.457 ± 0.495 | 5.967 ± 0.411 | 1.404 ± 0.356 | 5.616 ± 0.628 | 2.808 ± 1.226 | 5.967 ± 0.615 | 1.053 ± 0.888 | 2.808 ± 1.226 | 1.053 ± 0.374 | 1.755 ± 0.06 | 3.159 ± 0.416 | 3.159 ± 0.416 | 4.914 ± 0.55 | 2.808 ± 0.314 | 0.351 ± 0.296 | 3.159 ± 0.097 | 0.0 ± 0.0 |
| Thr | 3.861 ± 1.202 | 1.053 ± 0.374 | 2.457 ± 0.495 | 3.861 ± 0.175 | 3.51 ± 1.933 | 2.457 ± 0.532 | 0.702 ± 0.079 | 3.51 ± 0.906 | 3.51 ± 1.147 | 4.563 ± 0.259 | 2.457 ± 0.495 | 4.563 ± 0.259 | 5.616 ± 1.142 | 1.053 ± 0.374 | 2.106 ± 0.236 | 6.318 ± 1.733 | 3.861 ± 0.175 | 3.51 ± 0.634 | 1.404 ± 0.157 | 1.755 ± 0.453 | 0.0 ± 0.0 |
| Val | 4.563 ± 0.259 | 1.053 ± 0.374 | 3.159 ± 0.097 | 3.861 ± 0.338 | 3.51 ± 0.393 | 4.914 ± 0.477 | 2.457 ± 1.008 | 4.914 ± 1.063 | 2.808 ± 0.827 | 4.212 ± 0.985 | 1.755 ± 0.574 | 4.563 ± 0.254 | 3.159 ± 0.61 | 1.053 ± 0.652 | 6.318 ± 0.833 | 3.51 ± 0.121 | 3.51 ± 0.906 | 4.563 ± 0.254 | 0.351 ± 0.296 | 1.053 ± 0.374 | 0.0 ± 0.0 |
| Trp | 0.702 ± 0.435 | 0.351 ± 0.217 | 0.702 ± 0.079 | 0.0 ± 0.0 | 1.053 ± 0.374 | 1.053 ± 0.139 | 0.702 ± 0.435 | 1.053 ± 0.139 | 1.404 ± 0.356 | 1.404 ± 0.356 | 1.053 ± 0.652 | 0.351 ± 0.296 | 0.351 ± 0.217 | 0.351 ± 0.296 | 1.404 ± 0.157 | 1.404 ± 0.157 | 1.404 ± 0.157 | 1.755 ± 0.06 | 0.351 ± 0.217 | 0.702 ± 0.435 | 0.0 ± 0.0 |
| Tyr | 2.106 ± 0.278 | 0.351 ± 0.296 | 1.755 ± 0.06 | 1.404 ± 0.157 | 1.404 ± 0.67 | 2.808 ± 0.712 | 1.755 ± 0.453 | 0.702 ± 0.435 | 1.755 ± 0.06 | 3.51 ± 1.419 | 0.702 ± 0.079 | 3.159 ± 0.097 | 2.457 ± 0.532 | 0.0 ± 0.0 | 1.053 ± 0.652 | 1.053 ± 0.139 | 3.861 ± 0.689 | 1.755 ± 1.087 | 0.702 ± 0.079 | 0.702 ± 0.435 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (2850 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
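Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The exact procedure used by the database is not described here, so the details are assumptions: the helper names `pair_frequencies` and `bootstrap_error` are hypothetical, and the population standard deviation is used as the error measure.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement; the error is the population standard deviation of the
    replicate frequencies.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequencies(sample).get(pair, 0.0))
    return statistics.pstdev(values)


# Toy input, not real viral sequences.
print(bootstrap_error(["AAAA", "CCCC"], "AA"))
```

With only 2 proteins, as in this proteome, each replicate can only be one of three combinations (first protein twice, second protein twice, or one of each), so the error estimates are necessarily coarse.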

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski