Amino acid dipepetide frequency for Wenling picorna-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.957AlaAla: 6.957 ± 1.751
2.174AlaCys: 2.174 ± 1.217
3.043AlaAsp: 3.043 ± 0.275
2.174AlaGlu: 2.174 ± 0.503
2.609AlaPhe: 2.609 ± 0.746
6.522AlaGly: 6.522 ± 0.078
1.304AlaHis: 1.304 ± 0.73
3.478AlaIle: 3.478 ± 0.196
3.478AlaLys: 3.478 ± 0.518
7.391AlaLeu: 7.391 ± 0.565
2.174AlaMet: 2.174 ± 0.302
2.609AlaAsn: 2.609 ± 2.113
1.304AlaPro: 1.304 ± 0.73
4.783AlaGln: 4.783 ± 0.181
5.217AlaArg: 5.217 ± 0.063
5.652AlaSer: 5.652 ± 1.021
3.043AlaThr: 3.043 ± 0.44
4.348AlaVal: 4.348 ± 1.139
0.87AlaTrp: 0.87 ± 0.487
2.609AlaTyr: 2.609 ± 0.683
0.0AlaXaa: 0.0 ± 0.0
Cys
1.739CysAla: 1.739 ± 0.974
0.435CysCys: 0.435 ± 0.243
0.87CysAsp: 0.87 ± 0.228
0.87CysGlu: 0.87 ± 0.487
1.739CysPhe: 1.739 ± 0.974
3.043CysGly: 3.043 ± 1.704
0.0CysHis: 0.0 ± 0.0
0.435CysIle: 0.435 ± 0.243
2.174CysLys: 2.174 ± 1.217
1.304CysLeu: 1.304 ± 0.016
0.435CysMet: 0.435 ± 0.471
0.87CysAsn: 0.87 ± 0.487
0.87CysPro: 0.87 ± 0.228
0.87CysGln: 0.87 ± 0.487
0.435CysArg: 0.435 ± 0.243
1.304CysSer: 1.304 ± 0.73
0.87CysThr: 0.87 ± 0.487
2.174CysVal: 2.174 ± 1.217
0.0CysTrp: 0.0 ± 0.0
0.435CysTyr: 0.435 ± 0.471
0.0CysXaa: 0.0 ± 0.0
Asp
6.087AspAla: 6.087 ± 0.165
0.87AspCys: 0.87 ± 0.487
3.043AspAsp: 3.043 ± 0.275
6.087AspGlu: 6.087 ± 1.979
4.783AspPhe: 4.783 ± 0.534
3.913AspGly: 3.913 ± 0.762
1.304AspHis: 1.304 ± 0.016
1.304AspIle: 1.304 ± 1.414
5.217AspLys: 5.217 ± 0.652
5.652AspLeu: 5.652 ± 1.736
1.304AspMet: 1.304 ± 0.699
2.174AspAsn: 2.174 ± 0.927
3.043AspPro: 3.043 ± 1.155
3.478AspGln: 3.478 ± 0.518
2.609AspArg: 2.609 ± 0.683
2.174AspSer: 2.174 ± 0.212
1.739AspThr: 1.739 ± 0.259
7.826AspVal: 7.826 ± 1.335
0.0AspTrp: 0.0 ± 0.0
0.435AspTyr: 0.435 ± 0.243
0.0AspXaa: 0.0 ± 0.0
Glu
1.739GluAla: 1.739 ± 0.974
0.435GluCys: 0.435 ± 0.243
2.174GluAsp: 2.174 ± 0.503
1.304GluGlu: 1.304 ± 0.016
2.609GluPhe: 2.609 ± 0.031
3.913GluGly: 3.913 ± 0.047
1.304GluHis: 1.304 ± 0.73
3.043GluIle: 3.043 ± 0.275
2.174GluLys: 2.174 ± 1.217
2.174GluLeu: 2.174 ± 1.217
1.739GluMet: 1.739 ± 0.456
1.304GluAsn: 1.304 ± 0.016
3.478GluPro: 3.478 ± 1.233
1.739GluGln: 1.739 ± 0.456
3.043GluArg: 3.043 ± 1.704
5.652GluSer: 5.652 ± 1.021
3.043GluThr: 3.043 ± 1.155
3.913GluVal: 3.913 ± 1.477
1.304GluTrp: 1.304 ± 0.73
1.739GluTyr: 1.739 ± 0.259
0.0GluXaa: 0.0 ± 0.0
Phe
2.609PheAla: 2.609 ± 0.683
3.913PheCys: 3.913 ± 1.477
3.913PheAsp: 3.913 ± 1.477
2.609PheGlu: 2.609 ± 0.746
4.348PhePhe: 4.348 ± 0.291
4.348PheGly: 4.348 ± 2.568
1.739PheHis: 1.739 ± 1.17
3.043PheIle: 3.043 ± 1.155
3.478PheLys: 3.478 ± 0.518
3.043PheLeu: 3.043 ± 0.275
0.435PheMet: 0.435 ± 0.243
1.304PheAsn: 1.304 ± 0.699
1.304PhePro: 1.304 ± 0.73
3.478PheGln: 3.478 ± 0.196
3.913PheArg: 3.913 ± 0.047
3.913PheSer: 3.913 ± 0.668
2.174PheThr: 2.174 ± 0.503
4.348PheVal: 4.348 ± 1.72
0.435PheTrp: 0.435 ± 0.471
0.87PheTyr: 0.87 ± 0.487
0.0PheXaa: 0.0 ± 0.0
Gly
6.522GlyAla: 6.522 ± 1.351
1.739GlyCys: 1.739 ± 0.456
7.826GlyAsp: 7.826 ± 0.621
5.217GlyGlu: 5.217 ± 2.207
1.304GlyPhe: 1.304 ± 0.699
3.043GlyGly: 3.043 ± 0.275
0.435GlyHis: 0.435 ± 0.243
3.913GlyIle: 3.913 ± 0.668
3.043GlyLys: 3.043 ± 0.275
6.522GlyLeu: 6.522 ± 0.636
3.043GlyMet: 3.043 ± 1.155
2.609GlyAsn: 2.609 ± 0.031
3.478GlyPro: 3.478 ± 0.911
2.174GlyGln: 2.174 ± 0.212
5.652GlyArg: 5.652 ± 0.306
6.522GlySer: 6.522 ± 3.495
3.913GlyThr: 3.913 ± 0.762
9.13GlyVal: 9.13 ± 1.539
0.0GlyTrp: 0.0 ± 0.0
1.304GlyTyr: 1.304 ± 0.73
0.0GlyXaa: 0.0 ± 0.0
His
0.87HisAla: 0.87 ± 0.228
0.87HisCys: 0.87 ± 0.487
2.174HisAsp: 2.174 ± 1.217
1.304HisGlu: 1.304 ± 0.016
0.0HisPhe: 0.0 ± 0.0
2.174HisGly: 2.174 ± 1.217
0.435HisHis: 0.435 ± 0.243
1.304HisIle: 1.304 ± 0.73
0.87HisLys: 0.87 ± 0.228
2.174HisLeu: 2.174 ± 0.503
0.87HisMet: 0.87 ± 0.228
0.0HisAsn: 0.0 ± 0.0
0.435HisPro: 0.435 ± 0.471
0.435HisGln: 0.435 ± 0.471
0.435HisArg: 0.435 ± 0.243
2.174HisSer: 2.174 ± 0.503
1.304HisThr: 1.304 ± 0.016
4.348HisVal: 4.348 ± 0.291
0.435HisTrp: 0.435 ± 0.243
2.174HisTyr: 2.174 ± 1.642
0.0HisXaa: 0.0 ± 0.0
Ile
1.739IleAla: 1.739 ± 0.259
0.87IleCys: 0.87 ± 0.487
4.783IleAsp: 4.783 ± 0.895
1.739IleGlu: 1.739 ± 0.456
0.435IlePhe: 0.435 ± 0.243
3.043IleGly: 3.043 ± 0.44
0.0IleHis: 0.0 ± 0.0
0.435IleIle: 0.435 ± 0.243
0.0IleLys: 0.0 ± 0.0
1.739IleLeu: 1.739 ± 0.259
1.304IleMet: 1.304 ± 0.016
2.609IleAsn: 2.609 ± 1.398
3.913IlePro: 3.913 ± 0.668
1.304IleGln: 1.304 ± 0.73
2.174IleArg: 2.174 ± 0.212
5.217IleSer: 5.217 ± 1.367
2.609IleThr: 2.609 ± 1.398
3.478IleVal: 3.478 ± 0.196
0.435IleTrp: 0.435 ± 0.471
1.304IleTyr: 1.304 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
2.609LysAla: 2.609 ± 0.031
0.435LysCys: 0.435 ± 0.243
4.783LysAsp: 4.783 ± 0.534
3.043LysGlu: 3.043 ± 0.275
2.609LysPhe: 2.609 ± 0.031
3.913LysGly: 3.913 ± 0.668
0.435LysHis: 0.435 ± 0.243
3.478LysIle: 3.478 ± 1.233
0.435LysLys: 0.435 ± 0.471
3.478LysLeu: 3.478 ± 0.196
0.435LysMet: 0.435 ± 0.243
1.304LysAsn: 1.304 ± 0.016
1.304LysPro: 1.304 ± 0.699
1.304LysGln: 1.304 ± 0.73
1.739LysArg: 1.739 ± 0.974
2.174LysSer: 2.174 ± 0.503
2.174LysThr: 2.174 ± 0.503
4.783LysVal: 4.783 ± 0.181
0.0LysTrp: 0.0 ± 0.0
2.174LysTyr: 2.174 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
8.261LeuAla: 8.261 ± 1.767
0.435LeuCys: 0.435 ± 0.243
5.217LeuAsp: 5.217 ± 2.081
2.609LeuGlu: 2.609 ± 0.746
6.087LeuPhe: 6.087 ± 0.165
4.348LeuGly: 4.348 ± 0.291
2.174LeuHis: 2.174 ± 0.212
2.609LeuIle: 2.609 ± 0.031
3.913LeuLys: 3.913 ± 0.762
6.957LeuLeu: 6.957 ± 1.037
0.435LeuMet: 0.435 ± 0.243
2.609LeuAsn: 2.609 ± 1.398
1.304LeuPro: 1.304 ± 0.73
2.609LeuGln: 2.609 ± 0.683
6.087LeuArg: 6.087 ± 1.264
10.0LeuSer: 10.0 ± 3.692
3.478LeuThr: 3.478 ± 0.911
7.826LeuVal: 7.826 ± 1.335
0.87LeuTrp: 0.87 ± 0.487
2.174LeuTyr: 2.174 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
0.87MetAla: 0.87 ± 0.487
0.435MetCys: 0.435 ± 0.243
1.304MetAsp: 1.304 ± 0.016
0.87MetGlu: 0.87 ± 0.487
0.435MetPhe: 0.435 ± 0.471
1.304MetGly: 1.304 ± 0.016
2.174MetHis: 2.174 ± 0.927
1.739MetIle: 1.739 ± 0.456
0.87MetLys: 0.87 ± 0.487
0.87MetLeu: 0.87 ± 0.487
0.435MetMet: 0.435 ± 0.243
0.87MetAsn: 0.87 ± 0.228
1.304MetPro: 1.304 ± 0.73
1.304MetGln: 1.304 ± 0.016
2.609MetArg: 2.609 ± 0.031
1.304MetSer: 1.304 ± 1.414
0.87MetThr: 0.87 ± 0.228
3.913MetVal: 3.913 ± 0.668
0.87MetTrp: 0.87 ± 0.228
2.174MetTyr: 2.174 ± 0.927
0.0MetXaa: 0.0 ± 0.0
Asn
4.348AsnAla: 4.348 ± 2.568
0.0AsnCys: 0.0 ± 0.0
0.435AsnAsp: 0.435 ± 0.471
1.739AsnGlu: 1.739 ± 0.259
2.174AsnPhe: 2.174 ± 0.927
1.304AsnGly: 1.304 ± 0.699
1.304AsnHis: 1.304 ± 0.699
1.304AsnIle: 1.304 ± 0.016
0.87AsnLys: 0.87 ± 0.943
2.174AsnLeu: 2.174 ± 0.503
1.739AsnMet: 1.739 ± 0.974
0.87AsnAsn: 0.87 ± 0.228
4.348AsnPro: 4.348 ± 1.854
2.609AsnGln: 2.609 ± 0.683
0.435AsnArg: 0.435 ± 0.243
0.87AsnSer: 0.87 ± 0.487
2.174AsnThr: 2.174 ± 0.212
3.043AsnVal: 3.043 ± 2.584
0.0AsnTrp: 0.0 ± 0.0
1.739AsnTyr: 1.739 ± 0.259
0.0AsnXaa: 0.0 ± 0.0
Pro
2.174ProAla: 2.174 ± 0.212
0.435ProCys: 0.435 ± 0.243
2.174ProAsp: 2.174 ± 0.212
0.87ProGlu: 0.87 ± 0.228
3.913ProPhe: 3.913 ± 1.477
3.913ProGly: 3.913 ± 1.382
1.739ProHis: 1.739 ± 0.974
0.87ProIle: 0.87 ± 0.943
2.174ProLys: 2.174 ± 0.927
7.391ProLeu: 7.391 ± 3.008
2.174ProMet: 2.174 ± 0.503
0.87ProAsn: 0.87 ± 0.228
2.609ProPro: 2.609 ± 0.746
0.87ProGln: 0.87 ± 0.487
2.609ProArg: 2.609 ± 0.031
6.087ProSer: 6.087 ± 2.309
1.739ProThr: 1.739 ± 0.456
3.478ProVal: 3.478 ± 1.233
0.0ProTrp: 0.0 ± 0.0
3.913ProTyr: 3.913 ± 1.382
0.0ProXaa: 0.0 ± 0.0
Gln
3.043GlnAla: 3.043 ± 0.99
1.304GlnCys: 1.304 ± 0.016
2.174GlnAsp: 2.174 ± 0.503
1.739GlnGlu: 1.739 ± 0.974
1.739GlnPhe: 1.739 ± 0.456
3.478GlnGly: 3.478 ± 1.626
1.739GlnHis: 1.739 ± 0.974
2.174GlnIle: 2.174 ± 0.503
3.478GlnLys: 3.478 ± 0.518
1.304GlnLeu: 1.304 ± 0.699
0.87GlnMet: 0.87 ± 0.228
1.304GlnAsn: 1.304 ± 0.016
1.739GlnPro: 1.739 ± 1.885
1.304GlnGln: 1.304 ± 0.016
1.304GlnArg: 1.304 ± 0.73
3.043GlnSer: 3.043 ± 0.44
0.87GlnThr: 0.87 ± 0.487
3.043GlnVal: 3.043 ± 0.99
0.435GlnTrp: 0.435 ± 0.243
1.304GlnTyr: 1.304 ± 0.73
0.0GlnXaa: 0.0 ± 0.0
Arg
3.913ArgAla: 3.913 ± 0.047
2.609ArgCys: 2.609 ± 0.746
3.043ArgAsp: 3.043 ± 0.99
0.87ArgGlu: 0.87 ± 0.487
4.783ArgPhe: 4.783 ± 0.895
7.391ArgGly: 7.391 ± 0.864
3.478ArgHis: 3.478 ± 0.518
1.304ArgIle: 1.304 ± 0.016
1.739ArgLys: 1.739 ± 0.259
3.478ArgLeu: 3.478 ± 0.196
1.739ArgMet: 1.739 ± 0.592
1.304ArgAsn: 1.304 ± 0.016
2.174ArgPro: 2.174 ± 0.503
1.739ArgGln: 1.739 ± 0.974
5.652ArgArg: 5.652 ± 2.451
2.609ArgSer: 2.609 ± 0.031
0.87ArgThr: 0.87 ± 0.487
6.087ArgVal: 6.087 ± 1.979
0.87ArgTrp: 0.87 ± 0.228
3.478ArgTyr: 3.478 ± 1.948
0.0ArgXaa: 0.0 ± 0.0
Ser
5.652SerAla: 5.652 ± 0.408
0.87SerCys: 0.87 ± 0.487
2.174SerAsp: 2.174 ± 0.212
4.348SerGlu: 4.348 ± 1.72
3.913SerPhe: 3.913 ± 0.047
5.217SerGly: 5.217 ± 2.081
1.739SerHis: 1.739 ± 0.456
2.609SerIle: 2.609 ± 0.683
2.609SerLys: 2.609 ± 0.746
8.261SerLeu: 8.261 ± 3.236
2.174SerMet: 2.174 ± 0.927
4.783SerAsn: 4.783 ± 0.181
5.217SerPro: 5.217 ± 2.796
1.739SerGln: 1.739 ± 0.456
5.217SerArg: 5.217 ± 0.063
6.957SerSer: 6.957 ± 1.108
3.913SerThr: 3.913 ± 0.047
8.261SerVal: 8.261 ± 1.807
0.87SerTrp: 0.87 ± 0.487
2.174SerTyr: 2.174 ± 2.356
0.0SerXaa: 0.0 ± 0.0
Thr
4.783ThrAla: 4.783 ± 1.249
0.435ThrCys: 0.435 ± 0.243
2.609ThrAsp: 2.609 ± 0.683
1.304ThrGlu: 1.304 ± 0.73
2.174ThrPhe: 2.174 ± 0.503
3.913ThrGly: 3.913 ± 0.047
1.304ThrHis: 1.304 ± 0.73
2.609ThrIle: 2.609 ± 0.683
1.739ThrLys: 1.739 ± 0.259
2.174ThrLeu: 2.174 ± 0.927
0.87ThrMet: 0.87 ± 0.228
2.174ThrAsn: 2.174 ± 0.927
5.217ThrPro: 5.217 ± 0.652
1.304ThrGln: 1.304 ± 0.73
0.87ThrArg: 0.87 ± 0.487
2.609ThrSer: 2.609 ± 0.683
2.609ThrThr: 2.609 ± 2.113
3.043ThrVal: 3.043 ± 0.275
1.304ThrTrp: 1.304 ± 0.016
1.304ThrTyr: 1.304 ± 0.016
0.0ThrXaa: 0.0 ± 0.0
Val
4.783ValAla: 4.783 ± 1.249
2.174ValCys: 2.174 ± 1.217
7.826ValAsp: 7.826 ± 0.094
6.087ValGlu: 6.087 ± 3.024
7.826ValPhe: 7.826 ± 0.094
5.217ValGly: 5.217 ± 1.492
1.304ValHis: 1.304 ± 0.016
2.174ValIle: 2.174 ± 0.927
3.043ValLys: 3.043 ± 0.44
7.391ValLeu: 7.391 ± 1.28
1.304ValMet: 1.304 ± 0.73
2.609ValAsn: 2.609 ± 1.398
5.652ValPro: 5.652 ± 0.408
3.913ValGln: 3.913 ± 0.762
6.087ValArg: 6.087 ± 0.55
7.826ValSer: 7.826 ± 0.621
4.783ValThr: 4.783 ± 1.249
10.435ValVal: 10.435 ± 1.304
1.739ValTrp: 1.739 ± 0.259
5.217ValTyr: 5.217 ± 0.778
0.0ValXaa: 0.0 ± 0.0
Trp
0.87TrpAla: 0.87 ± 0.228
0.0TrpCys: 0.0 ± 0.0
0.435TrpAsp: 0.435 ± 0.243
0.435TrpGlu: 0.435 ± 0.243
1.304TrpPhe: 1.304 ± 0.016
1.739TrpGly: 1.739 ± 0.974
0.435TrpHis: 0.435 ± 0.243
0.87TrpIle: 0.87 ± 0.943
0.0TrpLys: 0.0 ± 0.0
1.304TrpLeu: 1.304 ± 0.016
0.435TrpMet: 0.435 ± 0.243
0.435TrpAsn: 0.435 ± 0.243
0.435TrpPro: 0.435 ± 0.243
0.0TrpGln: 0.0 ± 0.0
1.304TrpArg: 1.304 ± 0.016
1.304TrpSer: 1.304 ± 0.016
0.435TrpThr: 0.435 ± 0.243
0.0TrpVal: 0.0 ± 0.0
0.435TrpTrp: 0.435 ± 0.243
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.609TyrAla: 2.609 ± 0.031
0.435TyrCys: 0.435 ± 0.243
3.043TyrAsp: 3.043 ± 1.155
2.609TyrGlu: 2.609 ± 1.461
0.435TyrPhe: 0.435 ± 0.471
5.652TyrGly: 5.652 ± 0.408
0.435TyrHis: 0.435 ± 0.243
0.435TyrIle: 0.435 ± 0.471
1.304TyrLys: 1.304 ± 0.73
5.217TyrLeu: 5.217 ± 1.367
2.174TyrMet: 2.174 ± 0.927
0.87TyrAsn: 0.87 ± 0.228
0.87TyrPro: 0.87 ± 0.487
0.435TyrGln: 0.435 ± 0.243
1.739TyrArg: 1.739 ± 0.259
1.304TyrSer: 1.304 ± 1.414
1.739TyrThr: 1.739 ± 0.259
3.913TyrVal: 3.913 ± 1.477
1.304TyrTrp: 1.304 ± 0.016
1.739TyrTyr: 1.739 ± 0.974
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2301 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski