Amino acid dipepetide frequency for Marine RNA virus PAL_E4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.441AlaAla: 5.441 ± 0.016
2.332AlaCys: 2.332 ± 0.087
4.664AlaAsp: 4.664 ± 1.14
3.498AlaGlu: 3.498 ± 1.184
3.887AlaPhe: 3.887 ± 0.364
6.607AlaGly: 6.607 ± 2.0
2.332AlaHis: 2.332 ± 0.744
2.332AlaIle: 2.332 ± 0.087
2.721AlaLys: 2.721 ± 1.651
4.664AlaLeu: 4.664 ± 0.831
0.777AlaMet: 0.777 ± 0.467
2.721AlaAsn: 2.721 ± 0.994
3.887AlaPro: 3.887 ± 1.022
1.166AlaGln: 1.166 ± 0.044
5.052AlaArg: 5.052 ± 0.408
3.887AlaSer: 3.887 ± 1.608
4.275AlaThr: 4.275 ± 1.374
9.328AlaVal: 9.328 ± 1.663
0.0AlaTrp: 0.0 ± 0.0
2.332AlaTyr: 2.332 ± 1.227
0.0AlaXaa: 0.0 ± 0.0
Cys
1.555CysAla: 1.555 ± 0.277
0.389CysCys: 0.389 ± 0.234
0.389CysAsp: 0.389 ± 0.424
1.166CysGlu: 1.166 ± 0.044
1.555CysPhe: 1.555 ± 1.037
2.332CysGly: 2.332 ± 1.402
0.389CysHis: 0.389 ± 0.234
0.389CysIle: 0.389 ± 0.234
0.0CysLys: 0.0 ± 0.0
1.943CysLeu: 1.943 ± 0.511
0.389CysMet: 0.389 ± 0.234
1.555CysAsn: 1.555 ± 0.277
0.389CysPro: 0.389 ± 0.234
0.389CysGln: 0.389 ± 0.424
0.0CysArg: 0.0 ± 0.0
0.389CysSer: 0.389 ± 0.234
0.777CysThr: 0.777 ± 0.467
0.777CysVal: 0.777 ± 0.19
0.0CysTrp: 0.0 ± 0.0
1.166CysTyr: 1.166 ± 0.044
0.0CysXaa: 0.0 ± 0.0
Asp
5.052AspAla: 5.052 ± 0.408
0.777AspCys: 0.777 ± 0.467
3.887AspAsp: 3.887 ± 0.364
4.275AspGlu: 4.275 ± 0.059
3.498AspPhe: 3.498 ± 0.131
3.887AspGly: 3.887 ± 0.293
0.389AspHis: 0.389 ± 0.424
3.498AspIle: 3.498 ± 0.131
2.332AspLys: 2.332 ± 1.402
5.441AspLeu: 5.441 ± 0.673
2.332AspMet: 2.332 ± 0.087
0.777AspAsn: 0.777 ± 0.19
4.664AspPro: 4.664 ± 1.798
0.777AspGln: 0.777 ± 0.19
2.721AspArg: 2.721 ± 0.321
3.498AspSer: 3.498 ± 0.131
4.275AspThr: 4.275 ± 0.059
3.887AspVal: 3.887 ± 0.364
0.777AspTrp: 0.777 ± 0.467
3.109AspTyr: 3.109 ± 0.103
0.0AspXaa: 0.0 ± 0.0
Glu
1.555GluAla: 1.555 ± 0.277
0.389GluCys: 0.389 ± 0.424
2.721GluAsp: 2.721 ± 0.337
3.109GluGlu: 3.109 ± 0.103
2.332GluPhe: 2.332 ± 0.087
1.555GluGly: 1.555 ± 0.277
1.943GluHis: 1.943 ± 0.511
1.555GluIle: 1.555 ± 0.38
0.389GluLys: 0.389 ± 0.234
5.052GluLeu: 5.052 ± 1.722
1.943GluMet: 1.943 ± 1.461
2.332GluAsn: 2.332 ± 1.227
2.332GluPro: 2.332 ± 0.57
0.777GluGln: 0.777 ± 0.847
1.555GluArg: 1.555 ± 0.277
3.887GluSer: 3.887 ± 1.022
1.943GluThr: 1.943 ± 0.146
3.887GluVal: 3.887 ± 0.364
0.777GluTrp: 0.777 ± 0.19
3.109GluTyr: 3.109 ± 1.417
0.0GluXaa: 0.0 ± 0.0
Phe
4.664PheAla: 4.664 ± 0.174
0.777PheCys: 0.777 ± 0.467
3.109PheAsp: 3.109 ± 0.554
0.777PheGlu: 0.777 ± 0.847
3.498PhePhe: 3.498 ± 0.131
4.664PheGly: 4.664 ± 0.174
1.166PheHis: 1.166 ± 0.614
2.332PheIle: 2.332 ± 0.087
0.777PheLys: 0.777 ± 0.467
4.664PheLeu: 4.664 ± 0.174
1.943PheMet: 1.943 ± 1.461
1.166PheAsn: 1.166 ± 0.614
1.555PhePro: 1.555 ± 0.277
1.555PheGln: 1.555 ± 0.38
3.887PheArg: 3.887 ± 0.293
5.83PheSer: 5.83 ± 0.439
3.887PheThr: 3.887 ± 0.364
3.109PheVal: 3.109 ± 0.103
0.777PheTrp: 0.777 ± 0.467
2.721PheTyr: 2.721 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
6.607GlyAla: 6.607 ± 0.028
1.555GlyCys: 1.555 ± 0.277
5.441GlyAsp: 5.441 ± 0.641
3.109GlyGlu: 3.109 ± 0.76
2.721GlyPhe: 2.721 ± 0.337
5.441GlyGly: 5.441 ± 1.988
1.555GlyHis: 1.555 ± 0.277
3.498GlyIle: 3.498 ± 1.184
3.498GlyLys: 3.498 ± 2.102
8.939GlyLeu: 8.939 ± 1.429
2.721GlyMet: 2.721 ± 0.337
3.498GlyAsn: 3.498 ± 0.131
1.943GlyPro: 1.943 ± 0.511
1.555GlyGln: 1.555 ± 0.38
3.498GlyArg: 3.498 ± 0.527
5.441GlySer: 5.441 ± 0.641
3.498GlyThr: 3.498 ± 0.131
6.996GlyVal: 6.996 ± 1.576
1.166GlyTrp: 1.166 ± 0.614
2.332GlyTyr: 2.332 ± 0.087
0.0GlyXaa: 0.0 ± 0.0
His
1.166HisAla: 1.166 ± 0.701
0.0HisCys: 0.0 ± 0.0
0.389HisAsp: 0.389 ± 0.234
1.166HisGlu: 1.166 ± 0.044
1.166HisPhe: 1.166 ± 0.044
0.777HisGly: 0.777 ± 0.467
0.389HisHis: 0.389 ± 0.234
1.555HisIle: 1.555 ± 0.277
0.389HisLys: 0.389 ± 0.234
3.498HisLeu: 3.498 ± 0.788
1.166HisMet: 1.166 ± 0.044
1.166HisAsn: 1.166 ± 0.044
0.777HisPro: 0.777 ± 0.19
0.389HisGln: 0.389 ± 0.234
1.943HisArg: 1.943 ± 0.146
0.389HisSer: 0.389 ± 0.234
1.555HisThr: 1.555 ± 0.277
1.166HisVal: 1.166 ± 0.044
0.777HisTrp: 0.777 ± 0.19
0.389HisTyr: 0.389 ± 0.234
0.0HisXaa: 0.0 ± 0.0
Ile
4.275IleAla: 4.275 ± 1.374
0.777IleCys: 0.777 ± 0.467
3.498IleAsp: 3.498 ± 1.184
1.943IleGlu: 1.943 ± 0.804
0.777IlePhe: 0.777 ± 0.467
2.721IleGly: 2.721 ± 0.321
1.555IleHis: 1.555 ± 0.934
2.332IleIle: 2.332 ± 0.087
2.721IleLys: 2.721 ± 0.321
3.498IleLeu: 3.498 ± 1.184
1.943IleMet: 1.943 ± 1.168
3.498IleAsn: 3.498 ± 1.841
2.721IlePro: 2.721 ± 0.994
0.777IleGln: 0.777 ± 0.19
4.275IleArg: 4.275 ± 1.374
3.887IleSer: 3.887 ± 1.022
5.83IleThr: 5.83 ± 1.754
5.441IleVal: 5.441 ± 1.956
0.389IleTrp: 0.389 ± 0.234
1.555IleTyr: 1.555 ± 0.934
0.0IleXaa: 0.0 ± 0.0
Lys
1.943LysAla: 1.943 ± 0.511
0.0LysCys: 0.0 ± 0.0
2.332LysAsp: 2.332 ± 0.744
2.332LysGlu: 2.332 ± 0.744
1.943LysPhe: 1.943 ± 0.511
2.721LysGly: 2.721 ± 1.635
1.555LysHis: 1.555 ± 0.277
0.777LysIle: 0.777 ± 0.467
3.887LysLys: 3.887 ± 1.679
5.441LysLeu: 5.441 ± 0.673
0.777LysMet: 0.777 ± 0.19
2.332LysAsn: 2.332 ± 1.227
1.166LysPro: 1.166 ± 1.271
1.555LysGln: 1.555 ± 0.277
3.109LysArg: 3.109 ± 0.103
3.498LysSer: 3.498 ± 2.102
1.943LysThr: 1.943 ± 1.168
3.109LysVal: 3.109 ± 1.212
0.0LysTrp: 0.0 ± 0.0
0.777LysTyr: 0.777 ± 0.467
0.0LysXaa: 0.0 ± 0.0
Leu
5.83LeuAla: 5.83 ± 0.439
1.943LeuCys: 1.943 ± 0.804
3.887LeuAsp: 3.887 ± 1.679
1.943LeuGlu: 1.943 ± 0.804
3.109LeuPhe: 3.109 ± 0.103
6.218LeuGly: 6.218 ± 1.109
0.777LeuHis: 0.777 ± 0.467
6.607LeuIle: 6.607 ± 1.342
3.109LeuLys: 3.109 ± 0.554
5.83LeuLeu: 5.83 ± 2.19
1.943LeuMet: 1.943 ± 1.168
3.887LeuAsn: 3.887 ± 0.364
6.218LeuPro: 6.218 ± 1.52
1.166LeuGln: 1.166 ± 0.044
4.275LeuArg: 4.275 ± 1.255
7.384LeuSer: 7.384 ± 0.162
8.939LeuThr: 8.939 ± 0.115
6.996LeuVal: 6.996 ± 2.233
0.389LeuTrp: 0.389 ± 0.234
2.721LeuTyr: 2.721 ± 0.321
0.0LeuXaa: 0.0 ± 0.0
Met
2.721MetAla: 2.721 ± 0.994
1.166MetCys: 1.166 ± 0.614
3.109MetAsp: 3.109 ± 0.103
1.943MetGlu: 1.943 ± 0.804
0.777MetPhe: 0.777 ± 0.467
2.721MetGly: 2.721 ± 0.994
0.0MetHis: 0.0 ± 0.0
1.166MetIle: 1.166 ± 0.701
1.943MetLys: 1.943 ± 0.511
1.943MetLeu: 1.943 ± 0.146
0.777MetMet: 0.777 ± 0.19
0.389MetAsn: 0.389 ± 0.234
2.332MetPro: 2.332 ± 0.087
0.0MetGln: 0.0 ± 0.0
1.943MetArg: 1.943 ± 0.511
2.721MetSer: 2.721 ± 0.337
1.943MetThr: 1.943 ± 1.168
3.109MetVal: 3.109 ± 1.212
0.777MetTrp: 0.777 ± 0.467
1.943MetTyr: 1.943 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
1.555AsnAla: 1.555 ± 1.037
0.389AsnCys: 0.389 ± 0.234
1.555AsnAsp: 1.555 ± 0.38
0.389AsnGlu: 0.389 ± 0.234
3.887AsnPhe: 3.887 ± 0.293
4.275AsnGly: 4.275 ± 0.717
1.555AsnHis: 1.555 ± 0.934
1.166AsnIle: 1.166 ± 1.271
0.777AsnLys: 0.777 ± 0.19
3.498AsnLeu: 3.498 ± 0.527
2.721AsnMet: 2.721 ± 0.978
2.721AsnAsn: 2.721 ± 0.321
8.162AsnPro: 8.162 ± 1.667
0.777AsnGln: 0.777 ± 0.19
2.332AsnArg: 2.332 ± 1.885
3.498AsnSer: 3.498 ± 0.131
3.109AsnThr: 3.109 ± 2.732
3.109AsnVal: 3.109 ± 0.103
0.0AsnTrp: 0.0 ± 0.0
1.943AsnTyr: 1.943 ± 0.146
0.0AsnXaa: 0.0 ± 0.0
Pro
5.441ProAla: 5.441 ± 3.959
0.777ProCys: 0.777 ± 0.467
2.332ProAsp: 2.332 ± 0.57
1.555ProGlu: 1.555 ± 0.38
4.664ProPhe: 4.664 ± 0.483
3.109ProGly: 3.109 ± 0.76
1.166ProHis: 1.166 ± 0.614
5.052ProIle: 5.052 ± 2.879
0.777ProLys: 0.777 ± 0.467
3.887ProLeu: 3.887 ± 0.364
1.166ProMet: 1.166 ± 0.701
2.721ProAsn: 2.721 ± 0.337
2.721ProPro: 2.721 ± 0.337
2.332ProGln: 2.332 ± 0.57
2.332ProArg: 2.332 ± 0.744
4.275ProSer: 4.275 ± 0.059
3.498ProThr: 3.498 ± 1.184
6.218ProVal: 6.218 ± 1.52
0.777ProTrp: 0.777 ± 0.19
1.943ProTyr: 1.943 ± 0.511
0.0ProXaa: 0.0 ± 0.0
Gln
0.777GlnAla: 0.777 ± 0.19
0.777GlnCys: 0.777 ± 0.19
1.166GlnAsp: 1.166 ± 0.044
1.166GlnGlu: 1.166 ± 0.614
0.777GlnPhe: 0.777 ± 0.19
1.166GlnGly: 1.166 ± 0.044
0.389GlnHis: 0.389 ± 0.234
1.555GlnIle: 1.555 ± 0.277
2.332GlnLys: 2.332 ± 0.744
1.943GlnLeu: 1.943 ± 0.146
1.166GlnMet: 1.166 ± 0.044
1.166GlnAsn: 1.166 ± 0.614
0.777GlnPro: 0.777 ± 0.847
0.389GlnGln: 0.389 ± 0.234
0.777GlnArg: 0.777 ± 0.467
2.721GlnSer: 2.721 ± 0.994
1.943GlnThr: 1.943 ± 0.511
2.332GlnVal: 2.332 ± 1.227
0.0GlnTrp: 0.0 ± 0.0
0.389GlnTyr: 0.389 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
4.275ArgAla: 4.275 ± 1.912
0.0ArgCys: 0.0 ± 0.0
2.721ArgAsp: 2.721 ± 0.978
2.332ArgGlu: 2.332 ± 0.087
3.887ArgPhe: 3.887 ± 0.95
3.498ArgGly: 3.498 ± 1.841
0.389ArgHis: 0.389 ± 0.234
2.721ArgIle: 2.721 ± 0.994
1.943ArgLys: 1.943 ± 0.511
3.887ArgLeu: 3.887 ± 1.022
2.332ArgMet: 2.332 ± 0.087
1.166ArgAsn: 1.166 ± 0.614
2.721ArgPro: 2.721 ± 0.337
1.166ArgGln: 1.166 ± 0.614
2.721ArgArg: 2.721 ± 0.321
5.052ArgSer: 5.052 ± 0.907
1.943ArgThr: 1.943 ± 0.511
3.498ArgVal: 3.498 ± 0.527
0.777ArgTrp: 0.777 ± 0.19
1.943ArgTyr: 1.943 ± 0.511
0.0ArgXaa: 0.0 ± 0.0
Ser
6.996SerAla: 6.996 ± 0.396
1.166SerCys: 1.166 ± 0.701
4.275SerAsp: 4.275 ± 0.717
3.887SerGlu: 3.887 ± 1.022
5.052SerPhe: 5.052 ± 0.408
6.218SerGly: 6.218 ± 0.206
1.943SerHis: 1.943 ± 0.511
5.441SerIle: 5.441 ± 0.673
1.943SerLys: 1.943 ± 0.146
8.162SerLeu: 8.162 ± 1.619
1.943SerMet: 1.943 ± 0.804
3.887SerAsn: 3.887 ± 0.364
3.498SerPro: 3.498 ± 1.184
1.943SerGln: 1.943 ± 0.146
2.721SerArg: 2.721 ± 0.337
5.052SerSer: 5.052 ± 0.907
4.275SerThr: 4.275 ± 0.059
5.441SerVal: 5.441 ± 0.641
1.166SerTrp: 1.166 ± 0.701
3.109SerTyr: 3.109 ± 1.417
0.0SerXaa: 0.0 ± 0.0
Thr
1.943ThrAla: 1.943 ± 0.146
1.166ThrCys: 1.166 ± 0.701
6.218ThrAsp: 6.218 ± 0.451
2.332ThrGlu: 2.332 ± 0.57
2.721ThrPhe: 2.721 ± 0.994
4.275ThrGly: 4.275 ± 0.059
0.389ThrHis: 0.389 ± 0.234
4.275ThrIle: 4.275 ± 1.374
3.109ThrLys: 3.109 ± 1.212
4.275ThrLeu: 4.275 ± 0.598
1.555ThrMet: 1.555 ± 0.144
5.052ThrAsn: 5.052 ± 1.564
5.441ThrPro: 5.441 ± 1.33
3.109ThrGln: 3.109 ± 0.103
1.166ThrArg: 1.166 ± 0.044
5.83ThrSer: 5.83 ± 0.439
4.275ThrThr: 4.275 ± 0.717
5.052ThrVal: 5.052 ± 0.249
0.777ThrTrp: 0.777 ± 0.19
1.166ThrTyr: 1.166 ± 0.044
0.0ThrXaa: 0.0 ± 0.0
Val
6.996ValAla: 6.996 ± 0.919
0.777ValCys: 0.777 ± 0.467
5.83ValAsp: 5.83 ± 0.439
4.664ValGlu: 4.664 ± 2.146
2.332ValPhe: 2.332 ± 0.57
7.384ValGly: 7.384 ± 1.152
1.166ValHis: 1.166 ± 0.614
4.664ValIle: 4.664 ± 1.489
3.887ValLys: 3.887 ± 0.364
4.664ValLeu: 4.664 ± 2.146
3.887ValMet: 3.887 ± 0.277
4.275ValAsn: 4.275 ± 0.598
4.275ValPro: 4.275 ± 0.717
2.721ValGln: 2.721 ± 0.978
3.109ValArg: 3.109 ± 0.103
7.773ValSer: 7.773 ± 0.071
3.498ValThr: 3.498 ± 0.527
4.275ValVal: 4.275 ± 0.059
1.166ValTrp: 1.166 ± 0.044
3.498ValTyr: 3.498 ± 0.788
0.0ValXaa: 0.0 ± 0.0
Trp
0.389TrpAla: 0.389 ± 0.234
0.0TrpCys: 0.0 ± 0.0
1.166TrpAsp: 1.166 ± 0.614
0.389TrpGlu: 0.389 ± 0.234
0.389TrpPhe: 0.389 ± 0.234
1.943TrpGly: 1.943 ± 0.804
0.0TrpHis: 0.0 ± 0.0
1.555TrpIle: 1.555 ± 0.934
1.166TrpLys: 1.166 ± 0.044
0.777TrpLeu: 0.777 ± 0.467
0.389TrpMet: 0.389 ± 0.234
0.777TrpAsn: 0.777 ± 0.467
0.389TrpPro: 0.389 ± 0.234
0.389TrpGln: 0.389 ± 0.424
0.0TrpArg: 0.0 ± 0.0
1.166TrpSer: 1.166 ± 0.614
0.0TrpThr: 0.0 ± 0.0
0.389TrpVal: 0.389 ± 0.234
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.721TyrAla: 2.721 ± 1.635
0.777TyrCys: 0.777 ± 0.19
1.166TyrAsp: 1.166 ± 0.701
1.166TyrGlu: 1.166 ± 0.044
3.887TyrPhe: 3.887 ± 1.022
3.498TyrGly: 3.498 ± 0.788
1.166TyrHis: 1.166 ± 0.044
1.943TyrIle: 1.943 ± 0.146
3.498TyrLys: 3.498 ± 0.527
1.166TyrLeu: 1.166 ± 1.271
1.166TyrMet: 1.166 ± 0.044
2.332TyrAsn: 2.332 ± 1.885
0.777TyrPro: 0.777 ± 0.19
0.777TyrGln: 0.777 ± 0.467
1.555TyrArg: 1.555 ± 0.38
1.943TyrSer: 1.943 ± 0.146
3.109TyrThr: 3.109 ± 0.76
2.721TyrVal: 2.721 ± 0.978
0.777TyrTrp: 0.777 ± 0.19
1.555TyrTyr: 1.555 ± 1.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2574 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski