Amino acid dipepetide frequency for Otarine picobirnavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.098AlaAla: 7.098 ± 0.255
0.0AlaCys: 0.0 ± 0.0
2.366AlaAsp: 2.366 ± 0.793
3.155AlaGlu: 3.155 ± 1.978
1.577AlaPhe: 1.577 ± 1.189
2.366AlaGly: 2.366 ± 0.793
1.577AlaHis: 1.577 ± 1.189
2.366AlaIle: 2.366 ± 0.774
5.521AlaLys: 5.521 ± 2.567
4.732AlaLeu: 4.732 ± 1.586
3.155AlaMet: 3.155 ± 1.353
7.098AlaAsn: 7.098 ± 2.112
3.155AlaPro: 3.155 ± 1.353
3.155AlaGln: 3.155 ± 2.158
2.366AlaArg: 2.366 ± 1.783
5.521AlaSer: 5.521 ± 3.072
3.155AlaThr: 3.155 ± 1.672
4.732AlaVal: 4.732 ± 1.094
0.789AlaTrp: 0.789 ± 0.586
3.155AlaTyr: 3.155 ± 1.353
0.0AlaXaa: 0.0 ± 0.0
Cys
1.577CysAla: 1.577 ± 0.365
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.789CysGlu: 0.789 ± 0.586
0.789CysPhe: 0.789 ± 0.594
0.789CysGly: 0.789 ± 0.586
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.789CysLeu: 0.789 ± 0.586
0.789CysMet: 0.789 ± 0.586
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.577CysGln: 1.577 ± 0.365
0.789CysArg: 0.789 ± 0.586
0.789CysSer: 0.789 ± 0.594
1.577CysThr: 1.577 ± 1.172
0.789CysVal: 0.789 ± 0.586
0.0CysTrp: 0.0 ± 0.0
0.789CysTyr: 0.789 ± 0.594
0.0CysXaa: 0.0 ± 0.0
Asp
3.943AspAla: 3.943 ± 1.934
0.789AspCys: 0.789 ± 0.586
6.309AspAsp: 6.309 ± 2.648
1.577AspGlu: 1.577 ± 0.365
1.577AspPhe: 1.577 ± 1.172
2.366AspGly: 2.366 ± 0.774
3.155AspHis: 3.155 ± 0.729
5.521AspIle: 5.521 ± 1.382
1.577AspLys: 1.577 ± 0.365
6.309AspLeu: 6.309 ± 1.819
2.366AspMet: 2.366 ± 0.774
0.789AspAsn: 0.789 ± 0.594
3.155AspPro: 3.155 ± 1.353
4.732AspGln: 4.732 ± 1.549
3.155AspArg: 3.155 ± 1.324
4.732AspSer: 4.732 ± 1.094
1.577AspThr: 1.577 ± 0.365
3.943AspVal: 3.943 ± 1.385
1.577AspTrp: 1.577 ± 0.365
3.155AspTyr: 3.155 ± 0.729
0.0AspXaa: 0.0 ± 0.0
Glu
2.366GluAla: 2.366 ± 1.758
0.0GluCys: 0.0 ± 0.0
4.732GluAsp: 4.732 ± 1.094
2.366GluGlu: 2.366 ± 1.783
2.366GluPhe: 2.366 ± 0.774
0.789GluGly: 0.789 ± 0.586
0.0GluHis: 0.0 ± 0.0
2.366GluIle: 2.366 ± 2.321
2.366GluLys: 2.366 ± 0.899
3.155GluLeu: 3.155 ± 2.158
2.366GluMet: 2.366 ± 0.899
4.732GluAsn: 4.732 ± 0.525
1.577GluPro: 1.577 ± 1.189
3.943GluGln: 3.943 ± 4.973
3.155GluArg: 3.155 ± 3.724
6.309GluSer: 6.309 ± 2.831
2.366GluThr: 2.366 ± 0.774
2.366GluVal: 2.366 ± 0.793
0.789GluTrp: 0.789 ± 0.586
0.789GluTyr: 0.789 ± 0.586
0.0GluXaa: 0.0 ± 0.0
Phe
3.155PheAla: 3.155 ± 2.344
0.789PheCys: 0.789 ± 0.586
3.943PheAsp: 3.943 ± 1.059
0.789PheGlu: 0.789 ± 1.26
0.789PhePhe: 0.789 ± 1.26
1.577PheGly: 1.577 ± 1.295
0.0PheHis: 0.0 ± 0.0
1.577PheIle: 1.577 ± 0.365
1.577PheLys: 1.577 ± 1.189
1.577PheLeu: 1.577 ± 1.172
1.577PheMet: 1.577 ± 1.189
2.366PheAsn: 2.366 ± 0.899
1.577PhePro: 1.577 ± 1.172
2.366PheGln: 2.366 ± 0.899
2.366PheArg: 2.366 ± 1.298
3.155PheSer: 3.155 ± 0.729
1.577PheThr: 1.577 ± 1.295
1.577PheVal: 1.577 ± 1.189
0.0PheTrp: 0.0 ± 0.0
0.789PheTyr: 0.789 ± 0.586
0.0PheXaa: 0.0 ± 0.0
Gly
2.366GlyAla: 2.366 ± 0.793
0.789GlyCys: 0.789 ± 0.594
3.155GlyAsp: 3.155 ± 1.324
1.577GlyGlu: 1.577 ± 1.189
1.577GlyPhe: 1.577 ± 1.295
3.943GlyGly: 3.943 ± 0.543
0.789GlyHis: 0.789 ± 0.594
4.732GlyIle: 4.732 ± 1.094
2.366GlyLys: 2.366 ± 0.774
4.732GlyLeu: 4.732 ± 1.798
2.366GlyMet: 2.366 ± 0.682
2.366GlyAsn: 2.366 ± 1.298
2.366GlyPro: 2.366 ± 1.758
2.366GlyGln: 2.366 ± 1.758
1.577GlyArg: 1.577 ± 0.365
7.886GlySer: 7.886 ± 1.823
5.521GlyThr: 5.521 ± 2.098
6.309GlyVal: 6.309 ± 0.414
0.789GlyTrp: 0.789 ± 0.586
3.943GlyTyr: 3.943 ± 2.148
0.0GlyXaa: 0.0 ± 0.0
His
1.577HisAla: 1.577 ± 1.189
0.789HisCys: 0.789 ± 0.594
1.577HisAsp: 1.577 ± 1.295
0.789HisGlu: 0.789 ± 1.26
0.789HisPhe: 0.789 ± 0.586
1.577HisGly: 1.577 ± 1.172
1.577HisHis: 1.577 ± 0.365
0.0HisIle: 0.0 ± 0.0
0.789HisLys: 0.789 ± 0.594
0.789HisLeu: 0.789 ± 0.586
0.789HisMet: 0.789 ± 0.594
3.943HisAsn: 3.943 ± 4.809
2.366HisPro: 2.366 ± 0.793
0.0HisGln: 0.0 ± 0.0
2.366HisArg: 2.366 ± 0.899
0.0HisSer: 0.0 ± 0.0
2.366HisThr: 2.366 ± 0.793
1.577HisVal: 1.577 ± 0.365
0.0HisTrp: 0.0 ± 0.0
1.577HisTyr: 1.577 ± 0.365
0.0HisXaa: 0.0 ± 0.0
Ile
5.521IleAla: 5.521 ± 3.111
0.789IleCys: 0.789 ± 0.586
6.309IleAsp: 6.309 ± 1.819
3.943IleGlu: 3.943 ± 1.385
0.789IlePhe: 0.789 ± 0.594
1.577IleGly: 1.577 ± 1.133
0.789IleHis: 0.789 ± 0.586
0.789IleIle: 0.789 ± 0.594
0.0IleLys: 0.0 ± 0.0
3.943IleLeu: 3.943 ± 1.896
1.577IleMet: 1.577 ± 1.189
3.155IleAsn: 3.155 ± 1.672
2.366IlePro: 2.366 ± 0.774
0.789IleGln: 0.789 ± 0.586
3.943IleArg: 3.943 ± 1.385
0.0IleSer: 0.0 ± 0.0
1.577IleThr: 1.577 ± 0.365
2.366IleVal: 2.366 ± 0.774
0.789IleTrp: 0.789 ± 0.594
3.155IleTyr: 3.155 ± 2.377
0.0IleXaa: 0.0 ± 0.0
Lys
2.366LysAla: 2.366 ± 0.899
0.0LysCys: 0.0 ± 0.0
4.732LysAsp: 4.732 ± 2.521
4.732LysGlu: 4.732 ± 2.293
1.577LysPhe: 1.577 ± 1.172
0.789LysGly: 0.789 ± 0.586
0.789LysHis: 0.789 ± 0.586
1.577LysIle: 1.577 ± 0.365
3.155LysLys: 3.155 ± 1.978
3.943LysLeu: 3.943 ± 1.96
2.366LysMet: 2.366 ± 0.774
2.366LysAsn: 2.366 ± 0.774
1.577LysPro: 1.577 ± 1.172
2.366LysGln: 2.366 ± 1.758
3.943LysArg: 3.943 ± 1.385
5.521LysSer: 5.521 ± 1.285
1.577LysThr: 1.577 ± 0.365
3.155LysVal: 3.155 ± 0.729
2.366LysTrp: 2.366 ± 0.793
2.366LysTyr: 2.366 ± 0.774
0.0LysXaa: 0.0 ± 0.0
Leu
3.155LeuAla: 3.155 ± 2.158
0.789LeuCys: 0.789 ± 0.594
1.577LeuAsp: 1.577 ± 1.172
4.732LeuGlu: 4.732 ± 0.525
0.789LeuPhe: 0.789 ± 1.26
11.041LeuGly: 11.041 ± 3.237
0.789LeuHis: 0.789 ± 0.594
1.577LeuIle: 1.577 ± 0.365
3.943LeuLys: 3.943 ± 0.543
3.943LeuLeu: 3.943 ± 1.092
4.732LeuMet: 4.732 ± 1.094
6.309LeuAsn: 6.309 ± 2.658
3.155LeuPro: 3.155 ± 0.729
3.155LeuGln: 3.155 ± 0.729
7.098LeuArg: 7.098 ± 2.352
6.309LeuSer: 6.309 ± 1.536
5.521LeuThr: 5.521 ± 3.055
5.521LeuVal: 5.521 ± 2.089
0.0LeuTrp: 0.0 ± 0.0
3.155LeuTyr: 3.155 ± 2.377
0.0LeuXaa: 0.0 ± 0.0
Met
4.732MetAla: 4.732 ± 1.563
0.0MetCys: 0.0 ± 0.0
2.366MetAsp: 2.366 ± 0.793
0.0MetGlu: 0.0 ± 0.0
3.155MetPhe: 3.155 ± 0.729
2.366MetGly: 2.366 ± 0.774
0.789MetHis: 0.789 ± 0.594
2.366MetIle: 2.366 ± 0.793
4.732MetLys: 4.732 ± 1.094
2.366MetLeu: 2.366 ± 0.793
0.0MetMet: 0.0 ± 0.0
1.577MetAsn: 1.577 ± 0.365
2.366MetPro: 2.366 ± 0.774
2.366MetGln: 2.366 ± 0.774
0.789MetArg: 0.789 ± 1.26
5.521MetSer: 5.521 ± 3.111
3.155MetThr: 3.155 ± 0.804
1.577MetVal: 1.577 ± 0.365
0.789MetTrp: 0.789 ± 0.594
1.577MetTyr: 1.577 ± 0.365
0.0MetXaa: 0.0 ± 0.0
Asn
3.155AsnAla: 3.155 ± 1.353
1.577AsnCys: 1.577 ± 0.365
1.577AsnAsp: 1.577 ± 0.365
3.943AsnGlu: 3.943 ± 3.429
2.366AsnPhe: 2.366 ± 2.321
1.577AsnGly: 1.577 ± 0.365
0.789AsnHis: 0.789 ± 0.594
5.521AsnIle: 5.521 ± 1.406
3.155AsnLys: 3.155 ± 2.266
6.309AsnLeu: 6.309 ± 3.641
1.577AsnMet: 1.577 ± 1.133
4.732AsnAsn: 4.732 ± 3.566
5.521AsnPro: 5.521 ± 1.382
3.155AsnGln: 3.155 ± 2.589
5.521AsnArg: 5.521 ± 2.934
4.732AsnSer: 4.732 ± 1.563
4.732AsnThr: 4.732 ± 0.525
5.521AsnVal: 5.521 ± 2.079
0.0AsnTrp: 0.0 ± 0.0
1.577AsnTyr: 1.577 ± 0.365
0.0AsnXaa: 0.0 ± 0.0
Pro
3.155ProAla: 3.155 ± 1.353
0.789ProCys: 0.789 ± 0.586
5.521ProAsp: 5.521 ± 1.382
2.366ProGlu: 2.366 ± 0.793
2.366ProPhe: 2.366 ± 1.758
3.943ProGly: 3.943 ± 1.082
0.0ProHis: 0.0 ± 0.0
0.789ProIle: 0.789 ± 0.586
2.366ProLys: 2.366 ± 0.774
3.943ProLeu: 3.943 ± 1.896
0.789ProMet: 0.789 ± 0.586
3.155ProAsn: 3.155 ± 0.729
0.0ProPro: 0.0 ± 0.0
2.366ProGln: 2.366 ± 0.793
0.789ProArg: 0.789 ± 0.586
3.155ProSer: 3.155 ± 0.729
5.521ProThr: 5.521 ± 3.111
3.155ProVal: 3.155 ± 1.353
0.0ProTrp: 0.0 ± 0.0
1.577ProTyr: 1.577 ± 0.365
0.0ProXaa: 0.0 ± 0.0
Gln
1.577GlnAla: 1.577 ± 1.133
1.577GlnCys: 1.577 ± 1.172
1.577GlnAsp: 1.577 ± 0.365
4.732GlnGlu: 4.732 ± 3.131
3.155GlnPhe: 3.155 ± 2.158
1.577GlnGly: 1.577 ± 0.365
3.155GlnHis: 3.155 ± 2.158
2.366GlnIle: 2.366 ± 1.298
3.943GlnLys: 3.943 ± 1.896
1.577GlnLeu: 1.577 ± 1.295
1.577GlnMet: 1.577 ± 0.693
2.366GlnAsn: 2.366 ± 2.321
1.577GlnPro: 1.577 ± 1.172
2.366GlnGln: 2.366 ± 2.487
3.155GlnArg: 3.155 ± 1.01
1.577GlnSer: 1.577 ± 0.365
3.155GlnThr: 3.155 ± 0.804
3.943GlnVal: 3.943 ± 1.896
0.0GlnTrp: 0.0 ± 0.0
1.577GlnTyr: 1.577 ± 1.172
0.0GlnXaa: 0.0 ± 0.0
Arg
4.732ArgAla: 4.732 ± 1.872
0.789ArgCys: 0.789 ± 0.586
2.366ArgAsp: 2.366 ± 1.758
3.155ArgGlu: 3.155 ± 2.158
1.577ArgPhe: 1.577 ± 0.365
3.155ArgGly: 3.155 ± 0.729
6.309ArgHis: 6.309 ± 5.745
2.366ArgIle: 2.366 ± 0.793
0.0ArgLys: 0.0 ± 0.0
3.943ArgLeu: 3.943 ± 1.059
0.789ArgMet: 0.789 ± 0.854
3.943ArgAsn: 3.943 ± 1.934
1.577ArgPro: 1.577 ± 1.189
2.366ArgGln: 2.366 ± 0.774
4.732ArgArg: 4.732 ± 2.293
3.943ArgSer: 3.943 ± 3.418
4.732ArgThr: 4.732 ± 0.793
3.155ArgVal: 3.155 ± 2.344
1.577ArgTrp: 1.577 ± 1.172
1.577ArgTyr: 1.577 ± 1.189
0.0ArgXaa: 0.0 ± 0.0
Ser
4.732SerAla: 4.732 ± 2.671
0.0SerCys: 0.0 ± 0.0
3.155SerAsp: 3.155 ± 1.353
3.943SerGlu: 3.943 ± 2.147
2.366SerPhe: 2.366 ± 0.774
7.098SerGly: 7.098 ± 1.747
1.577SerHis: 1.577 ± 1.172
2.366SerIle: 2.366 ± 0.793
5.521SerLys: 5.521 ± 2.089
5.521SerLeu: 5.521 ± 2.079
4.732SerMet: 4.732 ± 1.586
7.886SerAsn: 7.886 ± 6.858
3.155SerPro: 3.155 ± 0.729
4.732SerGln: 4.732 ± 3.374
1.577SerArg: 1.577 ± 1.172
4.732SerSer: 4.732 ± 1.563
6.309SerThr: 6.309 ± 1.862
3.943SerVal: 3.943 ± 0.543
0.789SerTrp: 0.789 ± 0.594
1.577SerTyr: 1.577 ± 0.365
0.0SerXaa: 0.0 ± 0.0
Thr
3.155ThrAla: 3.155 ± 1.353
0.789ThrCys: 0.789 ± 0.586
1.577ThrAsp: 1.577 ± 1.172
2.366ThrGlu: 2.366 ± 2.321
2.366ThrPhe: 2.366 ± 1.783
3.155ThrGly: 3.155 ± 0.729
0.789ThrHis: 0.789 ± 0.594
4.732ThrIle: 4.732 ± 0.525
6.309ThrLys: 6.309 ± 1.833
5.521ThrLeu: 5.521 ± 1.406
3.155ThrMet: 3.155 ± 1.353
4.732ThrAsn: 4.732 ± 0.525
4.732ThrPro: 4.732 ± 2.521
2.366ThrGln: 2.366 ± 0.899
2.366ThrArg: 2.366 ± 1.783
3.155ThrSer: 3.155 ± 1.01
7.098ThrThr: 7.098 ± 3.895
1.577ThrVal: 1.577 ± 1.189
0.789ThrTrp: 0.789 ± 0.586
5.521ThrTyr: 5.521 ± 1.382
0.0ThrXaa: 0.0 ± 0.0
Val
4.732ValAla: 4.732 ± 1.931
0.789ValCys: 0.789 ± 0.586
4.732ValAsp: 4.732 ± 1.549
1.577ValGlu: 1.577 ± 0.365
1.577ValPhe: 1.577 ± 0.365
6.309ValGly: 6.309 ± 1.609
0.789ValHis: 0.789 ± 0.586
2.366ValIle: 2.366 ± 0.774
3.155ValLys: 3.155 ± 2.344
8.675ValLeu: 8.675 ± 1.143
3.155ValMet: 3.155 ± 1.353
3.155ValAsn: 3.155 ± 1.324
2.366ValPro: 2.366 ± 0.793
0.0ValGln: 0.0 ± 0.0
3.155ValArg: 3.155 ± 1.324
4.732ValSer: 4.732 ± 2.521
0.789ValThr: 0.789 ± 0.594
2.366ValVal: 2.366 ± 1.758
1.577ValTrp: 1.577 ± 1.172
3.155ValTyr: 3.155 ± 1.324
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.789TrpAsp: 0.789 ± 0.594
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.789TrpGly: 0.789 ± 0.586
1.577TrpHis: 1.577 ± 1.172
0.789TrpIle: 0.789 ± 0.586
0.0TrpLys: 0.0 ± 0.0
2.366TrpLeu: 2.366 ± 0.793
2.366TrpMet: 2.366 ± 0.774
0.0TrpAsn: 0.0 ± 0.0
0.789TrpPro: 0.789 ± 0.586
0.0TrpGln: 0.0 ± 0.0
0.789TrpArg: 0.789 ± 0.586
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.789TrpVal: 0.789 ± 0.586
0.789TrpTrp: 0.789 ± 0.586
1.577TrpTyr: 1.577 ± 1.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.943TyrAla: 3.943 ± 0.543
0.789TyrCys: 0.789 ± 0.594
3.943TyrAsp: 3.943 ± 1.059
2.366TyrGlu: 2.366 ± 1.758
1.577TyrPhe: 1.577 ± 1.172
4.732TyrGly: 4.732 ± 2.521
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
0.789TyrLys: 0.789 ± 0.594
3.155TyrLeu: 3.155 ± 0.729
1.577TyrMet: 1.577 ± 1.189
2.366TyrAsn: 2.366 ± 0.793
2.366TyrPro: 2.366 ± 0.774
3.155TyrGln: 3.155 ± 0.729
3.155TyrArg: 3.155 ± 0.729
3.943TyrSer: 3.943 ± 0.543
3.943TyrThr: 3.943 ± 1.934
0.789TyrVal: 0.789 ± 0.594
0.0TyrTrp: 0.0 ± 0.0
1.577TyrTyr: 1.577 ± 0.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1269 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski