Amino acid dipepetide frequency for Human associated porprismacovirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.777AlaAla: 4.777 ± 0.596
0.0AlaCys: 0.0 ± 0.0
7.962AlaAsp: 7.962 ± 0.253
1.592AlaGlu: 1.592 ± 1.281
0.0AlaPhe: 0.0 ± 0.0
4.777AlaGly: 4.777 ± 0.596
1.592AlaHis: 1.592 ± 1.281
4.777AlaIle: 4.777 ± 1.624
1.592AlaLys: 1.592 ± 1.281
6.369AlaLeu: 6.369 ± 5.125
1.592AlaMet: 1.592 ± 1.281
4.777AlaAsn: 4.777 ± 0.596
4.777AlaPro: 4.777 ± 0.596
1.592AlaGln: 1.592 ± 1.281
0.0AlaArg: 0.0 ± 0.0
14.331AlaSer: 14.331 ± 6.226
4.777AlaThr: 4.777 ± 0.596
1.592AlaVal: 1.592 ± 0.938
3.185AlaTrp: 3.185 ± 0.343
1.592AlaTyr: 1.592 ± 1.281
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.592CysIle: 1.592 ± 1.281
3.185CysLys: 3.185 ± 2.563
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.592CysSer: 1.592 ± 0.938
0.0CysThr: 0.0 ± 0.0
1.592CysVal: 1.592 ± 1.281
0.0CysTrp: 0.0 ± 0.0
1.592CysTyr: 1.592 ± 1.281
0.0CysXaa: 0.0 ± 0.0
Asp
3.185AspAla: 3.185 ± 1.877
1.592AspCys: 1.592 ± 1.281
0.0AspAsp: 0.0 ± 0.0
3.185AspGlu: 3.185 ± 2.563
1.592AspPhe: 1.592 ± 1.281
6.369AspGly: 6.369 ± 1.534
0.0AspHis: 0.0 ± 0.0
1.592AspIle: 1.592 ± 1.281
3.185AspLys: 3.185 ± 2.563
1.592AspLeu: 1.592 ± 0.938
3.185AspMet: 3.185 ± 1.877
3.185AspAsn: 3.185 ± 1.877
3.185AspPro: 3.185 ± 1.877
4.777AspGln: 4.777 ± 2.815
6.369AspArg: 6.369 ± 2.905
1.592AspSer: 1.592 ± 1.281
4.777AspThr: 4.777 ± 0.596
3.185AspVal: 3.185 ± 1.877
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.777GluAla: 4.777 ± 1.624
0.0GluCys: 0.0 ± 0.0
1.592GluAsp: 1.592 ± 0.938
1.592GluGlu: 1.592 ± 0.938
1.592GluPhe: 1.592 ± 0.938
4.777GluGly: 4.777 ± 1.624
1.592GluHis: 1.592 ± 1.281
3.185GluIle: 3.185 ± 0.343
3.185GluLys: 3.185 ± 2.563
1.592GluLeu: 1.592 ± 0.938
0.0GluMet: 0.0 ± 0.0
3.185GluAsn: 3.185 ± 1.877
0.0GluPro: 0.0 ± 0.0
1.592GluGln: 1.592 ± 1.281
3.185GluArg: 3.185 ± 2.563
7.962GluSer: 7.962 ± 1.967
3.185GluThr: 3.185 ± 0.343
3.185GluVal: 3.185 ± 0.343
1.592GluTrp: 1.592 ± 1.281
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
3.185PheAsp: 3.185 ± 1.877
3.185PheGlu: 3.185 ± 0.343
3.185PhePhe: 3.185 ± 0.343
4.777PheGly: 4.777 ± 0.596
1.592PheHis: 1.592 ± 0.938
0.0PheIle: 0.0 ± 0.0
3.185PheLys: 3.185 ± 1.877
1.592PheLeu: 1.592 ± 0.938
4.777PheMet: 4.777 ± 2.418
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
6.369PheArg: 6.369 ± 1.534
0.0PheSer: 0.0 ± 0.0
3.185PheThr: 3.185 ± 0.343
1.592PheVal: 1.592 ± 0.938
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.369GlyAla: 6.369 ± 0.686
1.592GlyCys: 1.592 ± 0.938
0.0GlyAsp: 0.0 ± 0.0
3.185GlyGlu: 3.185 ± 1.877
7.962GlyPhe: 7.962 ± 2.472
4.777GlyGly: 4.777 ± 1.624
0.0GlyHis: 0.0 ± 0.0
4.777GlyIle: 4.777 ± 2.815
6.369GlyLys: 6.369 ± 5.125
4.777GlyLeu: 4.777 ± 1.624
0.0GlyMet: 0.0 ± 0.0
6.369GlyAsn: 6.369 ± 1.534
0.0GlyPro: 0.0 ± 0.0
0.0GlyGln: 0.0 ± 0.0
3.185GlyArg: 3.185 ± 0.343
4.777GlySer: 4.777 ± 0.596
9.554GlyThr: 9.554 ± 5.631
6.369GlyVal: 6.369 ± 0.686
3.185GlyTrp: 3.185 ± 0.343
1.592GlyTyr: 1.592 ± 1.281
0.0GlyXaa: 0.0 ± 0.0
His
1.592HisAla: 1.592 ± 1.281
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.185HisGly: 3.185 ± 1.877
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
1.592HisThr: 1.592 ± 0.938
3.185HisVal: 3.185 ± 0.343
1.592HisTrp: 1.592 ± 1.281
1.592HisTyr: 1.592 ± 0.938
0.0HisXaa: 0.0 ± 0.0
Ile
3.185IleAla: 3.185 ± 0.343
1.592IleCys: 1.592 ± 1.281
6.369IleAsp: 6.369 ± 0.686
3.185IleGlu: 3.185 ± 2.563
3.185IlePhe: 3.185 ± 1.877
1.592IleGly: 1.592 ± 0.938
1.592IleHis: 1.592 ± 0.938
6.369IleIle: 6.369 ± 0.686
1.592IleLys: 1.592 ± 1.281
7.962IleLeu: 7.962 ± 2.472
1.592IleMet: 1.592 ± 1.281
1.592IleAsn: 1.592 ± 1.281
4.777IlePro: 4.777 ± 1.624
1.592IleGln: 1.592 ± 0.938
6.369IleArg: 6.369 ± 5.125
1.592IleSer: 1.592 ± 0.938
4.777IleThr: 4.777 ± 0.596
1.592IleVal: 1.592 ± 1.281
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.369LysAla: 6.369 ± 0.686
0.0LysCys: 0.0 ± 0.0
1.592LysAsp: 1.592 ± 1.281
3.185LysGlu: 3.185 ± 2.563
3.185LysPhe: 3.185 ± 1.877
7.962LysGly: 7.962 ± 2.472
0.0LysHis: 0.0 ± 0.0
1.592LysIle: 1.592 ± 0.938
1.592LysLys: 1.592 ± 1.281
6.369LysLeu: 6.369 ± 2.905
1.592LysMet: 1.592 ± 1.281
3.185LysAsn: 3.185 ± 2.563
0.0LysPro: 0.0 ± 0.0
3.185LysGln: 3.185 ± 0.343
1.592LysArg: 1.592 ± 1.281
1.592LysSer: 1.592 ± 1.281
0.0LysThr: 0.0 ± 0.0
1.592LysVal: 1.592 ± 1.281
4.777LysTrp: 4.777 ± 1.624
1.592LysTyr: 1.592 ± 1.281
0.0LysXaa: 0.0 ± 0.0
Leu
4.777LeuAla: 4.777 ± 0.596
0.0LeuCys: 0.0 ± 0.0
4.777LeuAsp: 4.777 ± 0.596
4.777LeuGlu: 4.777 ± 1.624
1.592LeuPhe: 1.592 ± 0.938
3.185LeuGly: 3.185 ± 0.343
0.0LeuHis: 0.0 ± 0.0
1.592LeuIle: 1.592 ± 1.281
1.592LeuLys: 1.592 ± 1.281
0.0LeuLeu: 0.0 ± 0.0
0.0LeuMet: 0.0 ± 0.87
1.592LeuAsn: 1.592 ± 0.938
11.146LeuPro: 11.146 ± 6.569
6.369LeuGln: 6.369 ± 3.754
0.0LeuArg: 0.0 ± 0.0
3.185LeuSer: 3.185 ± 0.343
6.369LeuThr: 6.369 ± 0.686
6.369LeuVal: 6.369 ± 2.905
1.592LeuTrp: 1.592 ± 1.281
7.962LeuTyr: 7.962 ± 1.967
0.0LeuXaa: 0.0 ± 0.0
Met
1.592MetAla: 1.592 ± 0.938
0.0MetCys: 0.0 ± 0.0
1.592MetAsp: 1.592 ± 0.938
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.592MetGly: 1.592 ± 0.938
0.0MetHis: 0.0 ± 0.0
3.185MetIle: 3.185 ± 2.563
0.0MetLys: 0.0 ± 0.0
1.592MetLeu: 1.592 ± 0.938
0.0MetMet: 0.0 ± 0.0
1.592MetAsn: 1.592 ± 0.938
4.777MetPro: 4.777 ± 2.815
0.0MetGln: 0.0 ± 0.0
3.185MetArg: 3.185 ± 0.343
3.185MetSer: 3.185 ± 1.877
0.0MetThr: 0.0 ± 0.0
3.185MetVal: 3.185 ± 2.563
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.592AsnAla: 1.592 ± 0.938
0.0AsnCys: 0.0 ± 0.0
3.185AsnAsp: 3.185 ± 2.563
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
4.777AsnGly: 4.777 ± 0.596
0.0AsnHis: 0.0 ± 0.0
3.185AsnIle: 3.185 ± 0.343
1.592AsnLys: 1.592 ± 0.938
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
3.185AsnAsn: 3.185 ± 1.877
3.185AsnPro: 3.185 ± 1.877
1.592AsnGln: 1.592 ± 1.281
1.592AsnArg: 1.592 ± 0.938
3.185AsnSer: 3.185 ± 1.877
9.554AsnThr: 9.554 ± 1.191
7.962AsnVal: 7.962 ± 0.253
0.0AsnTrp: 0.0 ± 0.0
3.185AsnTyr: 3.185 ± 1.877
0.0AsnXaa: 0.0 ± 0.0
Pro
4.777ProAla: 4.777 ± 2.815
0.0ProCys: 0.0 ± 0.0
1.592ProAsp: 1.592 ± 0.938
3.185ProGlu: 3.185 ± 1.877
1.592ProPhe: 1.592 ± 0.938
1.592ProGly: 1.592 ± 0.938
0.0ProHis: 0.0 ± 0.0
4.777ProIle: 4.777 ± 2.815
3.185ProLys: 3.185 ± 0.343
4.777ProLeu: 4.777 ± 0.596
0.0ProMet: 0.0 ± 0.0
6.369ProAsn: 6.369 ± 1.534
4.777ProPro: 4.777 ± 0.596
0.0ProGln: 0.0 ± 0.0
7.962ProArg: 7.962 ± 0.253
1.592ProSer: 1.592 ± 0.938
7.962ProThr: 7.962 ± 0.253
3.185ProVal: 3.185 ± 1.877
0.0ProTrp: 0.0 ± 0.0
1.592ProTyr: 1.592 ± 1.281
0.0ProXaa: 0.0 ± 0.0
Gln
4.777GlnAla: 4.777 ± 0.596
1.592GlnCys: 1.592 ± 1.281
0.0GlnAsp: 0.0 ± 0.0
3.185GlnGlu: 3.185 ± 1.877
1.592GlnPhe: 1.592 ± 0.938
0.0GlnGly: 0.0 ± 0.0
1.592GlnHis: 1.592 ± 0.938
4.777GlnIle: 4.777 ± 1.624
1.592GlnLys: 1.592 ± 0.938
1.592GlnLeu: 1.592 ± 1.281
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
1.592GlnPro: 1.592 ± 1.281
0.0GlnGln: 0.0 ± 0.0
1.592GlnArg: 1.592 ± 1.281
3.185GlnSer: 3.185 ± 0.343
1.592GlnThr: 1.592 ± 0.938
3.185GlnVal: 3.185 ± 1.877
1.592GlnTrp: 1.592 ± 1.281
1.592GlnTyr: 1.592 ± 0.938
0.0GlnXaa: 0.0 ± 0.0
Arg
6.369ArgAla: 6.369 ± 5.125
0.0ArgCys: 0.0 ± 0.0
1.592ArgAsp: 1.592 ± 0.938
1.592ArgGlu: 1.592 ± 1.281
1.592ArgPhe: 1.592 ± 1.281
7.962ArgGly: 7.962 ± 4.187
0.0ArgHis: 0.0 ± 0.0
6.369ArgIle: 6.369 ± 2.905
3.185ArgLys: 3.185 ± 1.877
1.592ArgLeu: 1.592 ± 0.938
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
3.185ArgPro: 3.185 ± 2.563
0.0ArgGln: 0.0 ± 0.0
3.185ArgArg: 3.185 ± 0.343
3.185ArgSer: 3.185 ± 0.343
1.592ArgThr: 1.592 ± 0.938
6.369ArgVal: 6.369 ± 1.534
1.592ArgTrp: 1.592 ± 1.281
6.369ArgTyr: 6.369 ± 0.686
0.0ArgXaa: 0.0 ± 0.0
Ser
4.777SerAla: 4.777 ± 1.624
1.592SerCys: 1.592 ± 1.281
3.185SerAsp: 3.185 ± 1.877
1.592SerGlu: 1.592 ± 1.281
0.0SerPhe: 0.0 ± 0.0
6.369SerGly: 6.369 ± 1.534
0.0SerHis: 0.0 ± 0.0
1.592SerIle: 1.592 ± 1.281
1.592SerLys: 1.592 ± 1.281
4.777SerLeu: 4.777 ± 2.815
1.592SerMet: 1.592 ± 0.938
1.592SerAsn: 1.592 ± 1.281
6.369SerPro: 6.369 ± 3.754
0.0SerGln: 0.0 ± 0.0
1.592SerArg: 1.592 ± 1.281
4.777SerSer: 4.777 ± 0.596
4.777SerThr: 4.777 ± 2.815
6.369SerVal: 6.369 ± 1.534
4.777SerTrp: 4.777 ± 1.624
4.777SerTyr: 4.777 ± 2.815
0.0SerXaa: 0.0 ± 0.0
Thr
7.962ThrAla: 7.962 ± 0.253
0.0ThrCys: 0.0 ± 0.0
3.185ThrAsp: 3.185 ± 0.343
0.0ThrGlu: 0.0 ± 0.0
3.185ThrPhe: 3.185 ± 1.877
7.962ThrGly: 7.962 ± 0.253
3.185ThrHis: 3.185 ± 1.877
1.592ThrIle: 1.592 ± 0.938
3.185ThrLys: 3.185 ± 0.343
7.962ThrLeu: 7.962 ± 2.472
3.185ThrMet: 3.185 ± 1.877
6.369ThrAsn: 6.369 ± 0.686
6.369ThrPro: 6.369 ± 1.534
3.185ThrGln: 3.185 ± 0.343
1.592ThrArg: 1.592 ± 1.281
1.592ThrSer: 1.592 ± 0.938
4.777ThrThr: 4.777 ± 2.815
7.962ThrVal: 7.962 ± 2.472
0.0ThrTrp: 0.0 ± 0.0
3.185ThrTyr: 3.185 ± 1.877
0.0ThrXaa: 0.0 ± 0.0
Val
3.185ValAla: 3.185 ± 2.563
0.0ValCys: 0.0 ± 0.0
6.369ValAsp: 6.369 ± 0.686
6.369ValGlu: 6.369 ± 0.686
3.185ValPhe: 3.185 ± 0.343
3.185ValGly: 3.185 ± 0.343
0.0ValHis: 0.0 ± 0.0
6.369ValIle: 6.369 ± 1.534
6.369ValLys: 6.369 ± 0.686
11.146ValLeu: 11.146 ± 2.31
3.185ValMet: 3.185 ± 1.877
3.185ValAsn: 3.185 ± 1.877
3.185ValPro: 3.185 ± 0.343
4.777ValGln: 4.777 ± 1.624
3.185ValArg: 3.185 ± 1.877
1.592ValSer: 1.592 ± 1.281
6.369ValThr: 6.369 ± 1.534
6.369ValVal: 6.369 ± 3.754
3.185ValTrp: 3.185 ± 0.343
1.592ValTyr: 1.592 ± 1.281
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.592TrpCys: 1.592 ± 1.281
0.0TrpAsp: 0.0 ± 0.0
1.592TrpGlu: 1.592 ± 1.281
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.592TrpIle: 1.592 ± 1.281
3.185TrpLys: 3.185 ± 0.343
3.185TrpLeu: 3.185 ± 0.343
3.185TrpMet: 3.185 ± 0.343
1.592TrpAsn: 1.592 ± 0.938
0.0TrpPro: 0.0 ± 0.0
1.592TrpGln: 1.592 ± 1.281
4.777TrpArg: 4.777 ± 1.624
1.592TrpSer: 1.592 ± 1.281
0.0TrpThr: 0.0 ± 0.0
1.592TrpVal: 1.592 ± 1.281
0.0TrpTrp: 0.0 ± 0.0
1.592TrpTyr: 1.592 ± 1.281
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.592TyrAla: 1.592 ± 0.938
0.0TyrCys: 0.0 ± 0.0
6.369TyrAsp: 6.369 ± 0.686
6.369TyrGlu: 6.369 ± 0.686
3.185TyrPhe: 3.185 ± 1.877
0.0TyrGly: 0.0 ± 0.0
1.592TyrHis: 1.592 ± 1.281
1.592TyrIle: 1.592 ± 1.281
1.592TyrLys: 1.592 ± 0.938
1.592TyrLeu: 1.592 ± 0.938
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.592TyrPro: 1.592 ± 0.938
4.777TyrGln: 4.777 ± 0.596
0.0TyrArg: 0.0 ± 0.0
1.592TyrSer: 1.592 ± 0.938
1.592TyrThr: 1.592 ± 1.281
6.369TyrVal: 6.369 ± 5.125
0.0TyrTrp: 0.0 ± 0.0
4.777TyrTyr: 4.777 ± 0.596
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (629 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski