Amino acid dipepetide frequency for Gammarus sp. amphipod associated circular virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.604AlaAla: 3.604 ± 2.412
1.802AlaCys: 1.802 ± 1.206
5.405AlaAsp: 5.405 ± 1.059
1.802AlaGlu: 1.802 ± 1.206
3.604AlaPhe: 3.604 ± 0.148
3.604AlaGly: 3.604 ± 0.148
1.802AlaHis: 1.802 ± 1.206
9.009AlaIle: 9.009 ± 1.649
1.802AlaLys: 1.802 ± 1.206
5.405AlaLeu: 5.405 ± 3.618
1.802AlaMet: 1.802 ± 1.354
1.802AlaAsn: 1.802 ± 1.206
3.604AlaPro: 3.604 ± 2.707
3.604AlaGln: 3.604 ± 2.707
9.009AlaArg: 9.009 ± 1.649
1.802AlaSer: 1.802 ± 1.354
3.604AlaThr: 3.604 ± 0.148
1.802AlaVal: 1.802 ± 1.354
0.0AlaTrp: 0.0 ± 0.0
3.604AlaTyr: 3.604 ± 0.148
0.0AlaXaa: 0.0 ± 0.0
Cys
1.802CysAla: 1.802 ± 1.206
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
3.604CysPhe: 3.604 ± 2.412
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
3.604CysLys: 3.604 ± 0.148
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.802CysAsn: 1.802 ± 1.206
1.802CysPro: 1.802 ± 1.354
0.0CysGln: 0.0 ± 0.0
1.802CysArg: 1.802 ± 1.206
1.802CysSer: 1.802 ± 1.206
1.802CysThr: 1.802 ± 1.354
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.802AspAla: 1.802 ± 1.354
0.0AspCys: 0.0 ± 0.0
1.802AspAsp: 1.802 ± 1.206
0.0AspGlu: 0.0 ± 0.0
3.604AspPhe: 3.604 ± 0.148
3.604AspGly: 3.604 ± 2.412
0.0AspHis: 0.0 ± 0.0
1.802AspIle: 1.802 ± 1.206
3.604AspLys: 3.604 ± 0.148
9.009AspLeu: 9.009 ± 0.911
0.0AspMet: 0.0 ± 0.0
5.405AspAsn: 5.405 ± 4.061
1.802AspPro: 1.802 ± 1.206
0.0AspGln: 0.0 ± 0.0
9.009AspArg: 9.009 ± 3.471
3.604AspSer: 3.604 ± 0.148
3.604AspThr: 3.604 ± 0.148
3.604AspVal: 3.604 ± 0.148
3.604AspTrp: 3.604 ± 2.412
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.405GluAla: 5.405 ± 1.059
1.802GluCys: 1.802 ± 1.206
3.604GluAsp: 3.604 ± 2.707
1.802GluGlu: 1.802 ± 1.354
1.802GluPhe: 1.802 ± 1.206
1.802GluGly: 1.802 ± 1.206
0.0GluHis: 0.0 ± 0.0
1.802GluIle: 1.802 ± 1.206
1.802GluLys: 1.802 ± 1.206
7.207GluLeu: 7.207 ± 0.295
5.405GluMet: 5.405 ± 1.501
0.0GluAsn: 0.0 ± 0.0
3.604GluPro: 3.604 ± 0.148
5.405GluGln: 5.405 ± 1.501
1.802GluArg: 1.802 ± 1.206
1.802GluSer: 1.802 ± 1.206
0.0GluThr: 0.0 ± 0.0
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
1.802GluTyr: 1.802 ± 1.206
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.802PheCys: 1.802 ± 1.206
0.0PheAsp: 0.0 ± 0.0
5.405PheGlu: 5.405 ± 1.501
3.604PhePhe: 3.604 ± 2.707
3.604PheGly: 3.604 ± 2.412
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
1.802PheLys: 1.802 ± 1.206
3.604PheLeu: 3.604 ± 2.412
0.0PheMet: 0.0 ± 0.0
1.802PheAsn: 1.802 ± 1.206
3.604PhePro: 3.604 ± 2.412
1.802PheGln: 1.802 ± 1.354
7.207PheArg: 7.207 ± 2.855
5.405PheSer: 5.405 ± 1.059
1.802PheThr: 1.802 ± 1.206
5.405PheVal: 5.405 ± 1.059
0.0PheTrp: 0.0 ± 0.0
1.802PheTyr: 1.802 ± 1.206
0.0PheXaa: 0.0 ± 0.0
Gly
5.405GlyAla: 5.405 ± 4.061
0.0GlyCys: 0.0 ± 0.0
7.207GlyAsp: 7.207 ± 2.855
5.405GlyGlu: 5.405 ± 1.059
5.405GlyPhe: 5.405 ± 1.059
0.0GlyGly: 0.0 ± 0.0
0.0GlyHis: 0.0 ± 0.0
1.802GlyIle: 1.802 ± 1.206
7.207GlyLys: 7.207 ± 2.265
1.802GlyLeu: 1.802 ± 1.354
0.0GlyMet: 0.0 ± 0.0
5.405GlyAsn: 5.405 ± 3.618
0.0GlyPro: 0.0 ± 0.0
3.604GlyGln: 3.604 ± 2.707
1.802GlyArg: 1.802 ± 1.206
9.009GlySer: 9.009 ± 1.649
1.802GlyThr: 1.802 ± 1.206
0.0GlyVal: 0.0 ± 0.0
1.802GlyTrp: 1.802 ± 1.206
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.802HisPhe: 1.802 ± 1.206
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.802HisLeu: 1.802 ± 1.206
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
3.604HisVal: 3.604 ± 2.412
0.0HisTrp: 0.0 ± 0.0
1.802HisTyr: 1.802 ± 1.354
0.0HisXaa: 0.0 ± 0.0
Ile
3.604IleAla: 3.604 ± 0.148
0.0IleCys: 0.0 ± 0.0
0.0IleAsp: 0.0 ± 0.0
0.0IleGlu: 0.0 ± 0.0
5.405IlePhe: 5.405 ± 3.618
9.009IleGly: 9.009 ± 0.911
1.802IleHis: 1.802 ± 1.206
3.604IleIle: 3.604 ± 2.412
1.802IleLys: 1.802 ± 1.354
3.604IleLeu: 3.604 ± 2.412
0.0IleMet: 0.0 ± 0.0
14.414IleAsn: 14.414 ± 3.15
1.802IlePro: 1.802 ± 1.354
0.0IleGln: 0.0 ± 0.0
0.0IleArg: 0.0 ± 0.0
3.604IleSer: 3.604 ± 0.148
1.802IleThr: 1.802 ± 1.354
5.405IleVal: 5.405 ± 1.501
0.0IleTrp: 0.0 ± 0.0
3.604IleTyr: 3.604 ± 0.148
0.0IleXaa: 0.0 ± 0.0
Lys
5.405LysAla: 5.405 ± 1.501
0.0LysCys: 0.0 ± 0.0
3.604LysAsp: 3.604 ± 2.412
1.802LysGlu: 1.802 ± 1.206
0.0LysPhe: 0.0 ± 0.0
1.802LysGly: 1.802 ± 1.354
0.0LysHis: 0.0 ± 0.0
5.405LysIle: 5.405 ± 1.059
5.405LysLys: 5.405 ± 1.059
1.802LysLeu: 1.802 ± 1.354
1.802LysMet: 1.802 ± 1.354
1.802LysAsn: 1.802 ± 1.354
1.802LysPro: 1.802 ± 1.354
0.0LysGln: 0.0 ± 0.0
10.811LysArg: 10.811 ± 0.443
3.604LysSer: 3.604 ± 0.148
7.207LysThr: 7.207 ± 0.295
1.802LysVal: 1.802 ± 1.206
1.802LysTrp: 1.802 ± 1.206
3.604LysTyr: 3.604 ± 0.148
0.0LysXaa: 0.0 ± 0.0
Leu
9.009LeuAla: 9.009 ± 0.911
0.0LeuCys: 0.0 ± 0.0
3.604LeuAsp: 3.604 ± 0.148
1.802LeuGlu: 1.802 ± 1.206
0.0LeuPhe: 0.0 ± 0.0
1.802LeuGly: 1.802 ± 1.206
0.0LeuHis: 0.0 ± 0.0
1.802LeuIle: 1.802 ± 1.354
1.802LeuLys: 1.802 ± 1.206
3.604LeuLeu: 3.604 ± 0.148
1.802LeuMet: 1.802 ± 1.206
1.802LeuAsn: 1.802 ± 1.354
3.604LeuPro: 3.604 ± 0.148
5.405LeuGln: 5.405 ± 3.618
0.0LeuArg: 0.0 ± 0.0
5.405LeuSer: 5.405 ± 1.501
12.613LeuThr: 12.613 ± 1.797
3.604LeuVal: 3.604 ± 2.412
1.802LeuTrp: 1.802 ± 1.354
3.604LeuTyr: 3.604 ± 2.707
0.0LeuXaa: 0.0 ± 0.0
Met
1.802MetAla: 1.802 ± 1.354
0.0MetCys: 0.0 ± 0.0
1.802MetAsp: 1.802 ± 1.206
0.0MetGlu: 0.0 ± 0.0
1.802MetPhe: 1.802 ± 1.354
0.0MetGly: 0.0 ± 0.0
1.802MetHis: 1.802 ± 1.354
1.802MetIle: 1.802 ± 1.206
0.0MetLys: 0.0 ± 0.0
3.604MetLeu: 3.604 ± 0.148
1.802MetMet: 1.802 ± 1.354
0.0MetAsn: 0.0 ± 0.0
1.802MetPro: 1.802 ± 1.354
0.0MetGln: 0.0 ± 0.0
3.604MetArg: 3.604 ± 0.148
0.0MetSer: 0.0 ± 0.0
5.405MetThr: 5.405 ± 1.059
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.802MetTyr: 1.802 ± 1.354
0.0MetXaa: 0.0 ± 0.0
Asn
7.207AsnAla: 7.207 ± 0.295
0.0AsnCys: 0.0 ± 0.0
3.604AsnAsp: 3.604 ± 0.148
5.405AsnGlu: 5.405 ± 1.059
0.0AsnPhe: 0.0 ± 0.0
3.604AsnGly: 3.604 ± 0.148
0.0AsnHis: 0.0 ± 0.0
1.802AsnIle: 1.802 ± 1.206
0.0AsnLys: 0.0 ± 0.0
0.0AsnLeu: 0.0 ± 0.0
3.604AsnMet: 3.604 ± 2.128
3.604AsnAsn: 3.604 ± 0.148
1.802AsnPro: 1.802 ± 1.206
7.207AsnGln: 7.207 ± 2.855
3.604AsnArg: 3.604 ± 2.707
1.802AsnSer: 1.802 ± 1.354
1.802AsnThr: 1.802 ± 1.206
5.405AsnVal: 5.405 ± 1.501
1.802AsnTrp: 1.802 ± 1.354
5.405AsnTyr: 5.405 ± 1.501
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
1.802ProCys: 1.802 ± 1.354
3.604ProAsp: 3.604 ± 2.412
3.604ProGlu: 3.604 ± 0.148
0.0ProPhe: 0.0 ± 0.0
3.604ProGly: 3.604 ± 0.148
3.604ProHis: 3.604 ± 2.412
5.405ProIle: 5.405 ± 1.501
9.009ProLys: 9.009 ± 1.649
3.604ProLeu: 3.604 ± 2.707
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
3.604ProPro: 3.604 ± 2.412
0.0ProGln: 0.0 ± 0.0
7.207ProArg: 7.207 ± 0.295
3.604ProSer: 3.604 ± 2.412
1.802ProThr: 1.802 ± 1.206
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
1.802ProTyr: 1.802 ± 1.354
0.0ProXaa: 0.0 ± 0.0
Gln
1.802GlnAla: 1.802 ± 1.206
0.0GlnCys: 0.0 ± 0.0
1.802GlnAsp: 1.802 ± 1.206
1.802GlnGlu: 1.802 ± 1.354
1.802GlnPhe: 1.802 ± 1.354
5.405GlnGly: 5.405 ± 1.059
0.0GlnHis: 0.0 ± 0.0
1.802GlnIle: 1.802 ± 1.354
3.604GlnLys: 3.604 ± 2.707
0.0GlnLeu: 0.0 ± 0.0
3.604GlnMet: 3.604 ± 0.148
3.604GlnAsn: 3.604 ± 2.707
1.802GlnPro: 1.802 ± 1.206
1.802GlnGln: 1.802 ± 1.354
3.604GlnArg: 3.604 ± 2.707
3.604GlnSer: 3.604 ± 0.148
5.405GlnThr: 5.405 ± 1.501
1.802GlnVal: 1.802 ± 1.206
0.0GlnTrp: 0.0 ± 0.0
3.604GlnTyr: 3.604 ± 2.412
0.0GlnXaa: 0.0 ± 0.0
Arg
3.604ArgAla: 3.604 ± 2.707
0.0ArgCys: 0.0 ± 0.0
7.207ArgAsp: 7.207 ± 0.295
7.207ArgGlu: 7.207 ± 0.295
7.207ArgPhe: 7.207 ± 0.295
7.207ArgGly: 7.207 ± 2.855
0.0ArgHis: 0.0 ± 0.0
10.811ArgIle: 10.811 ± 3.003
9.009ArgLys: 9.009 ± 4.209
1.802ArgLeu: 1.802 ± 1.206
3.604ArgMet: 3.604 ± 0.148
5.405ArgAsn: 5.405 ± 1.501
1.802ArgPro: 1.802 ± 1.354
5.405ArgGln: 5.405 ± 4.061
5.405ArgArg: 5.405 ± 4.061
0.0ArgSer: 0.0 ± 0.0
5.405ArgThr: 5.405 ± 1.059
5.405ArgVal: 5.405 ± 1.059
0.0ArgTrp: 0.0 ± 0.0
3.604ArgTyr: 3.604 ± 2.412
0.0ArgXaa: 0.0 ± 0.0
Ser
5.405SerAla: 5.405 ± 1.059
3.604SerCys: 3.604 ± 2.412
1.802SerAsp: 1.802 ± 1.354
1.802SerGlu: 1.802 ± 1.206
1.802SerPhe: 1.802 ± 1.206
5.405SerGly: 5.405 ± 1.501
0.0SerHis: 0.0 ± 0.0
1.802SerIle: 1.802 ± 1.206
1.802SerLys: 1.802 ± 1.206
5.405SerLeu: 5.405 ± 4.061
0.0SerMet: 0.0 ± 0.0
5.405SerAsn: 5.405 ± 1.501
3.604SerPro: 3.604 ± 0.148
3.604SerGln: 3.604 ± 0.148
3.604SerArg: 3.604 ± 2.707
9.009SerSer: 9.009 ± 3.471
3.604SerThr: 3.604 ± 0.148
3.604SerVal: 3.604 ± 2.412
0.0SerTrp: 0.0 ± 0.0
3.604SerTyr: 3.604 ± 0.148
0.0SerXaa: 0.0 ± 0.0
Thr
5.405ThrAla: 5.405 ± 1.501
0.0ThrCys: 0.0 ± 0.0
3.604ThrAsp: 3.604 ± 2.412
3.604ThrGlu: 3.604 ± 0.148
3.604ThrPhe: 3.604 ± 0.148
1.802ThrGly: 1.802 ± 1.354
0.0ThrHis: 0.0 ± 0.0
5.405ThrIle: 5.405 ± 1.501
5.405ThrLys: 5.405 ± 1.059
0.0ThrLeu: 0.0 ± 0.0
0.0ThrMet: 0.0 ± 0.0
0.0ThrAsn: 0.0 ± 0.0
5.405ThrPro: 5.405 ± 1.059
1.802ThrGln: 1.802 ± 1.206
5.405ThrArg: 5.405 ± 1.501
7.207ThrSer: 7.207 ± 0.295
1.802ThrThr: 1.802 ± 1.354
5.405ThrVal: 5.405 ± 1.059
3.604ThrTrp: 3.604 ± 0.148
1.802ThrTyr: 1.802 ± 1.206
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
3.604ValCys: 3.604 ± 2.707
5.405ValAsp: 5.405 ± 1.059
1.802ValGlu: 1.802 ± 1.206
1.802ValPhe: 1.802 ± 1.206
1.802ValGly: 1.802 ± 1.354
0.0ValHis: 0.0 ± 0.0
3.604ValIle: 3.604 ± 2.412
0.0ValLys: 0.0 ± 0.0
7.207ValLeu: 7.207 ± 0.295
1.802ValMet: 1.802 ± 1.918
1.802ValAsn: 1.802 ± 1.206
5.405ValPro: 5.405 ± 3.618
5.405ValGln: 5.405 ± 3.618
3.604ValArg: 3.604 ± 2.707
1.802ValSer: 1.802 ± 1.354
1.802ValThr: 1.802 ± 1.206
1.802ValVal: 1.802 ± 1.206
0.0ValTrp: 0.0 ± 0.0
3.604ValTyr: 3.604 ± 0.148
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.802TrpAsp: 1.802 ± 1.206
1.802TrpGlu: 1.802 ± 1.354
1.802TrpPhe: 1.802 ± 1.206
1.802TrpGly: 1.802 ± 1.354
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.802TrpAsn: 1.802 ± 1.354
1.802TrpPro: 1.802 ± 1.206
1.802TrpGln: 1.802 ± 1.206
1.802TrpArg: 1.802 ± 1.354
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.802TrpVal: 1.802 ± 1.206
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
7.207TyrAla: 7.207 ± 4.825
3.604TyrCys: 3.604 ± 2.412
0.0TyrAsp: 0.0 ± 0.0
1.802TyrGlu: 1.802 ± 1.206
0.0TyrPhe: 0.0 ± 0.0
1.802TyrGly: 1.802 ± 1.206
0.0TyrHis: 0.0 ± 0.0
1.802TyrIle: 1.802 ± 1.354
1.802TyrLys: 1.802 ± 1.354
3.604TyrLeu: 3.604 ± 0.148
0.0TyrMet: 0.0 ± 0.0
1.802TyrAsn: 1.802 ± 1.354
3.604TyrPro: 3.604 ± 2.707
0.0TyrGln: 0.0 ± 0.0
10.811TyrArg: 10.811 ± 3.003
1.802TyrSer: 1.802 ± 1.206
0.0TyrThr: 0.0 ± 0.0
3.604TyrVal: 3.604 ± 0.148
1.802TyrTrp: 1.802 ± 1.354
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (556 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski