Amino acid dipepetide frequency for Avian gyrovirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.914AlaAla: 4.914 ± 3.209
2.457AlaCys: 2.457 ± 0.939
4.914AlaAsp: 4.914 ± 3.744
1.229AlaGlu: 1.229 ± 2.216
2.457AlaPhe: 2.457 ± 1.494
4.914AlaGly: 4.914 ± 1.878
0.0AlaHis: 0.0 ± 0.0
2.457AlaIle: 2.457 ± 0.939
3.686AlaLys: 3.686 ± 1.594
4.914AlaLeu: 4.914 ± 1.878
4.914AlaMet: 4.914 ± 2.988
1.229AlaAsn: 1.229 ± 0.747
11.057AlaPro: 11.057 ± 3.215
3.686AlaGln: 3.686 ± 2.241
4.914AlaArg: 4.914 ± 1.878
8.6AlaSer: 8.6 ± 3.531
7.371AlaThr: 7.371 ± 1.19
0.0AlaVal: 0.0 ± 0.0
1.229AlaTrp: 1.229 ± 0.747
1.229AlaTyr: 1.229 ± 2.216
0.0AlaXaa: 0.0 ± 0.0
Cys
1.229CysAla: 1.229 ± 1.209
0.0CysCys: 0.0 ± 0.0
1.229CysAsp: 1.229 ± 1.209
1.229CysGlu: 1.229 ± 1.209
2.457CysPhe: 2.457 ± 1.494
1.229CysGly: 1.229 ± 1.209
1.229CysHis: 1.229 ± 0.747
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.229CysPro: 1.229 ± 2.216
1.229CysGln: 1.229 ± 0.747
0.0CysArg: 0.0 ± 0.0
2.457CysSer: 2.457 ± 0.939
2.457CysThr: 2.457 ± 0.939
1.229CysVal: 1.229 ± 0.747
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.143AspAla: 6.143 ± 2.011
2.457AspCys: 2.457 ± 2.417
7.371AspAsp: 7.371 ± 4.168
3.686AspGlu: 3.686 ± 2.032
0.0AspPhe: 0.0 ± 0.0
2.457AspGly: 2.457 ± 2.269
0.0AspHis: 0.0 ± 0.0
1.229AspIle: 1.229 ± 0.747
0.0AspLys: 0.0 ± 0.0
4.914AspLeu: 4.914 ± 3.209
1.229AspMet: 1.229 ± 1.209
0.0AspAsn: 0.0 ± 0.0
6.143AspPro: 6.143 ± 2.925
1.229AspGln: 1.229 ± 1.209
3.686AspArg: 3.686 ± 1.191
2.457AspSer: 2.457 ± 1.855
3.686AspThr: 3.686 ± 1.191
0.0AspVal: 0.0 ± 0.0
0.0AspTrp: 0.0 ± 0.0
3.686AspTyr: 3.686 ± 2.032
0.0AspXaa: 0.0 ± 0.0
Glu
4.914GluAla: 4.914 ± 1.878
0.0GluCys: 0.0 ± 0.0
3.686GluAsp: 3.686 ± 3.626
1.229GluGlu: 1.229 ± 2.216
1.229GluPhe: 1.229 ± 0.747
3.686GluGly: 3.686 ± 1.594
2.457GluHis: 2.457 ± 1.855
3.686GluIle: 3.686 ± 4.319
0.0GluLys: 0.0 ± 0.0
2.457GluLeu: 2.457 ± 2.269
0.0GluMet: 0.0 ± 0.0
1.229GluAsn: 1.229 ± 2.216
0.0GluPro: 0.0 ± 0.0
2.457GluGln: 2.457 ± 1.494
1.229GluArg: 1.229 ± 2.216
0.0GluSer: 0.0 ± 0.0
4.914GluThr: 4.914 ± 2.136
0.0GluVal: 0.0 ± 0.0
2.457GluTrp: 2.457 ± 1.494
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.229PheAla: 1.229 ± 0.747
0.0PheCys: 0.0 ± 0.0
2.457PheAsp: 2.457 ± 0.939
0.0PheGlu: 0.0 ± 0.0
1.229PhePhe: 1.229 ± 0.747
2.457PheGly: 2.457 ± 1.494
1.229PheHis: 1.229 ± 0.747
1.229PheIle: 1.229 ± 0.747
0.0PheLys: 0.0 ± 0.0
1.229PheLeu: 1.229 ± 0.747
1.229PheMet: 1.229 ± 0.747
2.457PheAsn: 2.457 ± 1.494
3.686PhePro: 3.686 ± 2.241
2.457PheGln: 2.457 ± 0.939
3.686PheArg: 3.686 ± 2.241
2.457PheSer: 2.457 ± 1.855
0.0PheThr: 0.0 ± 0.0
2.457PheVal: 2.457 ± 0.939
0.0PheTrp: 0.0 ± 0.0
3.686PheTyr: 3.686 ± 2.241
0.0PheXaa: 0.0 ± 0.0
Gly
4.914GlyAla: 4.914 ± 1.753
2.457GlyCys: 2.457 ± 1.494
4.914GlyAsp: 4.914 ± 1.878
1.229GlyGlu: 1.229 ± 1.209
3.686GlyPhe: 3.686 ± 1.757
9.828GlyGly: 9.828 ± 2.947
1.229GlyHis: 1.229 ± 0.747
4.914GlyIle: 4.914 ± 1.025
2.457GlyLys: 2.457 ± 0.939
3.686GlyLeu: 3.686 ± 2.032
1.229GlyMet: 1.229 ± 0.747
2.457GlyAsn: 2.457 ± 1.494
3.686GlyPro: 3.686 ± 1.594
2.457GlyGln: 2.457 ± 0.939
6.143GlyArg: 6.143 ± 2.417
9.828GlySer: 9.828 ± 2.84
4.914GlyThr: 4.914 ± 1.878
3.686GlyVal: 3.686 ± 1.594
1.229GlyTrp: 1.229 ± 0.747
1.229GlyTyr: 1.229 ± 2.216
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.229HisAsp: 1.229 ± 1.209
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.457HisLys: 2.457 ± 0.939
2.457HisLeu: 2.457 ± 1.494
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
3.686HisPro: 3.686 ± 1.757
0.0HisGln: 0.0 ± 0.0
3.686HisArg: 3.686 ± 1.757
1.229HisSer: 1.229 ± 0.747
2.457HisThr: 2.457 ± 0.939
2.457HisVal: 2.457 ± 1.494
2.457HisTrp: 2.457 ± 0.939
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.686IleAla: 3.686 ± 2.032
1.229IleCys: 1.229 ± 1.209
0.0IleAsp: 0.0 ± 0.0
1.229IleGlu: 1.229 ± 2.216
1.229IlePhe: 1.229 ± 0.747
3.686IleGly: 3.686 ± 4.018
0.0IleHis: 0.0 ± 0.0
1.229IleIle: 1.229 ± 2.216
1.229IleLys: 1.229 ± 2.216
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
2.457IleAsn: 2.457 ± 0.939
2.457IlePro: 2.457 ± 0.939
2.457IleGln: 2.457 ± 1.855
1.229IleArg: 1.229 ± 1.209
0.0IleSer: 0.0 ± 0.0
6.143IleThr: 6.143 ± 3.535
1.229IleVal: 1.229 ± 0.747
1.229IleTrp: 1.229 ± 0.747
2.457IleTyr: 2.457 ± 1.855
0.0IleXaa: 0.0 ± 0.0
Lys
2.457LysAla: 2.457 ± 1.494
0.0LysCys: 0.0 ± 0.0
0.0LysAsp: 0.0 ± 0.0
2.457LysGlu: 2.457 ± 4.432
4.914LysPhe: 4.914 ± 1.753
3.686LysGly: 3.686 ± 1.191
1.229LysHis: 1.229 ± 0.747
1.229LysIle: 1.229 ± 0.747
2.457LysLys: 2.457 ± 2.269
4.914LysLeu: 4.914 ± 1.025
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
1.229LysPro: 1.229 ± 0.747
0.0LysGln: 0.0 ± 0.0
4.914LysArg: 4.914 ± 1.878
2.457LysSer: 2.457 ± 1.494
6.143LysThr: 6.143 ± 2.925
1.229LysVal: 1.229 ± 0.747
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
8.6LeuAla: 8.6 ± 0.167
0.0LeuCys: 0.0 ± 0.0
6.143LeuAsp: 6.143 ± 2.011
1.229LeuGlu: 1.229 ± 0.747
3.686LeuPhe: 3.686 ± 2.241
4.914LeuGly: 4.914 ± 1.878
0.0LeuHis: 0.0 ± 0.0
2.457LeuIle: 2.457 ± 2.269
1.229LeuLys: 1.229 ± 1.209
4.914LeuLeu: 4.914 ± 1.962
2.457LeuMet: 2.457 ± 1.277
1.229LeuAsn: 1.229 ± 1.209
4.914LeuPro: 4.914 ± 1.025
3.686LeuGln: 3.686 ± 1.191
11.057LeuArg: 11.057 ± 3.526
3.686LeuSer: 3.686 ± 1.594
6.143LeuThr: 6.143 ± 3.535
6.143LeuVal: 6.143 ± 2.011
0.0LeuTrp: 0.0 ± 0.0
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.457MetAla: 2.457 ± 1.494
1.229MetCys: 1.229 ± 0.747
0.0MetAsp: 0.0 ± 0.0
1.229MetGlu: 1.229 ± 0.747
0.0MetPhe: 0.0 ± 0.0
3.686MetGly: 3.686 ± 2.241
1.229MetHis: 1.229 ± 0.747
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.229MetLeu: 1.229 ± 0.747
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.229MetPro: 1.229 ± 0.747
3.686MetGln: 3.686 ± 1.757
1.229MetArg: 1.229 ± 1.209
2.457MetSer: 2.457 ± 0.939
1.229MetThr: 1.229 ± 0.747
0.0MetVal: 0.0 ± 0.0
2.457MetTrp: 2.457 ± 0.939
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.686AsnAla: 3.686 ± 1.191
0.0AsnCys: 0.0 ± 0.0
2.457AsnAsp: 2.457 ± 0.939
1.229AsnGlu: 1.229 ± 0.747
1.229AsnPhe: 1.229 ± 1.209
1.229AsnGly: 1.229 ± 1.209
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
3.686AsnLys: 3.686 ± 1.191
3.686AsnLeu: 3.686 ± 1.757
0.0AsnMet: 0.0 ± 0.0
1.229AsnAsn: 1.229 ± 1.209
6.143AsnPro: 6.143 ± 3.735
2.457AsnGln: 2.457 ± 1.494
0.0AsnArg: 0.0 ± 0.0
0.0AsnSer: 0.0 ± 0.0
0.0AsnThr: 0.0 ± 0.0
3.686AsnVal: 3.686 ± 2.241
2.457AsnTrp: 2.457 ± 0.939
3.686AsnTyr: 3.686 ± 3.626
0.0AsnXaa: 0.0 ± 0.0
Pro
3.686ProAla: 3.686 ± 4.018
0.0ProCys: 0.0 ± 0.0
1.229ProAsp: 1.229 ± 0.747
2.457ProGlu: 2.457 ± 1.855
0.0ProPhe: 0.0 ± 0.0
6.143ProGly: 6.143 ± 0.822
2.457ProHis: 2.457 ± 0.939
2.457ProIle: 2.457 ± 1.855
2.457ProLys: 2.457 ± 1.494
4.914ProLeu: 4.914 ± 3.209
3.686ProMet: 3.686 ± 1.622
4.914ProAsn: 4.914 ± 1.753
8.6ProPro: 8.6 ± 2.61
2.457ProGln: 2.457 ± 1.494
4.914ProArg: 4.914 ± 3.71
6.143ProSer: 6.143 ± 2.417
2.457ProThr: 2.457 ± 2.269
4.914ProVal: 4.914 ± 2.988
3.686ProTrp: 3.686 ± 2.241
6.143ProTyr: 6.143 ± 3.735
0.0ProXaa: 0.0 ± 0.0
Gln
2.457GlnAla: 2.457 ± 1.855
1.229GlnCys: 1.229 ± 1.209
3.686GlnAsp: 3.686 ± 1.191
3.686GlnGlu: 3.686 ± 3.626
0.0GlnPhe: 0.0 ± 0.0
7.371GlnGly: 7.371 ± 4.482
1.229GlnHis: 1.229 ± 0.747
1.229GlnIle: 1.229 ± 2.216
2.457GlnLys: 2.457 ± 1.855
6.143GlnLeu: 6.143 ± 2.011
0.0GlnMet: 0.0 ± 0.0
2.457GlnAsn: 2.457 ± 1.494
1.229GlnPro: 1.229 ± 0.747
1.229GlnGln: 1.229 ± 0.747
4.914GlnArg: 4.914 ± 1.753
0.0GlnSer: 0.0 ± 0.0
6.143GlnThr: 6.143 ± 0.822
1.229GlnVal: 1.229 ± 0.747
2.457GlnTrp: 2.457 ± 1.494
1.229GlnTyr: 1.229 ± 0.747
0.0GlnXaa: 0.0 ± 0.0
Arg
4.914ArgAla: 4.914 ± 3.209
1.229ArgCys: 1.229 ± 0.747
2.457ArgAsp: 2.457 ± 2.417
2.457ArgGlu: 2.457 ± 1.855
2.457ArgPhe: 2.457 ± 1.494
8.6ArgGly: 8.6 ± 2.903
2.457ArgHis: 2.457 ± 1.494
1.229ArgIle: 1.229 ± 0.747
3.686ArgLys: 3.686 ± 2.032
8.6ArgLeu: 8.6 ± 2.61
0.0ArgMet: 0.0 ± 0.0
1.229ArgAsn: 1.229 ± 1.209
3.686ArgPro: 3.686 ± 1.757
6.143ArgGln: 6.143 ± 3.849
23.342ArgArg: 23.342 ± 6.911
8.6ArgSer: 8.6 ± 5.311
3.686ArgThr: 3.686 ± 1.594
6.143ArgVal: 6.143 ± 1.391
4.914ArgTrp: 4.914 ± 1.753
3.686ArgTyr: 3.686 ± 2.241
0.0ArgXaa: 0.0 ± 0.0
Ser
8.6SerAla: 8.6 ± 2.573
1.229SerCys: 1.229 ± 2.216
0.0SerAsp: 0.0 ± 0.0
4.914SerGlu: 4.914 ± 1.025
4.914SerPhe: 4.914 ± 2.988
3.686SerGly: 3.686 ± 3.626
3.686SerHis: 3.686 ± 2.882
6.143SerIle: 6.143 ± 0.822
2.457SerLys: 2.457 ± 1.494
2.457SerLeu: 2.457 ± 1.855
0.0SerMet: 0.0 ± 0.0
2.457SerAsn: 2.457 ± 2.417
4.914SerPro: 4.914 ± 1.025
1.229SerGln: 1.229 ± 0.747
7.371SerArg: 7.371 ± 2.846
7.371SerSer: 7.371 ± 5.907
7.371SerThr: 7.371 ± 2.396
3.686SerVal: 3.686 ± 1.594
0.0SerTrp: 0.0 ± 0.0
1.229SerTyr: 1.229 ± 0.747
0.0SerXaa: 0.0 ± 0.0
Thr
3.686ThrAla: 3.686 ± 1.757
1.229ThrCys: 1.229 ± 1.209
6.143ThrAsp: 6.143 ± 2.011
1.229ThrGlu: 1.229 ± 1.209
1.229ThrPhe: 1.229 ± 0.747
6.143ThrGly: 6.143 ± 1.391
1.229ThrHis: 1.229 ± 1.209
2.457ThrIle: 2.457 ± 2.269
4.914ThrLys: 4.914 ± 1.753
8.6ThrLeu: 8.6 ± 2.61
4.914ThrMet: 4.914 ± 2.988
1.229ThrAsn: 1.229 ± 0.747
3.686ThrPro: 3.686 ± 1.594
7.371ThrGln: 7.371 ± 2.383
4.914ThrArg: 4.914 ± 4.537
4.914ThrSer: 4.914 ± 6.479
8.6ThrThr: 8.6 ± 7.672
2.457ThrVal: 2.457 ± 0.939
2.457ThrTrp: 2.457 ± 2.417
1.229ThrTyr: 1.229 ± 0.747
0.0ThrXaa: 0.0 ± 0.0
Val
3.686ValAla: 3.686 ± 2.241
1.229ValCys: 1.229 ± 0.747
1.229ValAsp: 1.229 ± 1.209
3.686ValGlu: 3.686 ± 1.191
1.229ValPhe: 1.229 ± 0.747
1.229ValGly: 1.229 ± 2.216
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
4.914ValLys: 4.914 ± 2.988
2.457ValLeu: 2.457 ± 1.855
1.229ValMet: 1.229 ± 0.747
6.143ValAsn: 6.143 ± 2.011
2.457ValPro: 2.457 ± 1.494
1.229ValGln: 1.229 ± 0.747
6.143ValArg: 6.143 ± 0.822
1.229ValSer: 1.229 ± 1.209
3.686ValThr: 3.686 ± 2.882
2.457ValVal: 2.457 ± 1.494
0.0ValTrp: 0.0 ± 0.0
1.229ValTyr: 1.229 ± 0.747
0.0ValXaa: 0.0 ± 0.0
Trp
2.457TrpAla: 2.457 ± 1.494
1.229TrpCys: 1.229 ± 0.747
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.229TrpPhe: 1.229 ± 1.209
0.0TrpGly: 0.0 ± 0.0
1.229TrpHis: 1.229 ± 0.747
1.229TrpIle: 1.229 ± 0.747
0.0TrpLys: 0.0 ± 0.0
3.686TrpLeu: 3.686 ± 1.191
1.229TrpMet: 1.229 ± 0.941
0.0TrpAsn: 0.0 ± 0.0
1.229TrpPro: 1.229 ± 0.747
4.914TrpGln: 4.914 ± 1.753
3.686TrpArg: 3.686 ± 1.191
3.686TrpSer: 3.686 ± 2.241
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
1.229TrpTrp: 1.229 ± 0.747
2.457TrpTyr: 2.457 ± 0.939
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.457TyrAla: 2.457 ± 1.855
0.0TyrCys: 0.0 ± 0.0
1.229TyrAsp: 1.229 ± 1.209
1.229TyrGlu: 1.229 ± 2.216
0.0TyrPhe: 0.0 ± 0.0
1.229TyrGly: 1.229 ± 0.747
1.229TyrHis: 1.229 ± 0.747
0.0TyrIle: 0.0 ± 0.0
1.229TyrLys: 1.229 ± 0.747
1.229TyrLeu: 1.229 ± 1.209
0.0TyrMet: 0.0 ± 0.0
6.143TyrAsn: 6.143 ± 2.417
3.686TyrPro: 3.686 ± 2.241
0.0TyrGln: 0.0 ± 0.0
2.457TyrArg: 2.457 ± 0.939
6.143TyrSer: 6.143 ± 2.011
1.229TyrThr: 1.229 ± 0.747
2.457TyrVal: 2.457 ± 1.855
1.229TyrTrp: 1.229 ± 0.747
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski