Amino acid dipeptide frequency for Gemycircularvirus HV-GcV2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
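The counting described above can be sketched as follows. This is only an illustration under stated assumptions, not the database's own procedure: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, any non-standard letter is mapped to X (Xaa), and overlapping pairs are counted within each protein only (how pairs spanning protein boundaries are handled by the database is not stated here).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
EXPECTED_UNIFORM = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: overlapping adjacent pairs are counted within each sequence
    only, and every non-standard letter is replaced by 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_UNIFORM:
        return "over"
    if freq_permille < EXPECTED_UNIFORM:
        return "under"
    return "expected"


# Toy input (made-up sequences, not the HV-GcV2 proteome).
toy = ["MSLSL", "GA"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f} per mille ({classify(value)})")
```

With only a handful of proteins, most of the 400 dipeptides are either absent or appear once or twice, so a comparison with the uniform 2.5‰ expectation is coarse.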

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
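As a small worked example of the unit conversion, the snippet below converts a per mille value from the table into a fraction and a percentage and relates it to the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0


ser_leu = 16.616            # SerLeu value from the table, in per mille
uniform = 1000.0 / 400      # uniform expectation: 2.5 per mille (0.25 %)

print(f"fraction: {permille_to_fraction(ser_leu):.6f}")
print(f"percent:  {permille_to_percent(ser_leu):.4f}")
print(f"ratio to uniform expectation: {ser_leu / uniform:.2f}")
```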

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.511 ± 0.851
AlaCys: 1.511 ± 0.851
AlaAsp: 6.042 ± 1.719
AlaGlu: 4.532 ± 0.957
AlaPhe: 3.021 ± 0.569
AlaGly: 1.511 ± 0.851
AlaHis: 0.0 ± 0.0
AlaIle: 1.511 ± 0.851
AlaLys: 1.511 ± 0.851
AlaLeu: 4.532 ± 2.553
AlaMet: 1.511 ± 0.851
AlaAsn: 4.532 ± 1.511
AlaPro: 3.021 ± 1.702
AlaGln: 1.511 ± 0.851
AlaArg: 4.532 ± 1.511
AlaSer: 6.042 ± 2.517
AlaThr: 6.042 ± 1.719
AlaVal: 4.532 ± 2.882
AlaTrp: 3.021 ± 1.702
AlaTyr: 4.532 ± 2.553
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.511 ± 0.851
CysCys: 0.0 ± 0.0
CysAsp: 1.511 ± 0.851
CysGlu: 0.0 ± 0.0
CysPhe: 1.511 ± 1.087
CysGly: 1.511 ± 0.851
CysHis: 0.0 ± 0.0
CysIle: 1.511 ± 0.851
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.511 ± 1.087
CysPro: 3.021 ± 0.569
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.511 ± 0.851
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.511 ± 0.851
CysTyr: 1.511 ± 0.851
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.511 ± 0.851
AspCys: 0.0 ± 0.0
AspAsp: 4.532 ± 1.511
AspGlu: 4.532 ± 0.957
AspPhe: 3.021 ± 0.569
AspGly: 7.553 ± 2.32
AspHis: 1.511 ± 0.851
AspIle: 3.021 ± 0.569
AspLys: 1.511 ± 0.851
AspLeu: 1.511 ± 2.911
AspMet: 1.511 ± 0.851
AspAsn: 1.511 ± 1.087
AspPro: 7.553 ± 5.433
AspGln: 0.0 ± 0.0
AspArg: 3.021 ± 2.173
AspSer: 3.021 ± 2.173
AspThr: 10.574 ± 2.961
AspVal: 9.063 ± 5.763
AspTrp: 4.532 ± 0.957
AspTyr: 4.532 ± 0.957
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.511 ± 0.851
GluCys: 6.042 ± 1.719
GluAsp: 6.042 ± 1.138
GluGlu: 1.511 ± 0.851
GluPhe: 1.511 ± 0.851
GluGly: 4.532 ± 0.957
GluHis: 1.511 ± 0.851
GluIle: 4.532 ± 2.413
GluLys: 1.511 ± 1.087
GluLeu: 3.021 ± 1.702
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.511 ± 0.851
GluGln: 0.0 ± 0.0
GluArg: 1.511 ± 0.851
GluSer: 9.063 ± 1.706
GluThr: 1.511 ± 1.087
GluVal: 3.021 ± 1.702
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.511 ± 1.087
PheCys: 1.511 ± 1.087
PheAsp: 4.532 ± 2.553
PheGlu: 6.042 ± 1.719
PhePhe: 4.532 ± 2.553
PheGly: 3.021 ± 2.769
PheHis: 1.511 ± 0.851
PheIle: 0.0 ± 0.0
PheLys: 0.0 ± 0.0
PheLeu: 0.0 ± 0.0
PheMet: 1.511 ± 0.851
PheAsn: 0.0 ± 0.0
PhePro: 3.021 ± 0.569
PheGln: 1.511 ± 1.087
PheArg: 4.532 ± 3.26
PheSer: 3.021 ± 0.569
PheThr: 4.532 ± 1.511
PheVal: 1.511 ± 0.851
PheTrp: 4.532 ± 0.957
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.085 ± 2.221
GlyCys: 0.0 ± 0.0
GlyAsp: 7.553 ± 5.229
GlyGlu: 3.021 ± 1.702
GlyPhe: 0.0 ± 0.0
GlyGly: 12.085 ± 5.201
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 7.553 ± 2.32
GlyLeu: 12.085 ± 3.949
GlyMet: 3.021 ± 2.138
GlyAsn: 7.553 ± 2.009
GlyPro: 4.532 ± 2.553
GlyGln: 1.511 ± 1.087
GlyArg: 3.021 ± 1.702
GlySer: 6.042 ± 2.57
GlyThr: 6.042 ± 5.731
GlyVal: 4.532 ± 0.957
GlyTrp: 1.511 ± 1.087
GlyTyr: 4.532 ± 1.511
GlyXaa: 0.0 ± 0.0
His
HisAla: 7.553 ± 4.255
HisCys: 1.511 ± 0.851
HisAsp: 1.511 ± 0.851
HisGlu: 1.511 ± 1.087
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.511 ± 0.851
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.021 ± 1.702
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 4.532 ± 5.618
HisVal: 0.0 ± 0.0
HisTrp: 1.511 ± 0.851
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.021 ± 0.569
IleCys: 1.511 ± 1.087
IleAsp: 0.0 ± 0.0
IleGlu: 0.0 ± 0.0
IlePhe: 1.511 ± 1.087
IleGly: 1.511 ± 2.911
IleHis: 1.511 ± 0.851
IleIle: 1.511 ± 0.851
IleLys: 3.021 ± 0.569
IleLeu: 1.511 ± 1.087
IleMet: 1.511 ± 0.851
IleAsn: 0.0 ± 0.0
IlePro: 1.511 ± 1.087
IleGln: 1.511 ± 2.911
IleArg: 3.021 ± 2.173
IleSer: 1.511 ± 0.851
IleThr: 3.021 ± 0.569
IleVal: 4.532 ± 0.957
IleTrp: 0.0 ± 0.0
IleTyr: 3.021 ± 0.569
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.511 ± 1.087
LysCys: 1.511 ± 0.851
LysAsp: 1.511 ± 0.851
LysGlu: 1.511 ± 1.087
LysPhe: 4.532 ± 2.553
LysGly: 7.553 ± 1.953
LysHis: 0.0 ± 0.0
LysIle: 1.511 ± 1.087
LysLys: 1.511 ± 1.087
LysLeu: 1.511 ± 0.851
LysMet: 0.0 ± 0.946
LysAsn: 1.511 ± 1.087
LysPro: 1.511 ± 0.851
LysGln: 1.511 ± 1.087
LysArg: 4.532 ± 2.413
LysSer: 0.0 ± 0.0
LysThr: 3.021 ± 0.569
LysVal: 0.0 ± 0.0
LysTrp: 0.0 ± 0.0
LysTyr: 4.532 ± 0.957
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.511 ± 0.851
LeuCys: 0.0 ± 0.0
LeuAsp: 7.553 ± 1.324
LeuGlu: 6.042 ± 1.719
LeuPhe: 3.021 ± 0.569
LeuGly: 12.085 ± 5.06
LeuHis: 4.532 ± 2.553
LeuIle: 3.021 ± 0.569
LeuLys: 4.532 ± 0.957
LeuLeu: 6.042 ± 1.138
LeuMet: 3.021 ± 2.769
LeuAsn: 3.021 ± 2.173
LeuPro: 1.511 ± 0.851
LeuGln: 0.0 ± 0.0
LeuArg: 7.553 ± 1.953
LeuSer: 1.511 ± 0.851
LeuThr: 6.042 ± 2.209
LeuVal: 6.042 ± 3.224
LeuTrp: 1.511 ± 0.851
LeuTyr: 1.511 ± 0.851
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.021 ± 0.569
MetCys: 0.0 ± 0.0
MetAsp: 1.511 ± 2.911
MetGlu: 0.0 ± 0.0
MetPhe: 3.021 ± 0.569
MetGly: 3.021 ± 0.569
MetHis: 0.0 ± 0.0
MetIle: 1.511 ± 1.087
MetLys: 0.0 ± 0.0
MetLeu: 3.021 ± 1.702
MetMet: 1.511 ± 1.087
MetAsn: 3.021 ± 2.173
MetPro: 1.511 ± 2.911
MetGln: 1.511 ± 0.851
MetArg: 0.0 ± 0.0
MetSer: 3.021 ± 0.569
MetThr: 1.511 ± 2.911
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 1.511 ± 0.851
AsnAsp: 4.532 ± 3.26
AsnGlu: 1.511 ± 1.087
AsnPhe: 0.0 ± 0.0
AsnGly: 6.042 ± 2.57
AsnHis: 0.0 ± 0.0
AsnIle: 3.021 ± 0.569
AsnLys: 3.021 ± 2.173
AsnLeu: 0.0 ± 0.0
AsnMet: 0.0 ± 0.0
AsnAsn: 1.511 ± 1.087
AsnPro: 6.042 ± 2.57
AsnGln: 1.511 ± 1.087
AsnArg: 0.0 ± 0.0
AsnSer: 1.511 ± 1.087
AsnThr: 3.021 ± 2.173
AsnVal: 1.511 ± 1.087
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.021 ± 0.569
ProCys: 0.0 ± 0.0
ProAsp: 6.042 ± 4.346
ProGlu: 7.553 ± 3.732
ProPhe: 1.511 ± 1.087
ProGly: 3.021 ± 0.569
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 0.0 ± 0.0
ProLeu: 1.511 ± 0.851
ProMet: 1.511 ± 1.087
ProAsn: 1.511 ± 0.851
ProPro: 3.021 ± 1.702
ProGln: 0.0 ± 0.0
ProArg: 3.021 ± 1.702
ProSer: 7.553 ± 1.324
ProThr: 6.042 ± 1.138
ProVal: 6.042 ± 1.719
ProTrp: 0.0 ± 0.0
ProTyr: 1.511 ± 0.851
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 1.511 ± 1.087
GlnGlu: 0.0 ± 0.0
GlnPhe: 1.511 ± 0.851
GlnGly: 3.021 ± 0.569
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 1.511 ± 1.087
GlnLeu: 3.021 ± 0.569
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.511 ± 1.087
GlnSer: 1.511 ± 0.851
GlnThr: 0.0 ± 0.0
GlnVal: 1.511 ± 2.911
GlnTrp: 3.021 ± 2.173
GlnTyr: 1.511 ± 1.087
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.021 ± 0.569
ArgCys: 0.0 ± 0.0
ArgAsp: 3.021 ± 0.569
ArgGlu: 4.532 ± 0.957
ArgPhe: 1.511 ± 2.911
ArgGly: 6.042 ± 2.57
ArgHis: 0.0 ± 0.0
ArgIle: 4.532 ± 3.211
ArgLys: 3.021 ± 1.702
ArgLeu: 9.063 ± 1.654
ArgMet: 1.511 ± 1.087
ArgAsn: 0.0 ± 0.0
ArgPro: 3.021 ± 1.702
ArgGln: 0.0 ± 0.0
ArgArg: 7.553 ± 2.009
ArgSer: 6.042 ± 1.138
ArgThr: 4.532 ± 3.26
ArgVal: 6.042 ± 1.138
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.511 ± 0.851
SerCys: 0.0 ± 0.0
SerAsp: 1.511 ± 1.087
SerGlu: 0.0 ± 0.0
SerPhe: 4.532 ± 0.957
SerGly: 9.063 ± 3.023
SerHis: 9.063 ± 4.349
SerIle: 1.511 ± 0.851
SerLys: 1.511 ± 1.087
SerLeu: 16.616 ± 3.179
SerMet: 0.0 ± 0.0
SerAsn: 6.042 ± 4.346
SerPro: 1.511 ± 1.087
SerGln: 4.532 ± 0.957
SerArg: 1.511 ± 0.851
SerSer: 1.511 ± 1.087
SerThr: 6.042 ± 2.57
SerVal: 4.532 ± 3.26
SerTrp: 0.0 ± 0.0
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.042 ± 1.138
ThrCys: 0.0 ± 0.0
ThrAsp: 4.532 ± 2.882
ThrGlu: 0.0 ± 0.0
ThrPhe: 1.511 ± 0.851
ThrGly: 7.553 ± 8.505
ThrHis: 1.511 ± 2.911
ThrIle: 4.532 ± 1.511
ThrLys: 1.511 ± 2.911
ThrLeu: 10.574 ± 1.566
ThrMet: 3.021 ± 2.173
ThrAsn: 1.511 ± 1.087
ThrPro: 3.021 ± 0.569
ThrGln: 1.511 ± 1.087
ThrArg: 7.553 ± 1.953
ThrSer: 9.063 ± 1.706
ThrThr: 12.085 ± 4.352
ThrVal: 1.511 ± 1.087
ThrTrp: 1.511 ± 1.087
ThrTyr: 6.042 ± 2.57
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.021 ± 2.769
ValCys: 1.511 ± 0.851
ValAsp: 9.063 ± 1.654
ValGlu: 3.021 ± 0.569
ValPhe: 6.042 ± 1.138
ValGly: 4.532 ± 2.882
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 1.511 ± 1.087
ValLeu: 1.511 ± 0.851
ValMet: 4.532 ± 2.996
ValAsn: 0.0 ± 0.0
ValPro: 3.021 ± 1.702
ValGln: 0.0 ± 0.0
ValArg: 6.042 ± 1.138
ValSer: 6.042 ± 1.138
ValThr: 3.021 ± 2.769
ValVal: 4.532 ± 2.413
ValTrp: 1.511 ± 0.851
ValTyr: 1.511 ± 0.851
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 6.042 ± 3.404
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.511 ± 1.087
TrpGly: 1.511 ± 0.851
TrpHis: 1.511 ± 1.087
TrpIle: 0.0 ± 0.0
TrpLys: 1.511 ± 0.851
TrpLeu: 3.021 ± 1.702
TrpMet: 0.0 ± 0.0
TrpAsn: 1.511 ± 1.087
TrpPro: 0.0 ± 0.0
TrpGln: 1.511 ± 1.087
TrpArg: 3.021 ± 0.569
TrpSer: 1.511 ± 1.087
TrpThr: 0.0 ± 0.0
TrpVal: 1.511 ± 0.851
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 6.042 ± 3.404
TyrCys: 0.0 ± 0.0
TyrAsp: 1.511 ± 1.087
TyrGlu: 3.021 ± 0.569
TyrPhe: 3.021 ± 0.569
TyrGly: 1.511 ± 0.851
TyrHis: 0.0 ± 0.0
TyrIle: 3.021 ± 0.569
TyrLys: 6.042 ± 1.138
TyrLeu: 0.0 ± 0.0
TyrMet: 1.511 ± 1.087
TyrAsn: 0.0 ± 0.0
TyrPro: 1.511 ± 0.851
TyrGln: 1.511 ± 1.087
TyrArg: 1.511 ± 0.851
TyrSer: 1.511 ± 0.851
TyrThr: 3.021 ± 0.569
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.021 ± 2.173
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (663 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
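A protein-level bootstrap of this kind can be sketched as below. This is a minimal illustration, not the database's implementation: the function names are hypothetical, the use of the sample standard deviation as the reported error is an assumption (the note only states that bootstrapping was done 100 times at the protein level), and dipeptides are counted within each protein only.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Frequencies in per mille of overlapping pairs, counted per sequence."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Each round draws len(sequences) whole proteins with replacement and
    recomputes the frequency. Assumption: the error is taken as the sample
    standard deviation over the rounds.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, not the HV-GcV2 proteome).
toy = ["MSLSL", "GA", "SLSLSL"]
point = dipeptide_permille(toy)["SL"]
error = bootstrap_error(toy, "SL")
print(f"SL: {point:.3f} ± {error:.3f} per mille")
```

Because whole proteins are resampled, a proteome with only three proteins yields few distinct resamples, which is one reason the errors can be large relative to the values.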

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski