Amino acid dipeptide frequency for Giant panda associated gemycircularvirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore have a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
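The counting described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used by the database: the function names are hypothetical, sequences are assumed to be given in one-letter code, pairs are assumed to overlap and never to span two proteins, and any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter code


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs, counted within each protein separately.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def uniform_expectation_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


freqs = dipeptide_per_mille(["AAC", "AC"])  # pairs: AA, AC, AC
print(freqs)
print(uniform_expectation_per_mille(20))  # 400 possible dipeptides
print(uniform_expectation_per_mille(21))  # 441 possible dipeptides with Xaa
```

With 20 amino acids the uniform expectation is 1000/400 = 2.5‰, which is the 0.25% quoted above; with Xaa included it drops to 1000/441, roughly 2.27‰.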

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
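As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation. The helper names are hypothetical, and using the uniform 2.5‰ value as the cut-off is an assumption; the exact threshold behind the red and blue marking is not stated on this page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuSer (14.493) and AlaCys (0.0).
print(per_mille_to_fraction(14.493), classify(14.493))
print(per_mille_to_fraction(0.0), classify(0.0))
```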

For more information see sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second; each cell shows the frequency ± error.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.831 ± 2.438 | 0.0 ± 0.0 | 4.831 ± 3.582 | 4.831 ± 2.383 | 2.415 ± 1.791 | 3.623 ± 1.587 | 0.0 ± 0.0 | 2.415 ± 1.229 | 0.0 ± 0.0 | 4.831 ± 0.68 | 1.208 ± 1.151 | 7.246 ± 2.745 | 4.831 ± 2.418 | 2.415 ± 1.229 | 9.662 ± 1.949 | 3.623 ± 1.746 | 3.623 ± 0.572 | 3.623 ± 2.687 | 1.208 ± 0.896 | 1.208 ± 0.907 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 1.208 ± 0.907 | 1.208 ± 1.481 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.415 ± 1.815 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.208 ± 0.907 | 0.0 ± 0.0 |
| Asp | 1.208 ± 0.896 | 0.0 ± 0.0 | 3.623 ± 0.572 | 2.415 ± 1.34 | 3.623 ± 0.572 | 6.039 ± 3.241 | 0.0 ± 0.0 | 2.415 ± 1.791 | 0.0 ± 0.0 | 2.415 ± 0.915 | 3.623 ± 2.012 | 0.0 ± 0.0 | 4.831 ± 2.227 | 1.208 ± 0.907 | 1.208 ± 0.907 | 2.415 ± 1.229 | 2.415 ± 1.815 | 3.623 ± 2.687 | 2.415 ± 0.915 | 6.039 ± 2.427 | 0.0 ± 0.0 |
| Glu | 3.623 ± 0.572 | 1.208 ± 0.896 | 1.208 ± 0.896 | 3.623 ± 1.587 | 3.623 ± 2.687 | 3.623 ± 1.567 | 1.208 ± 0.896 | 0.0 ± 0.0 | 1.208 ± 0.907 | 6.039 ± 3.241 | 0.0 ± 0.0 | 7.246 ± 1.763 | 0.0 ± 0.0 | 4.831 ± 2.459 | 3.623 ± 1.746 | 2.415 ± 1.229 | 2.415 ± 1.34 | 4.831 ± 2.05 | 4.831 ± 2.227 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Phe | 2.415 ± 1.791 | 1.208 ± 0.896 | 2.415 ± 1.791 | 1.208 ± 0.896 | 2.415 ± 0.915 | 3.623 ± 1.56 | 0.0 ± 0.0 | 1.208 ± 1.481 | 0.0 ± 0.0 | 3.623 ± 2.675 | 1.208 ± 0.896 | 4.831 ± 2.418 | 1.208 ± 0.896 | 0.0 ± 0.0 | 3.623 ± 1.567 | 2.415 ± 1.815 | 1.208 ± 0.907 | 1.208 ± 0.896 | 2.415 ± 1.229 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 8.454 ± 1.188 | 1.208 ± 0.907 | 6.039 ± 1.703 | 3.623 ± 0.572 | 0.0 ± 0.0 | 10.87 ± 2.599 | 1.208 ± 0.896 | 2.415 ± 0.915 | 3.623 ± 2.687 | 8.454 ± 3.239 | 4.831 ± 2.418 | 4.831 ± 1.83 | 2.415 ± 1.229 | 1.208 ± 0.896 | 6.039 ± 1.483 | 7.246 ± 1.144 | 6.039 ± 1.856 | 2.415 ± 1.229 | 2.415 ± 1.229 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| His | 7.246 ± 3.851 | 0.0 ± 0.0 | 2.415 ± 1.791 | 1.208 ± 0.907 | 0.0 ± 0.0 | 1.208 ± 1.481 | 2.415 ± 1.229 | 1.208 ± 0.896 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 1.208 ± 1.481 | 1.208 ± 0.896 | 1.208 ± 1.481 | 0.0 ± 0.0 | 3.623 ± 2.675 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 3.623 ± 1.567 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.623 ± 2.687 | 2.415 ± 0.915 | 1.208 ± 0.907 | 0.0 ± 0.0 | 4.831 ± 1.83 | 1.208 ± 0.907 | 2.415 ± 1.815 | 0.0 ± 0.0 | 2.415 ± 1.34 | 1.208 ± 1.481 | 3.623 ± 2.675 | 1.208 ± 0.907 | 7.246 ± 5.141 | 1.208 ± 0.896 | 1.208 ± 0.896 | 1.208 ± 0.896 | 1.208 ± 0.896 | 0.0 ± 0.0 |
| Lys | 1.208 ± 0.896 | 0.0 ± 0.0 | 1.208 ± 0.896 | 1.208 ± 0.896 | 2.415 ± 1.791 | 1.208 ± 0.907 | 1.208 ± 1.481 | 0.0 ± 0.0 | 2.415 ± 0.915 | 1.208 ± 0.896 | 3.623 ± 0.863 | 1.208 ± 0.907 | 1.208 ± 0.896 | 1.208 ± 0.907 | 3.623 ± 2.722 | 2.415 ± 1.34 | 4.831 ± 0.889 | 2.415 ± 0.915 | 2.415 ± 1.791 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Leu | 4.831 ± 1.83 | 0.0 ± 0.0 | 3.623 ± 0.572 | 7.246 ± 3.688 | 2.415 ± 1.815 | 6.039 ± 4.478 | 1.208 ± 0.896 | 2.415 ± 1.34 | 2.415 ± 1.34 | 7.246 ± 5.011 | 3.623 ± 4.443 | 3.623 ± 2.675 | 3.623 ± 1.587 | 1.208 ± 1.481 | 8.454 ± 4.95 | 14.493 ± 6.477 | 2.415 ± 1.229 | 6.039 ± 2.405 | 0.0 ± 0.0 | 2.415 ± 0.915 | 0.0 ± 0.0 |
| Met | 1.208 ± 0.907 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.208 ± 1.481 | 0.0 ± 0.0 | 4.831 ± 2.05 | 1.208 ± 0.896 | 1.208 ± 1.481 | 0.0 ± 0.0 | 3.623 ± 2.675 | 0.0 ± 0.0 | 1.208 ± 0.907 | 1.208 ± 0.907 | 1.208 ± 1.481 | 3.623 ± 1.587 | 6.039 ± 1.483 | 0.0 ± 0.0 | 2.415 ± 1.229 | 1.208 ± 0.907 | 1.208 ± 0.907 | 0.0 ± 0.0 |
| Asn | 3.623 ± 1.587 | 1.208 ± 0.896 | 2.415 ± 0.915 | 4.831 ± 1.83 | 1.208 ± 0.907 | 4.831 ± 4.111 | 2.415 ± 1.791 | 3.623 ± 1.587 | 2.415 ± 0.915 | 4.831 ± 2.681 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.415 ± 1.815 | 1.208 ± 0.907 | 1.208 ± 0.907 | 3.623 ± 2.57 | 1.208 ± 0.896 | 2.415 ± 1.815 | 1.208 ± 0.907 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Pro | 0.0 ± 0.0 | 1.208 ± 1.481 | 2.415 ± 1.229 | 6.039 ± 2.405 | 0.0 ± 0.0 | 2.415 ± 0.915 | 0.0 ± 0.0 | 1.208 ± 1.481 | 1.208 ± 0.896 | 1.208 ± 0.907 | 2.415 ± 1.34 | 1.208 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.208 ± 0.896 | 10.87 ± 0.61 | 2.415 ± 1.229 | 2.415 ± 1.791 | 2.415 ± 1.815 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gln | 1.208 ± 0.896 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.415 ± 2.962 | 2.415 ± 1.791 | 3.623 ± 1.746 | 0.0 ± 0.0 | 1.208 ± 0.896 | 2.415 ± 2.962 | 3.623 ± 0.572 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.415 ± 0.915 | 0.0 ± 0.0 | 6.039 ± 3.748 | 1.208 ± 0.907 | 2.415 ± 1.229 | 1.208 ± 0.907 | 2.415 ± 0.915 | 0.0 ± 0.0 |
| Arg | 1.208 ± 1.481 | 0.0 ± 0.0 | 4.831 ± 1.83 | 3.623 ± 1.56 | 1.208 ± 0.896 | 7.246 ± 3.175 | 2.415 ± 1.229 | 3.623 ± 1.746 | 7.246 ± 3.175 | 2.415 ± 2.962 | 0.0 ± 0.0 | 0.0 ± 0.0 | 7.246 ± 1.042 | 1.208 ± 0.907 | 8.454 ± 5.071 | 10.87 ± 4.179 | 9.662 ± 4.693 | 2.415 ± 1.815 | 0.0 ± 0.0 | 4.831 ± 2.418 | 0.0 ± 0.0 |
| Ser | 4.831 ± 0.889 | 0.0 ± 0.0 | 4.831 ± 0.68 | 4.831 ± 2.05 | 3.623 ± 0.572 | 8.454 ± 0.881 | 2.415 ± 1.229 | 4.831 ± 0.68 | 2.415 ± 1.34 | 13.285 ± 6.587 | 2.415 ± 1.34 | 4.831 ± 0.889 | 3.623 ± 4.443 | 3.623 ± 1.56 | 10.87 ± 1.025 | 8.454 ± 6.492 | 9.662 ± 5.831 | 7.246 ± 3.239 | 3.623 ± 2.57 | 2.415 ± 0.915 | 0.0 ± 0.0 |
| Thr | 6.039 ± 4.537 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 3.623 ± 2.675 | 2.415 ± 1.34 | 3.623 ± 2.57 | 1.208 ± 0.907 | 1.208 ± 0.896 | 4.831 ± 4.111 | 2.415 ± 1.34 | 3.623 ± 2.722 | 1.208 ± 0.896 | 2.415 ± 0.915 | 7.246 ± 4.084 | 7.246 ± 4.084 | 4.831 ± 0.889 | 3.623 ± 1.587 | 1.208 ± 0.896 | 3.623 ± 1.587 | 0.0 ± 0.0 |
| Val | 4.831 ± 2.227 | 3.623 ± 2.722 | 3.623 ± 1.746 | 2.415 ± 1.791 | 2.415 ± 1.229 | 7.246 ± 1.042 | 2.415 ± 1.791 | 1.208 ± 0.896 | 4.831 ± 2.383 | 4.831 ± 0.68 | 2.415 ± 0.915 | 0.0 ± 0.0 | 1.208 ± 0.896 | 2.415 ± 1.229 | 1.208 ± 0.907 | 2.415 ± 2.962 | 3.623 ± 1.587 | 4.831 ± 0.68 | 0.0 ± 0.0 | 3.623 ± 1.567 | 0.0 ± 0.0 |
| Trp | 2.415 ± 0.915 | 0.0 ± 0.0 | 2.415 ± 1.791 | 1.208 ± 0.896 | 0.0 ± 0.0 | 2.415 ± 1.791 | 3.623 ± 1.746 | 1.208 ± 1.481 | 1.208 ± 0.896 | 4.831 ± 2.227 | 1.208 ± 0.907 | 1.208 ± 0.907 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.623 ± 1.567 | 1.208 ± 1.481 | 0.0 ± 0.0 | 1.208 ± 0.907 | 0.0 ± 0.0 | 1.208 ± 1.481 | 0.0 ± 0.0 |
| Tyr | 3.623 ± 2.687 | 0.0 ± 0.0 | 2.415 ± 1.815 | 0.0 ± 0.0 | 1.208 ± 0.896 | 1.208 ± 0.896 | 0.0 ± 0.0 | 2.415 ± 0.915 | 1.208 ± 0.907 | 2.415 ± 1.34 | 1.208 ± 0.907 | 0.0 ± 0.0 | 1.208 ± 0.896 | 0.0 ± 0.0 | 3.623 ± 1.587 | 3.623 ± 2.722 | 2.415 ± 1.815 | 3.623 ± 1.567 | 1.208 ± 0.907 | 2.415 ± 1.815 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (829 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
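A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative reading of the note, not the database's actual procedure: the function names are hypothetical, whole proteins are assumed to be resampled with replacement, and the error is assumed to be the sample standard deviation of the per mille frequency across replicates.

```python
import random
import statistics
from collections import Counter


def pair_frequencies(proteins):
    """Per mille frequency of each adjacent pair, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_frequencies(resampled).get(pair, 0.0))
    return statistics.stdev(samples)


toy_proteome = ["AAC", "CCA", "ACA"]  # made-up sequences, not the real proteome
print(bootstrap_error(toy_proteome, "AC"))
```

With only 3 proteins in this proteome, each replicate can contain only a handful of distinct protein combinations, which is one plausible reason the same error values recur so often in the table.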

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski