Amino acid dipeptide frequency for Circoviridae 21 LDMD-2013

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
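The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names, the one-letter alphabet, the use of "X" for Xaa, and the choice to count overlapping pairs within each protein are all assumptions, and the toy sequences are made up.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; "X" stands in for Xaa (assumption).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Any letter outside the standard 20 is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def uniform_expectation_permille(n_residues=20):
    """Frequency of each dipeptide if all n*n pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


toy = ["MAAGT", "AAKXT"]  # made-up sequences, not the Circoviridae proteome
freqs = dipeptide_permille(toy)
expected = uniform_expectation_permille(20)  # 2.5 per mille, i.e. 0.25%
over = sorted(dp for dp, f in freqs.items() if f > expected)
```

Here `over` lists the dipeptides that would be marked as overrepresented relative to the uniform expectation; those with a frequency below `expected` would be the underrepresented ones.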

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
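As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical; only the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


ala_ala = 13.928  # AlaAla value from the table, in per mille
fraction = permille_to_fraction(ala_ala)  # about 0.013928
percent = permille_to_percent(ala_ala)    # about 1.3928 %
```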

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.928 ± 10.205
AlaCys: 1.393 ± 1.192
AlaAsp: 0.0 ± 0.0
AlaGlu: 4.178 ± 2.828
AlaPhe: 0.0 ± 0.0
AlaGly: 8.357 ± 1.907
AlaHis: 1.393 ± 0.943
AlaIle: 4.178 ± 1.768
AlaLys: 2.786 ± 1.885
AlaLeu: 2.786 ± 1.885
AlaMet: 1.393 ± 0.943
AlaAsn: 1.393 ± 0.943
AlaPro: 2.786 ± 2.041
AlaGln: 4.178 ± 1.235
AlaArg: 5.571 ± 2.059
AlaSer: 5.571 ± 2.917
AlaThr: 5.571 ± 3.734
AlaVal: 2.786 ± 2.041
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.178 ± 1.235
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.393 ± 1.882
CysGly: 0.0 ± 0.0
CysHis: 1.393 ± 0.943
CysIle: 1.393 ± 0.943
CysLys: 1.393 ± 0.943
CysLeu: 1.393 ± 1.882
CysMet: 0.0 ± 0.0
CysAsn: 4.178 ± 1.768
CysPro: 1.393 ± 1.882
CysGln: 0.0 ± 0.0
CysArg: 1.393 ± 0.943
CysSer: 0.0 ± 0.0
CysThr: 2.786 ± 2.041
CysVal: 1.393 ± 1.882
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 1.393 ± 1.882
AspAsp: 4.178 ± 2.828
AspGlu: 0.0 ± 0.0
AspPhe: 1.393 ± 0.943
AspGly: 1.393 ± 1.192
AspHis: 1.393 ± 0.943
AspIle: 2.786 ± 0.765
AspLys: 6.964 ± 2.449
AspLeu: 4.178 ± 2.828
AspMet: 1.393 ± 0.829
AspAsn: 1.393 ± 0.943
AspPro: 2.786 ± 1.885
AspGln: 2.786 ± 2.385
AspArg: 1.393 ± 1.192
AspSer: 5.571 ± 3.77
AspThr: 2.786 ± 2.385
AspVal: 4.178 ± 1.255
AspTrp: 0.0 ± 0.0
AspTyr: 4.178 ± 2.828
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.178 ± 2.828
GluCys: 0.0 ± 0.0
GluAsp: 2.786 ± 1.885
GluGlu: 0.0 ± 0.0
GluPhe: 2.786 ± 1.885
GluGly: 0.0 ± 0.0
GluHis: 0.0 ± 0.0
GluIle: 2.786 ± 0.765
GluLys: 4.178 ± 2.828
GluLeu: 5.571 ± 3.272
GluMet: 4.178 ± 1.768
GluAsn: 0.0 ± 0.0
GluPro: 0.0 ± 0.0
GluGln: 0.0 ± 0.0
GluArg: 4.178 ± 2.828
GluSer: 0.0 ± 0.0
GluThr: 9.749 ± 4.801
GluVal: 0.0 ± 0.0
GluTrp: 0.0 ± 0.0
GluTyr: 5.571 ± 1.529
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 2.786 ± 0.765
PheGlu: 1.393 ± 1.192
PhePhe: 1.393 ± 1.192
PheGly: 2.786 ± 0.765
PheHis: 1.393 ± 1.192
PheIle: 1.393 ± 1.882
PheLys: 1.393 ± 0.943
PheLeu: 4.178 ± 3.399
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 0.0 ± 0.0
PheGln: 1.393 ± 0.943
PheArg: 1.393 ± 1.192
PheSer: 6.964 ± 0.882
PheThr: 4.178 ± 1.894
PheVal: 4.178 ± 1.235
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.571 ± 2.917
GlyCys: 0.0 ± 0.0
GlyAsp: 5.571 ± 1.529
GlyGlu: 8.357 ± 2.294
GlyPhe: 2.786 ± 0.765
GlyGly: 6.964 ± 1.825
GlyHis: 2.786 ± 2.385
GlyIle: 4.178 ± 1.255
GlyLys: 4.178 ± 1.235
GlyLeu: 1.393 ± 1.882
GlyMet: 0.0 ± 0.0
GlyAsn: 0.0 ± 0.0
GlyPro: 1.393 ± 0.943
GlyGln: 2.786 ± 1.885
GlyArg: 9.749 ± 3.056
GlySer: 2.786 ± 1.885
GlyThr: 6.964 ± 4.091
GlyVal: 2.786 ± 2.041
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.786 ± 2.385
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 2.786 ± 2.041
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 2.786 ± 0.765
HisHis: 0.0 ± 0.0
HisIle: 5.571 ± 1.529
HisLys: 0.0 ± 0.0
HisLeu: 1.393 ± 1.192
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.393 ± 0.943
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 4.178 ± 1.235
HisThr: 2.786 ± 1.885
HisVal: 1.393 ± 0.943
HisTrp: 0.0 ± 0.0
HisTyr: 2.786 ± 1.885
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.786 ± 0.765
IleCys: 0.0 ± 0.0
IleAsp: 1.393 ± 0.943
IleGlu: 6.964 ± 2.955
IlePhe: 2.786 ± 2.041
IleGly: 2.786 ± 2.385
IleHis: 1.393 ± 0.943
IleIle: 4.178 ± 1.235
IleLys: 2.786 ± 2.041
IleLeu: 8.357 ± 1.907
IleMet: 1.393 ± 1.192
IleAsn: 0.0 ± 0.0
IlePro: 5.571 ± 2.059
IleGln: 1.393 ± 0.943
IleArg: 5.571 ± 0.873
IleSer: 4.178 ± 3.741
IleThr: 4.178 ± 1.235
IleVal: 4.178 ± 2.828
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 1.393 ± 1.192
Lys
LysAla: 8.357 ± 2.469
LysCys: 1.393 ± 1.882
LysAsp: 1.393 ± 0.943
LysGlu: 2.786 ± 1.885
LysPhe: 2.786 ± 0.765
LysGly: 5.571 ± 2.059
LysHis: 2.786 ± 0.765
LysIle: 1.393 ± 1.192
LysLys: 4.178 ± 1.768
LysLeu: 0.0 ± 0.0
LysMet: 2.786 ± 3.764
LysAsn: 1.393 ± 1.882
LysPro: 2.786 ± 0.765
LysGln: 2.786 ± 0.765
LysArg: 5.571 ± 1.821
LysSer: 0.0 ± 0.0
LysThr: 4.178 ± 1.235
LysVal: 1.393 ± 1.192
LysTrp: 0.0 ± 0.0
LysTyr: 5.571 ± 3.77
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.393 ± 0.943
LeuCys: 0.0 ± 0.0
LeuAsp: 6.964 ± 1.825
LeuGlu: 2.786 ± 1.885
LeuPhe: 2.786 ± 1.636
LeuGly: 1.393 ± 0.943
LeuHis: 1.393 ± 1.882
LeuIle: 4.178 ± 1.768
LeuLys: 2.786 ± 1.885
LeuLeu: 5.571 ± 1.529
LeuMet: 1.393 ± 0.943
LeuAsn: 4.178 ± 3.741
LeuPro: 6.964 ± 4.981
LeuGln: 1.393 ± 0.943
LeuArg: 5.571 ± 2.505
LeuSer: 8.357 ± 4.527
LeuThr: 5.571 ± 3.107
LeuVal: 4.178 ± 1.894
LeuTrp: 1.393 ± 0.943
LeuTyr: 2.786 ± 0.765
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.786 ± 0.765
MetCys: 1.393 ± 1.882
MetAsp: 1.393 ± 0.943
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.393 ± 1.882
MetLys: 1.393 ± 1.192
MetLeu: 0.0 ± 0.0
MetMet: 1.393 ± 1.882
MetAsn: 1.393 ± 1.192
MetPro: 2.786 ± 1.885
MetGln: 0.0 ± 0.0
MetArg: 2.786 ± 0.765
MetSer: 1.393 ± 1.882
MetThr: 0.0 ± 0.0
MetVal: 1.393 ± 1.192
MetTrp: 1.393 ± 1.882
MetTyr: 1.393 ± 1.192
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.786 ± 2.385
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 4.178 ± 3.399
AsnPhe: 1.393 ± 1.882
AsnGly: 2.786 ± 0.765
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 0.0 ± 0.0
AsnLeu: 5.571 ± 0.873
AsnMet: 1.393 ± 0.943
AsnAsn: 4.178 ± 1.768
AsnPro: 4.178 ± 1.235
AsnGln: 1.393 ± 0.943
AsnArg: 2.786 ± 0.765
AsnSer: 2.786 ± 2.385
AsnThr: 1.393 ± 1.192
AsnVal: 0.0 ± 0.0
AsnTrp: 1.393 ± 0.943
AsnTyr: 2.786 ± 0.765
AsnXaa: 1.393 ± 1.192
Pro
ProAla: 4.178 ± 1.255
ProCys: 1.393 ± 1.882
ProAsp: 1.393 ± 1.192
ProGlu: 5.571 ± 3.77
ProPhe: 0.0 ± 0.0
ProGly: 8.357 ± 2.511
ProHis: 0.0 ± 0.0
ProIle: 5.571 ± 0.873
ProLys: 0.0 ± 0.0
ProLeu: 2.786 ± 3.764
ProMet: 0.0 ± 1.45
ProAsn: 1.393 ± 0.943
ProPro: 1.393 ± 0.943
ProGln: 1.393 ± 0.943
ProArg: 5.571 ± 2.059
ProSer: 1.393 ± 1.882
ProThr: 1.393 ± 0.943
ProVal: 9.749 ± 2.698
ProTrp: 1.393 ± 0.943
ProTyr: 1.393 ± 1.192
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.178 ± 2.828
GlnCys: 2.786 ± 0.765
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 1.393 ± 1.192
GlnGly: 1.393 ± 0.943
GlnHis: 2.786 ± 1.885
GlnIle: 2.786 ± 1.885
GlnLys: 1.393 ± 1.192
GlnLeu: 1.393 ± 0.943
GlnMet: 1.393 ± 1.033
GlnAsn: 0.0 ± 0.0
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.393 ± 0.943
GlnSer: 2.786 ± 0.765
GlnThr: 1.393 ± 0.943
GlnVal: 1.393 ± 0.943
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 0.0 ± 0.0
ArgAsp: 5.571 ± 3.77
ArgGlu: 1.393 ± 0.943
ArgPhe: 8.357 ± 0.127
ArgGly: 4.178 ± 1.768
ArgHis: 0.0 ± 0.0
ArgIle: 6.964 ± 2.811
ArgLys: 8.357 ± 2.469
ArgLeu: 5.571 ± 3.77
ArgMet: 0.0 ± 0.0
ArgAsn: 2.786 ± 0.765
ArgPro: 4.178 ± 1.255
ArgGln: 1.393 ± 0.943
ArgArg: 8.357 ± 2.469
ArgSer: 4.178 ± 1.768
ArgThr: 9.749 ± 1.015
ArgVal: 2.786 ± 2.385
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.786 ± 2.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.178 ± 1.768
SerCys: 0.0 ± 0.0
SerAsp: 6.964 ± 0.882
SerGlu: 1.393 ± 1.192
SerPhe: 1.393 ± 1.192
SerGly: 8.357 ± 1.907
SerHis: 1.393 ± 0.943
SerIle: 1.393 ± 1.192
SerLys: 2.786 ± 1.636
SerLeu: 8.357 ± 2.509
SerMet: 1.393 ± 1.192
SerAsn: 6.964 ± 1.313
SerPro: 4.178 ± 3.399
SerGln: 0.0 ± 0.0
SerArg: 1.393 ± 1.882
SerSer: 4.178 ± 2.763
SerThr: 5.571 ± 0.873
SerVal: 5.571 ± 3.272
SerTrp: 1.393 ± 1.192
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.964 ± 2.811
ThrCys: 1.393 ± 0.943
ThrAsp: 4.178 ± 1.768
ThrGlu: 0.0 ± 0.0
ThrPhe: 1.393 ± 1.192
ThrGly: 6.964 ± 4.091
ThrHis: 4.178 ± 2.828
ThrIle: 5.571 ± 3.77
ThrLys: 1.393 ± 1.882
ThrLeu: 1.393 ± 1.882
ThrMet: 0.0 ± 0.0
ThrAsn: 1.393 ± 0.943
ThrPro: 6.964 ± 2.955
ThrGln: 1.393 ± 1.192
ThrArg: 6.964 ± 2.955
ThrSer: 2.786 ± 2.041
ThrThr: 13.928 ± 6.436
ThrVal: 11.142 ± 4.492
ThrTrp: 2.786 ± 1.885
ThrTyr: 1.393 ± 1.192
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.178 ± 3.741
ValCys: 1.393 ± 0.943
ValAsp: 2.786 ± 1.636
ValGlu: 4.178 ± 2.828
ValPhe: 1.393 ± 0.943
ValGly: 2.786 ± 0.765
ValHis: 2.786 ± 2.385
ValIle: 4.178 ± 1.894
ValLys: 5.571 ± 0.873
ValLeu: 4.178 ± 1.255
ValMet: 0.0 ± 0.0
ValAsn: 4.178 ± 1.768
ValPro: 4.178 ± 5.646
ValGln: 2.786 ± 1.885
ValArg: 5.571 ± 2.917
ValSer: 6.964 ± 4.981
ValThr: 0.0 ± 0.0
ValVal: 6.964 ± 7.41
ValTrp: 1.393 ± 0.943
ValTyr: 1.393 ± 1.192
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.393 ± 0.943
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.393 ± 0.943
TrpPhe: 1.393 ± 0.943
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.393 ± 1.882
TrpLeu: 1.393 ± 0.943
TrpMet: 0.0 ± 0.0
TrpAsn: 1.393 ± 1.192
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.393 ± 0.943
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.393 ± 0.943
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.571 ± 3.77
TyrCys: 1.393 ± 0.943
TyrAsp: 2.786 ± 0.765
TyrGlu: 1.393 ± 0.943
TyrPhe: 0.0 ± 0.0
TyrGly: 2.786 ± 0.765
TyrHis: 0.0 ± 0.0
TyrIle: 1.393 ± 0.943
TyrLys: 4.178 ± 1.768
TyrLeu: 5.571 ± 1.529
TyrMet: 2.786 ± 2.041
TyrAsn: 4.178 ± 1.768
TyrPro: 2.786 ± 0.765
TyrGln: 1.393 ± 0.943
TyrArg: 1.393 ± 1.192
TyrSer: 1.393 ± 0.943
TyrThr: 0.0 ± 0.0
TyrVal: 1.393 ± 1.192
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.786 ± 0.765
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 1.393 ± 1.192
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 1.393 ± 1.192
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (719 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
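A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the error is taken as the standard deviation across replicates, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Per mille frequency of each observed dipeptide (overlapping pairs)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if not total:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(sequences, n_rounds=100, seed=0):
    """Return {dipeptide: (observed per mille, bootstrap std dev)}."""
    rng = random.Random(seed)
    observed = dipeptide_permille(sequences)
    samples = {dp: [] for dp in observed}
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        resampled = rng.choices(sequences, k=len(sequences))
        freqs = dipeptide_permille(resampled)
        for dp in samples:
            samples[dp].append(freqs.get(dp, 0.0))
    # Assumption: the reported error is the spread across replicates.
    return {dp: (observed[dp], statistics.pstdev(vals))
            for dp, vals in samples.items()}


toy = ["MAAGT", "AAKT", "GTGT"]  # made-up sequences for illustration
result = bootstrap_error(toy, n_rounds=100, seed=1)
```

With only a handful of proteins, as in this proteome, each replicate can differ a lot from the others, which is one plausible reason the errors in the table are large relative to the values.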

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski