Amino acid dipeptide frequency for Lake Sarah-associated circular virus-32

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
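As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name, the one-letter alphabet (the table uses three-letter codes), the choice to count overlapping pairs within each protein only, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping residue pairs are counted within each protein,
    never across protein boundaries, and normalised by the total pair count.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard symbol to X (Xaa) before counting.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy sequences, not the proteins of this virus.
freqs = dipeptide_per_mille(["MKLSA", "SALS"])

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
expected_uniform = 1000.0 / 400
```

The dictionary always has 441 keys, so dipeptides that never occur (including every pair involving Xaa here) are reported as 0.0 rather than omitted.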

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
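The unit conversion and the comparison against the uniform expectation can be sketched as follows. The page does not state the exact rule used for the red and blue marking, so the strict comparison against 1/400 is an assumption, and the function names are hypothetical. The four example values are taken from the table.

```python
# Uniform expectation over 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: 'over' would correspond to red and 'under' to blue;
    the actual colouring rule of the source table is not documented here.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (per mille).
examples = {"SerAla": 16.304, "LeuSer": 14.946, "AlaGlu": 1.359, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption SerAla and LeuSer come out as overrepresented, while AlaGlu and CysCys come out as underrepresented.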

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.435 ± 4.168
AlaCys: 0.0 ± 0.0
AlaAsp: 2.717 ± 2.084
AlaGlu: 1.359 ± 0.942
AlaPhe: 5.435 ± 0.2
AlaGly: 5.435 ± 2.184
AlaHis: 5.435 ± 0.2
AlaIle: 8.152 ± 1.685
AlaLys: 2.717 ± 1.884
AlaLeu: 9.511 ± 1.341
AlaMet: 1.359 ± 0.942
AlaAsn: 1.359 ± 1.042
AlaPro: 8.152 ± 3.669
AlaGln: 2.717 ± 0.1
AlaArg: 4.076 ± 1.142
AlaSer: 8.152 ± 3.669
AlaThr: 6.793 ± 1.241
AlaVal: 5.435 ± 0.2
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.359 ± 0.942
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.359 ± 1.042
CysGlu: 0.0 ± 0.0
CysPhe: 2.717 ± 1.884
CysGly: 1.359 ± 0.942
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.359 ± 1.042
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.717 ± 0.1
CysGln: 1.359 ± 0.942
CysArg: 1.359 ± 0.942
CysSer: 0.0 ± 0.0
CysThr: 2.717 ± 0.1
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 0.0 ± 0.0
AspAsp: 5.435 ± 2.184
AspGlu: 1.359 ± 1.042
AspPhe: 5.435 ± 0.2
AspGly: 2.717 ± 0.1
AspHis: 0.0 ± 0.0
AspIle: 4.076 ± 0.842
AspLys: 2.717 ± 0.1
AspLeu: 5.435 ± 3.769
AspMet: 0.0 ± 0.663
AspAsn: 1.359 ± 1.042
AspPro: 2.717 ± 0.1
AspGln: 1.359 ± 0.942
AspArg: 1.359 ± 0.942
AspSer: 0.0 ± 0.0
AspThr: 2.717 ± 2.084
AspVal: 5.435 ± 0.2
AspTrp: 0.0 ± 0.0
AspTyr: 4.076 ± 1.142
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.793 ± 1.241
GluCys: 1.359 ± 0.942
GluAsp: 1.359 ± 0.942
GluGlu: 6.793 ± 0.743
GluPhe: 5.435 ± 2.184
GluGly: 1.359 ± 0.942
GluHis: 2.717 ± 1.884
GluIle: 1.359 ± 0.942
GluLys: 2.717 ± 1.884
GluLeu: 0.0 ± 0.0
GluMet: 2.717 ± 1.632
GluAsn: 0.0 ± 0.0
GluPro: 2.717 ± 0.1
GluGln: 2.717 ± 1.884
GluArg: 1.359 ± 1.042
GluSer: 4.076 ± 1.142
GluThr: 1.359 ± 1.042
GluVal: 2.717 ± 0.1
GluTrp: 1.359 ± 0.942
GluTyr: 1.359 ± 1.042
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 2.717 ± 0.1
PheGlu: 2.717 ± 0.1
PhePhe: 0.0 ± 0.0
PheGly: 2.717 ± 1.884
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 2.717 ± 1.884
PheLeu: 2.717 ± 0.1
PheMet: 1.359 ± 1.042
PheAsn: 2.717 ± 2.084
PhePro: 5.435 ± 1.785
PheGln: 0.0 ± 0.0
PheArg: 4.076 ± 3.126
PheSer: 5.435 ± 0.2
PheThr: 4.076 ± 1.142
PheVal: 4.076 ± 0.842
PheTrp: 0.0 ± 0.0
PheTyr: 1.359 ± 1.042
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.076 ± 1.142
GlyCys: 0.0 ± 0.0
GlyAsp: 1.359 ± 0.942
GlyGlu: 4.076 ± 0.842
GlyPhe: 1.359 ± 0.942
GlyGly: 2.717 ± 0.1
GlyHis: 0.0 ± 0.0
GlyIle: 6.793 ± 2.727
GlyLys: 2.717 ± 0.1
GlyLeu: 2.717 ± 0.1
GlyMet: 0.0 ± 0.0
GlyAsn: 1.359 ± 1.042
GlyPro: 2.717 ± 0.1
GlyGln: 6.793 ± 3.226
GlyArg: 4.076 ± 1.142
GlySer: 5.435 ± 4.168
GlyThr: 8.152 ± 0.299
GlyVal: 1.359 ± 0.942
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.717 ± 0.1
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.359 ± 1.042
HisCys: 0.0 ± 0.0
HisAsp: 2.717 ± 1.884
HisGlu: 0.0 ± 0.0
HisPhe: 1.359 ± 1.042
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.359 ± 0.942
HisLys: 0.0 ± 0.0
HisLeu: 4.076 ± 2.827
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.359 ± 0.942
HisArg: 4.076 ± 0.842
HisSer: 0.0 ± 0.0
HisThr: 1.359 ± 1.042
HisVal: 2.717 ± 1.884
HisTrp: 0.0 ± 0.0
HisTyr: 1.359 ± 0.942
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.717 ± 0.1
IleCys: 0.0 ± 0.0
IleAsp: 2.717 ± 1.884
IleGlu: 6.793 ± 0.743
IlePhe: 4.076 ± 1.142
IleGly: 2.717 ± 1.884
IleHis: 4.076 ± 0.842
IleIle: 0.0 ± 0.0
IleLys: 4.076 ± 0.842
IleLeu: 2.717 ± 2.084
IleMet: 0.0 ± 0.0
IleAsn: 1.359 ± 1.042
IlePro: 2.717 ± 0.1
IleGln: 4.076 ± 0.842
IleArg: 5.435 ± 1.785
IleSer: 2.717 ± 0.1
IleThr: 4.076 ± 1.142
IleVal: 4.076 ± 3.126
IleTrp: 5.435 ± 3.769
IleTyr: 2.717 ± 0.1
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.152 ± 1.685
LysCys: 0.0 ± 0.0
LysAsp: 2.717 ± 0.1
LysGlu: 2.717 ± 0.1
LysPhe: 1.359 ± 1.042
LysGly: 5.435 ± 1.785
LysHis: 0.0 ± 0.0
LysIle: 4.076 ± 1.142
LysLys: 4.076 ± 2.827
LysLeu: 6.793 ± 0.743
LysMet: 2.717 ± 0.1
LysAsn: 1.359 ± 0.942
LysPro: 1.359 ± 0.942
LysGln: 1.359 ± 1.042
LysArg: 2.717 ± 1.884
LysSer: 2.717 ± 1.884
LysThr: 1.359 ± 0.942
LysVal: 0.0 ± 0.0
LysTrp: 1.359 ± 0.942
LysTyr: 2.717 ± 0.1
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.435 ± 1.785
LeuCys: 0.0 ± 0.0
LeuAsp: 2.717 ± 1.884
LeuGlu: 1.359 ± 0.942
LeuPhe: 1.359 ± 0.942
LeuGly: 5.435 ± 2.184
LeuHis: 0.0 ± 0.0
LeuIle: 9.511 ± 0.643
LeuLys: 5.435 ± 0.2
LeuLeu: 6.793 ± 5.21
LeuMet: 0.0 ± 0.0
LeuAsn: 5.435 ± 3.769
LeuPro: 5.435 ± 0.2
LeuGln: 4.076 ± 0.842
LeuArg: 2.717 ± 1.884
LeuSer: 14.946 ± 0.443
LeuThr: 2.717 ± 2.084
LeuVal: 1.359 ± 1.042
LeuTrp: 4.076 ± 0.842
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.359 ± 1.042
MetCys: 0.0 ± 0.0
MetAsp: 1.359 ± 0.942
MetGlu: 1.359 ± 1.042
MetPhe: 0.0 ± 0.0
MetGly: 1.359 ± 1.042
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.359 ± 1.042
MetLeu: 1.359 ± 0.942
MetMet: 1.359 ± 1.042
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.359 ± 0.942
MetArg: 2.717 ± 2.084
MetSer: 1.359 ± 1.042
MetThr: 1.359 ± 0.942
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.435 ± 1.785
AsnCys: 1.359 ± 1.042
AsnAsp: 2.717 ± 2.084
AsnGlu: 4.076 ± 1.142
AsnPhe: 0.0 ± 0.0
AsnGly: 2.717 ± 2.084
AsnHis: 1.359 ± 0.942
AsnIle: 5.435 ± 4.168
AsnLys: 2.717 ± 1.884
AsnLeu: 1.359 ± 1.042
AsnMet: 1.359 ± 1.042
AsnAsn: 5.435 ± 2.184
AsnPro: 1.359 ± 0.942
AsnGln: 1.359 ± 1.042
AsnArg: 0.0 ± 0.0
AsnSer: 1.359 ± 1.042
AsnThr: 2.717 ± 2.084
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.076 ± 1.142
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.717 ± 1.884
ProCys: 1.359 ± 0.942
ProAsp: 1.359 ± 0.942
ProGlu: 2.717 ± 1.884
ProPhe: 1.359 ± 1.042
ProGly: 2.717 ± 1.884
ProHis: 2.717 ± 0.1
ProIle: 1.359 ± 0.942
ProLys: 2.717 ± 1.884
ProLeu: 4.076 ± 0.842
ProMet: 1.359 ± 1.042
ProAsn: 2.717 ± 0.1
ProPro: 2.717 ± 1.884
ProGln: 1.359 ± 1.042
ProArg: 5.435 ± 1.785
ProSer: 9.511 ± 2.627
ProThr: 6.793 ± 3.226
ProVal: 5.435 ± 3.769
ProTrp: 0.0 ± 0.0
ProTyr: 1.359 ± 1.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.076 ± 0.842
GlnCys: 2.717 ± 0.1
GlnAsp: 1.359 ± 1.042
GlnGlu: 1.359 ± 0.942
GlnPhe: 1.359 ± 1.042
GlnGly: 1.359 ± 0.942
GlnHis: 0.0 ± 0.0
GlnIle: 2.717 ± 2.084
GlnLys: 2.717 ± 1.884
GlnLeu: 5.435 ± 2.184
GlnMet: 0.0 ± 0.0
GlnAsn: 1.359 ± 1.042
GlnPro: 2.717 ± 1.884
GlnGln: 1.359 ± 0.942
GlnArg: 1.359 ± 0.942
GlnSer: 4.076 ± 0.842
GlnThr: 4.076 ± 1.142
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 4.076 ± 0.842
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.435 ± 0.2
ArgCys: 1.359 ± 0.942
ArgAsp: 2.717 ± 0.1
ArgGlu: 4.076 ± 0.842
ArgPhe: 0.0 ± 0.0
ArgGly: 5.435 ± 4.168
ArgHis: 1.359 ± 0.942
ArgIle: 5.435 ± 2.184
ArgLys: 1.359 ± 0.942
ArgLeu: 8.152 ± 0.299
ArgMet: 1.359 ± 0.942
ArgAsn: 5.435 ± 2.184
ArgPro: 4.076 ± 2.827
ArgGln: 1.359 ± 1.042
ArgArg: 5.435 ± 1.785
ArgSer: 4.076 ± 0.842
ArgThr: 0.0 ± 0.0
ArgVal: 2.717 ± 2.084
ArgTrp: 2.717 ± 0.1
ArgTyr: 2.717 ± 1.884
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 16.304 ± 3.37
SerCys: 1.359 ± 1.042
SerAsp: 2.717 ± 1.884
SerGlu: 0.0 ± 0.0
SerPhe: 5.435 ± 0.2
SerGly: 4.076 ± 1.142
SerHis: 2.717 ± 1.884
SerIle: 1.359 ± 1.042
SerLys: 5.435 ± 0.2
SerLeu: 4.076 ± 0.842
SerMet: 0.0 ± 0.0
SerAsn: 6.793 ± 1.241
SerPro: 4.076 ± 0.842
SerGln: 5.435 ± 1.785
SerArg: 6.793 ± 1.241
SerSer: 6.793 ± 1.241
SerThr: 4.076 ± 0.842
SerVal: 5.435 ± 0.2
SerTrp: 1.359 ± 1.042
SerTyr: 1.359 ± 0.942
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.152 ± 4.268
ThrCys: 1.359 ± 0.942
ThrAsp: 4.076 ± 1.142
ThrGlu: 4.076 ± 3.126
ThrPhe: 0.0 ± 0.0
ThrGly: 4.076 ± 3.126
ThrHis: 0.0 ± 0.0
ThrIle: 2.717 ± 0.1
ThrLys: 1.359 ± 1.042
ThrLeu: 4.076 ± 0.842
ThrMet: 0.0 ± 0.0
ThrAsn: 1.359 ± 1.042
ThrPro: 6.793 ± 1.241
ThrGln: 1.359 ± 1.042
ThrArg: 8.152 ± 4.268
ThrSer: 6.793 ± 0.743
ThrThr: 6.793 ± 5.21
ThrVal: 4.076 ± 0.842
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.359 ± 0.942
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.435 ± 1.785
ValCys: 0.0 ± 0.0
ValAsp: 4.076 ± 3.126
ValGlu: 0.0 ± 0.0
ValPhe: 2.717 ± 1.884
ValGly: 2.717 ± 1.884
ValHis: 0.0 ± 0.0
ValIle: 2.717 ± 1.884
ValLys: 2.717 ± 0.1
ValLeu: 6.793 ± 2.727
ValMet: 0.0 ± 0.0
ValAsn: 5.435 ± 4.168
ValPro: 2.717 ± 0.1
ValGln: 1.359 ± 0.942
ValArg: 2.717 ± 0.1
ValSer: 1.359 ± 0.942
ValThr: 2.717 ± 2.084
ValVal: 5.435 ± 0.2
ValTrp: 1.359 ± 1.042
ValTyr: 1.359 ± 1.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.359 ± 0.942
TrpAsp: 1.359 ± 0.942
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.359 ± 0.942
TrpGly: 4.076 ± 1.142
TrpHis: 0.0 ± 0.0
TrpIle: 2.717 ± 1.884
TrpLys: 2.717 ± 2.084
TrpLeu: 1.359 ± 0.942
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.359 ± 0.942
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 2.717 ± 0.1
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.717 ± 0.1
TyrCys: 2.717 ± 0.1
TyrAsp: 0.0 ± 0.0
TyrGlu: 5.435 ± 0.2
TyrPhe: 1.359 ± 1.042
TyrGly: 0.0 ± 0.0
TyrHis: 1.359 ± 0.942
TyrIle: 2.717 ± 1.884
TyrLys: 1.359 ± 0.942
TyrLeu: 1.359 ± 0.942
TyrMet: 1.359 ± 1.042
TyrAsn: 1.359 ± 1.042
TyrPro: 0.0 ± 0.0
TyrGln: 1.359 ± 1.042
TyrArg: 1.359 ± 0.942
TyrSer: 6.793 ± 0.743
TyrThr: 0.0 ± 0.0
TyrVal: 1.359 ± 1.042
TyrTrp: 1.359 ± 1.042
TyrTyr: 1.359 ± 1.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (737 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
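One way to read "bootstrapping at the protein level" is sketched below. The exact procedure used for this table is not described on the page, so everything here is an assumption: proteins are resampled with replacement, the per mille frequency is recomputed on each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `dipeptide` over `n_rounds` protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Hypothetical toy proteome of two short sequences.
proteins = ["MKLSALS", "MSASAK"]
mean_sa, err_sa = bootstrap_error(proteins, "SA")
```

With only two proteins, each resample is one of a handful of combinations, so the error mostly reflects how differently the two proteins use a given dipeptide. A dipeptide absent from both proteins gets an error of exactly 0.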

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski