Amino acid dipeptide frequency for CRESS virus sp. ctczB4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
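The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the one-letter alphabet, the choice to count overlapping adjacent pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def dipeptide_per_mille(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

EXPECTED = 1000.0 / 400  # uniform expectation: 2.5 per mille (0.25%)

def label(freq, expected=EXPECTED):
    """Classify a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"

# Toy sequences for illustration only, not the real proteome.
freqs = dipeptide_per_mille(["MAAGS", "AGXA"])
```

With the 21-letter alphabet the result has 441 entries that sum to 1000‰. A value such as 10.428‰ would be labelled "over" and 0.0‰ "under" relative to the 2.5‰ expectation.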

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
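As a small worked example of this unit conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 10.428  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.010428
percent = per_mille_to_percent(ala_ala)    # about 1.0428
```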

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.428 ± 3.823
AlaCys: 1.043 ± 0.558
AlaAsp: 0.0 ± 0.0
AlaGlu: 3.128 ± 1.675
AlaPhe: 1.043 ± 1.23
AlaGly: 10.428 ± 3.951
AlaHis: 3.128 ± 1.97
AlaIle: 5.214 ± 1.191
AlaLys: 4.171 ± 1.529
AlaLeu: 2.086 ± 0.765
AlaMet: 3.128 ± 2.564
AlaAsn: 2.086 ± 0.765
AlaPro: 3.128 ± 1.675
AlaGln: 1.043 ± 0.558
AlaArg: 9.385 ± 3.398
AlaSer: 7.299 ± 2.302
AlaThr: 2.086 ± 1.117
AlaVal: 3.128 ± 0.53
AlaTrp: 1.043 ± 1.23
AlaTyr: 3.128 ± 0.53
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.086 ± 0.765
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.043 ± 1.23
CysPhe: 2.086 ± 0.765
CysGly: 0.0 ± 0.0
CysHis: 1.043 ± 0.558
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.043 ± 0.558
CysGln: 0.0 ± 0.0
CysArg: 1.043 ± 1.23
CysSer: 1.043 ± 1.23
CysThr: 2.086 ± 1.117
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.128 ± 0.53
AspCys: 0.0 ± 0.0
AspAsp: 1.043 ± 0.558
AspGlu: 3.128 ± 1.97
AspPhe: 1.043 ± 0.558
AspGly: 0.0 ± 0.0
AspHis: 0.0 ± 0.0
AspIle: 1.043 ± 0.558
AspLys: 0.0 ± 0.0
AspLeu: 3.128 ± 1.97
AspMet: 1.043 ± 0.558
AspAsn: 2.086 ± 1.117
AspPro: 3.128 ± 1.97
AspGln: 0.0 ± 0.0
AspArg: 3.128 ± 0.53
AspSer: 3.128 ± 1.675
AspThr: 3.128 ± 0.53
AspVal: 5.214 ± 1.242
AspTrp: 1.043 ± 1.23
AspTyr: 2.086 ± 1.117
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.257 ± 2.294
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 4.171 ± 0.774
GluPhe: 1.043 ± 1.23
GluGly: 5.214 ± 2.717
GluHis: 0.0 ± 0.0
GluIle: 1.043 ± 1.23
GluLys: 2.086 ± 1.117
GluLeu: 5.214 ± 4.05
GluMet: 1.043 ± 0.558
GluAsn: 3.128 ± 2.564
GluPro: 1.043 ± 0.558
GluGln: 3.128 ± 0.53
GluArg: 5.214 ± 2.724
GluSer: 3.128 ± 0.53
GluThr: 4.171 ± 0.774
GluVal: 2.086 ± 2.633
GluTrp: 1.043 ± 0.558
GluTyr: 2.086 ± 0.765
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.086 ± 1.117
PheCys: 1.043 ± 0.558
PheAsp: 2.086 ± 0.765
PheGlu: 2.086 ± 0.765
PhePhe: 1.043 ± 0.558
PheGly: 2.086 ± 2.459
PheHis: 0.0 ± 0.0
PheIle: 3.128 ± 1.675
PheLys: 2.086 ± 1.117
PheLeu: 2.086 ± 3.048
PheMet: 0.0 ± 0.0
PheAsn: 4.171 ± 0.774
PhePro: 1.043 ± 0.558
PheGln: 5.214 ± 1.242
PheArg: 4.171 ± 1.529
PheSer: 0.0 ± 0.0
PheThr: 2.086 ± 0.765
PheVal: 1.043 ± 0.558
PheTrp: 2.086 ± 2.459
PheTyr: 1.043 ± 0.558
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.214 ± 1.242
GlyCys: 1.043 ± 1.23
GlyAsp: 5.214 ± 2.792
GlyGlu: 4.171 ± 0.774
GlyPhe: 6.257 ± 1.059
GlyGly: 8.342 ± 1.549
GlyHis: 0.0 ± 0.0
GlyIle: 3.128 ± 1.97
GlyLys: 5.214 ± 2.119
GlyLeu: 5.214 ± 2.792
GlyMet: 1.043 ± 0.558
GlyAsn: 0.0 ± 0.0
GlyPro: 2.086 ± 1.117
GlyGln: 5.214 ± 2.724
GlyArg: 3.128 ± 2.658
GlySer: 10.428 ± 1.595
GlyThr: 7.299 ± 1.204
GlyVal: 7.299 ± 1.204
GlyTrp: 1.043 ± 0.558
GlyTyr: 3.128 ± 0.53
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.043 ± 1.23
HisAsp: 1.043 ± 0.558
HisGlu: 0.0 ± 0.0
HisPhe: 2.086 ± 0.765
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.043 ± 0.558
HisLys: 0.0 ± 0.0
HisLeu: 1.043 ± 1.23
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.043 ± 0.558
HisArg: 0.0 ± 0.0
HisSer: 1.043 ± 2.814
HisThr: 1.043 ± 1.23
HisVal: 0.0 ± 0.0
HisTrp: 1.043 ± 1.23
HisTyr: 1.043 ± 1.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.214 ± 1.191
IleCys: 0.0 ± 0.0
IleAsp: 2.086 ± 2.459
IleGlu: 1.043 ± 2.814
IlePhe: 2.086 ± 2.459
IleGly: 6.257 ± 3.35
IleHis: 0.0 ± 0.0
IleIle: 1.043 ± 1.23
IleLys: 4.171 ± 0.774
IleLeu: 3.128 ± 0.53
IleMet: 0.0 ± 0.0
IleAsn: 4.171 ± 2.615
IlePro: 2.086 ± 2.459
IleGln: 6.257 ± 3.039
IleArg: 3.128 ± 1.97
IleSer: 4.171 ± 2.615
IleThr: 5.214 ± 1.191
IleVal: 5.214 ± 2.119
IleTrp: 1.043 ± 1.23
IleTyr: 2.086 ± 1.117
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.171 ± 2.233
LysCys: 0.0 ± 0.0
LysAsp: 1.043 ± 1.23
LysGlu: 1.043 ± 1.23
LysPhe: 2.086 ± 0.765
LysGly: 5.214 ± 1.191
LysHis: 0.0 ± 0.0
LysIle: 4.171 ± 2.233
LysLys: 2.086 ± 1.117
LysLeu: 2.086 ± 1.117
LysMet: 2.086 ± 1.117
LysAsn: 2.086 ± 1.117
LysPro: 3.128 ± 0.53
LysGln: 3.128 ± 0.53
LysArg: 1.043 ± 0.558
LysSer: 6.257 ± 2.274
LysThr: 2.086 ± 2.633
LysVal: 6.257 ± 3.039
LysTrp: 0.0 ± 0.0
LysTyr: 2.086 ± 1.117
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.043 ± 1.23
LeuCys: 1.043 ± 1.23
LeuAsp: 4.171 ± 1.529
LeuGlu: 3.128 ± 3.699
LeuPhe: 3.128 ± 2.564
LeuGly: 5.214 ± 1.242
LeuHis: 2.086 ± 0.765
LeuIle: 3.128 ± 2.658
LeuLys: 3.128 ± 0.53
LeuLeu: 9.385 ± 10.556
LeuMet: 1.043 ± 0.558
LeuAsn: 7.299 ± 3.908
LeuPro: 1.043 ± 1.23
LeuGln: 2.086 ± 2.633
LeuArg: 5.214 ± 2.717
LeuSer: 6.257 ± 2.294
LeuThr: 5.214 ± 2.119
LeuVal: 3.128 ± 0.53
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.086 ± 2.633
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.043 ± 0.558
MetCys: 0.0 ± 0.0
MetAsp: 1.043 ± 0.558
MetGlu: 0.0 ± 0.0
MetPhe: 1.043 ± 0.558
MetGly: 1.043 ± 0.558
MetHis: 0.0 ± 0.0
MetIle: 4.171 ± 2.615
MetLys: 0.0 ± 0.0
MetLeu: 2.086 ± 2.633
MetMet: 1.043 ± 0.558
MetAsn: 0.0 ± 0.0
MetPro: 6.257 ± 3.35
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.086 ± 0.765
MetThr: 2.086 ± 1.117
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.171 ± 2.233
AsnCys: 1.043 ± 0.558
AsnAsp: 2.086 ± 1.117
AsnGlu: 4.171 ± 2.338
AsnPhe: 0.0 ± 0.0
AsnGly: 1.043 ± 1.23
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 2.086 ± 2.633
AsnLeu: 2.086 ± 1.117
AsnMet: 3.128 ± 0.64
AsnAsn: 5.214 ± 2.792
AsnPro: 1.043 ± 0.558
AsnGln: 1.043 ± 0.558
AsnArg: 2.086 ± 1.117
AsnSer: 6.257 ± 1.763
AsnThr: 4.171 ± 2.233
AsnVal: 3.128 ± 0.53
AsnTrp: 2.086 ± 0.765
AsnTyr: 3.128 ± 1.675
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.299 ± 2.302
ProCys: 0.0 ± 0.0
ProAsp: 4.171 ± 0.774
ProGlu: 0.0 ± 0.0
ProPhe: 2.086 ± 0.765
ProGly: 2.086 ± 0.765
ProHis: 2.086 ± 0.765
ProIle: 2.086 ± 0.765
ProLys: 3.128 ± 1.675
ProLeu: 1.043 ± 1.23
ProMet: 0.0 ± 0.0
ProAsn: 1.043 ± 0.558
ProPro: 3.128 ± 0.53
ProGln: 3.128 ± 1.675
ProArg: 3.128 ± 1.97
ProSer: 1.043 ± 0.558
ProThr: 3.128 ± 1.675
ProVal: 2.086 ± 0.765
ProTrp: 0.0 ± 0.0
ProTyr: 2.086 ± 0.765
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.171 ± 2.615
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.128 ± 0.53
GlnPhe: 2.086 ± 1.117
GlnGly: 7.299 ± 2.1
GlnHis: 0.0 ± 0.0
GlnIle: 3.128 ± 1.97
GlnLys: 3.128 ± 1.675
GlnLeu: 3.128 ± 2.658
GlnMet: 0.0 ± 0.0
GlnAsn: 1.043 ± 0.558
GlnPro: 2.086 ± 1.117
GlnGln: 2.086 ± 2.633
GlnArg: 2.086 ± 1.117
GlnSer: 2.086 ± 0.765
GlnThr: 4.171 ± 1.529
GlnVal: 3.128 ± 0.53
GlnTrp: 2.086 ± 1.117
GlnTyr: 2.086 ± 1.117
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.299 ± 2.302
ArgCys: 2.086 ± 2.459
ArgAsp: 3.128 ± 0.53
ArgGlu: 6.257 ± 2.034
ArgPhe: 3.128 ± 1.675
ArgGly: 6.257 ± 3.94
ArgHis: 0.0 ± 0.0
ArgIle: 5.214 ± 2.779
ArgLys: 1.043 ± 0.558
ArgLeu: 2.086 ± 2.459
ArgMet: 0.0 ± 0.0
ArgAsn: 3.128 ± 1.97
ArgPro: 2.086 ± 1.117
ArgGln: 2.086 ± 1.117
ArgArg: 8.342 ± 2.458
ArgSer: 6.257 ± 5.061
ArgThr: 4.171 ± 2.338
ArgVal: 6.257 ± 2.294
ArgTrp: 1.043 ± 1.23
ArgTyr: 3.128 ± 0.53
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.171 ± 0.774
SerCys: 0.0 ± 0.0
SerAsp: 0.0 ± 0.0
SerGlu: 1.043 ± 1.23
SerPhe: 3.128 ± 1.675
SerGly: 6.257 ± 3.35
SerHis: 2.086 ± 3.048
SerIle: 5.214 ± 2.724
SerLys: 7.299 ± 1.891
SerLeu: 6.257 ± 3.35
SerMet: 4.171 ± 2.079
SerAsn: 3.128 ± 1.97
SerPro: 2.086 ± 1.117
SerGln: 4.171 ± 2.338
SerArg: 9.385 ± 3.395
SerSer: 7.299 ± 2.1
SerThr: 3.128 ± 1.675
SerVal: 12.513 ± 2.323
SerTrp: 1.043 ± 1.23
SerTyr: 2.086 ± 0.765
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.171 ± 1.529
ThrCys: 0.0 ± 0.0
ThrAsp: 3.128 ± 0.53
ThrGlu: 2.086 ± 1.117
ThrPhe: 2.086 ± 1.117
ThrGly: 7.299 ± 1.891
ThrHis: 1.043 ± 1.23
ThrIle: 7.299 ± 2.1
ThrLys: 3.128 ± 1.675
ThrLeu: 5.214 ± 2.792
ThrMet: 1.043 ± 0.558
ThrAsn: 3.128 ± 1.675
ThrPro: 3.128 ± 1.97
ThrGln: 0.0 ± 0.0
ThrArg: 6.257 ± 3.511
ThrSer: 5.214 ± 1.242
ThrThr: 4.171 ± 2.233
ThrVal: 6.257 ± 3.35
ThrTrp: 1.043 ± 0.558
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.086 ± 0.765
ValCys: 2.086 ± 1.117
ValAsp: 5.214 ± 1.191
ValGlu: 5.214 ± 2.724
ValPhe: 0.0 ± 0.0
ValGly: 5.214 ± 2.792
ValHis: 0.0 ± 0.0
ValIle: 5.214 ± 1.242
ValLys: 3.128 ± 0.53
ValLeu: 8.342 ± 7.757
ValMet: 2.086 ± 1.117
ValAsn: 4.171 ± 2.233
ValPro: 3.128 ± 0.53
ValGln: 4.171 ± 0.774
ValArg: 2.086 ± 1.117
ValSer: 7.299 ± 2.302
ValThr: 4.171 ± 2.233
ValVal: 2.086 ± 0.765
ValTrp: 2.086 ± 2.459
ValTyr: 4.171 ± 0.774
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 4.171 ± 1.529
TrpPhe: 1.043 ± 1.23
TrpGly: 2.086 ± 0.765
TrpHis: 0.0 ± 0.0
TrpIle: 2.086 ± 2.459
TrpLys: 1.043 ± 1.23
TrpLeu: 2.086 ± 2.459
TrpMet: 0.0 ± 0.0
TrpAsn: 1.043 ± 0.558
TrpPro: 0.0 ± 0.0
TrpGln: 2.086 ± 0.765
TrpArg: 1.043 ± 0.558
TrpSer: 0.0 ± 0.0
TrpThr: 1.043 ± 1.23
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.086 ± 0.765
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.086 ± 1.117
TyrCys: 1.043 ± 0.558
TyrAsp: 1.043 ± 0.558
TyrGlu: 2.086 ± 1.117
TyrPhe: 2.086 ± 0.765
TyrGly: 3.128 ± 0.53
TyrHis: 0.0 ± 0.0
TyrIle: 1.043 ± 2.814
TyrLys: 3.128 ± 1.675
TyrLeu: 3.128 ± 0.53
TyrMet: 0.0 ± 0.0
TyrAsn: 1.043 ± 0.558
TyrPro: 2.086 ± 2.459
TyrGln: 1.043 ± 0.558
TyrArg: 3.128 ± 0.53
TyrSer: 4.171 ± 0.774
TyrThr: 1.043 ± 0.558
TyrVal: 3.128 ± 0.53
TyrTrp: 2.086 ± 0.765
TyrTyr: 2.086 ± 1.117
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (960 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
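A protein-level bootstrap of this kind can be sketched as follows. The page gives only the number of rounds (100) and the resampling unit (proteins), so everything else here is an assumption: the function names, the use of the population standard deviation as the reported error, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: std. dev.)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(dipeptide_freq(resampled, dipeptide))
    return statistics.pstdev(samples)

# Toy proteome for illustration only.
toy = ["MAAGSAA", "AGKAG", "MKKLV"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # WW never occurs in the toy proteome
```

Under this scheme a dipeptide that is absent from every protein has zero frequency in every resample, so its estimated error is exactly zero. With only three proteins there are few distinct resamples, which is consistent with the same error values recurring throughout the table.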

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski