Amino acid dipepetide frequency for Changjiang picorna-like virus 10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.167AlaAla: 9.167 ± 1.032
2.391AlaCys: 2.391 ± 0.627
3.189AlaAsp: 3.189 ± 1.679
3.189AlaGlu: 3.189 ± 1.046
2.79AlaPhe: 2.79 ± 0.836
6.377AlaGly: 6.377 ± 1.069
1.993AlaHis: 1.993 ± 0.417
3.189AlaIle: 3.189 ± 0.414
5.181AlaLys: 5.181 ± 0.831
8.37AlaLeu: 8.37 ± 2.509
3.189AlaMet: 3.189 ± 1.679
3.587AlaAsn: 3.587 ± 1.906
5.978AlaPro: 5.978 ± 2.544
4.783AlaGln: 4.783 ± 1.276
4.783AlaArg: 4.783 ± 1.253
8.768AlaSer: 8.768 ± 1.075
6.377AlaThr: 6.377 ± 1.702
6.776AlaVal: 6.776 ± 1.67
1.196AlaTrp: 1.196 ± 0.003
3.587AlaTyr: 3.587 ± 2.538
0.0AlaXaa: 0.0 ± 0.0
Cys
2.79CysAla: 2.79 ± 1.469
0.399CysCys: 0.399 ± 0.21
0.399CysAsp: 0.399 ± 0.21
1.594CysGlu: 1.594 ± 0.425
0.399CysPhe: 0.399 ± 0.423
0.797CysGly: 0.797 ± 0.213
0.0CysHis: 0.0 ± 0.0
0.399CysIle: 0.399 ± 0.21
1.196CysLys: 1.196 ± 0.003
1.196CysLeu: 1.196 ± 0.003
0.0CysMet: 0.0 ± 0.0
0.797CysAsn: 0.797 ± 0.213
0.797CysPro: 0.797 ± 0.42
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.196CysSer: 1.196 ± 0.003
0.399CysThr: 0.399 ± 0.21
1.993CysVal: 1.993 ± 0.216
0.0CysTrp: 0.0 ± 0.0
0.399CysTyr: 0.399 ± 0.21
0.0CysXaa: 0.0 ± 0.0
Asp
5.58AspAla: 5.58 ± 1.04
0.399AspCys: 0.399 ± 0.21
5.978AspAsp: 5.978 ± 2.515
3.189AspGlu: 3.189 ± 1.046
2.391AspPhe: 2.391 ± 0.006
2.391AspGly: 2.391 ± 0.627
1.196AspHis: 1.196 ± 0.629
2.79AspIle: 2.79 ± 0.836
3.189AspLys: 3.189 ± 1.046
4.783AspLeu: 4.783 ± 0.012
0.399AspMet: 0.399 ± 0.423
3.189AspAsn: 3.189 ± 0.851
1.594AspPro: 1.594 ± 0.425
1.196AspGln: 1.196 ± 0.629
2.391AspArg: 2.391 ± 1.259
3.587AspSer: 3.587 ± 0.624
2.79AspThr: 2.79 ± 0.428
4.384AspVal: 4.384 ± 1.043
1.196AspTrp: 1.196 ± 0.003
1.594AspTyr: 1.594 ± 0.839
0.0AspXaa: 0.0 ± 0.0
Glu
3.587GluAla: 3.587 ± 1.888
0.797GluCys: 0.797 ± 0.213
2.391GluAsp: 2.391 ± 0.006
1.594GluGlu: 1.594 ± 0.207
4.384GluPhe: 4.384 ± 1.486
1.196GluGly: 1.196 ± 0.003
0.797GluHis: 0.797 ± 0.42
1.993GluIle: 1.993 ± 1.049
1.196GluLys: 1.196 ± 0.629
7.573GluLeu: 7.573 ± 0.44
1.196GluMet: 1.196 ± 0.003
1.196GluAsn: 1.196 ± 0.003
1.594GluPro: 1.594 ± 0.839
1.196GluGln: 1.196 ± 0.003
2.391GluArg: 2.391 ± 1.259
2.79GluSer: 2.79 ± 0.204
1.196GluThr: 1.196 ± 0.003
4.384GluVal: 4.384 ± 0.221
0.399GluTrp: 0.399 ± 0.21
2.79GluTyr: 2.79 ± 0.836
0.0GluXaa: 0.0 ± 0.0
Phe
3.587PheAla: 3.587 ± 1.273
0.399PheCys: 0.399 ± 0.423
2.79PheAsp: 2.79 ± 0.836
3.587PheGlu: 3.587 ± 1.906
1.594PhePhe: 1.594 ± 0.839
5.181PheGly: 5.181 ± 0.831
0.399PheHis: 0.399 ± 0.423
1.993PheIle: 1.993 ± 0.216
3.189PheLys: 3.189 ± 0.219
4.384PheLeu: 4.384 ± 1.043
1.196PheMet: 1.196 ± 0.635
2.391PheAsn: 2.391 ± 0.006
3.986PhePro: 3.986 ± 0.431
0.797PheGln: 0.797 ± 0.42
2.79PheArg: 2.79 ± 0.428
3.587PheSer: 3.587 ± 0.641
3.189PheThr: 3.189 ± 0.851
1.594PheVal: 1.594 ± 0.839
1.594PheTrp: 1.594 ± 0.207
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.978GlyAla: 5.978 ± 0.647
0.797GlyCys: 0.797 ± 0.213
3.587GlyAsp: 3.587 ± 1.256
2.391GlyGlu: 2.391 ± 0.006
3.587GlyPhe: 3.587 ± 0.009
5.58GlyGly: 5.58 ± 1.489
0.399GlyHis: 0.399 ± 0.21
4.783GlyIle: 4.783 ± 0.644
3.986GlyLys: 3.986 ± 0.834
3.986GlyLeu: 3.986 ± 1.064
1.993GlyMet: 1.993 ± 0.216
1.993GlyAsn: 1.993 ± 0.216
3.587GlyPro: 3.587 ± 1.906
1.594GlyGln: 1.594 ± 0.839
2.79GlyArg: 2.79 ± 0.428
4.384GlySer: 4.384 ± 2.119
4.783GlyThr: 4.783 ± 1.276
8.768GlyVal: 8.768 ± 0.443
0.797GlyTrp: 0.797 ± 0.213
0.399GlyTyr: 0.399 ± 0.423
0.0GlyXaa: 0.0 ± 0.0
His
1.594HisAla: 1.594 ± 0.207
0.0HisCys: 0.0 ± 0.0
1.196HisAsp: 1.196 ± 0.629
0.0HisGlu: 0.0 ± 0.0
1.594HisPhe: 1.594 ± 0.207
1.594HisGly: 1.594 ± 0.207
0.399HisHis: 0.399 ± 0.21
1.196HisIle: 1.196 ± 0.629
1.196HisLys: 1.196 ± 0.629
1.196HisLeu: 1.196 ± 0.635
0.797HisMet: 0.797 ± 0.845
0.0HisAsn: 0.0 ± 0.0
0.797HisPro: 0.797 ± 0.42
0.797HisGln: 0.797 ± 0.213
0.797HisArg: 0.797 ± 0.42
1.196HisSer: 1.196 ± 0.003
0.797HisThr: 0.797 ± 0.42
2.391HisVal: 2.391 ± 0.006
0.0HisTrp: 0.0 ± 0.0
0.797HisTyr: 0.797 ± 0.845
0.0HisXaa: 0.0 ± 0.0
Ile
4.384IleAla: 4.384 ± 0.221
2.79IleCys: 2.79 ± 0.836
3.189IleAsp: 3.189 ± 1.046
0.399IleGlu: 0.399 ± 0.21
3.189IlePhe: 3.189 ± 0.851
3.189IleGly: 3.189 ± 2.116
0.399IleHis: 0.399 ± 0.21
1.594IleIle: 1.594 ± 0.207
3.587IleLys: 3.587 ± 1.256
2.79IleLeu: 2.79 ± 0.428
0.797IleMet: 0.797 ± 0.213
2.391IleAsn: 2.391 ± 0.638
2.391IlePro: 2.391 ± 1.271
1.594IleGln: 1.594 ± 0.207
2.391IleArg: 2.391 ± 1.259
2.79IleSer: 2.79 ± 0.204
2.79IleThr: 2.79 ± 1.693
2.391IleVal: 2.391 ± 0.006
0.399IleTrp: 0.399 ± 0.21
1.196IleTyr: 1.196 ± 0.003
0.0IleXaa: 0.0 ± 0.0
Lys
2.79LysAla: 2.79 ± 1.469
0.399LysCys: 0.399 ± 0.21
3.587LysAsp: 3.587 ± 1.888
4.384LysGlu: 4.384 ± 1.043
2.79LysPhe: 2.79 ± 0.428
3.587LysGly: 3.587 ± 1.256
0.797LysHis: 0.797 ± 0.213
2.391LysIle: 2.391 ± 0.006
3.986LysLys: 3.986 ± 2.098
3.986LysLeu: 3.986 ± 0.431
0.399LysMet: 0.399 ± 0.423
2.79LysAsn: 2.79 ± 1.469
2.391LysPro: 2.391 ± 0.006
1.594LysGln: 1.594 ± 0.207
3.986LysArg: 3.986 ± 2.098
2.79LysSer: 2.79 ± 0.836
3.189LysThr: 3.189 ± 0.414
4.384LysVal: 4.384 ± 1.676
1.196LysTrp: 1.196 ± 0.003
1.594LysTyr: 1.594 ± 0.839
0.0LysXaa: 0.0 ± 0.0
Leu
5.58LeuAla: 5.58 ± 0.224
1.993LeuCys: 1.993 ± 0.216
5.181LeuAsp: 5.181 ± 0.198
3.986LeuGlu: 3.986 ± 0.201
2.391LeuPhe: 2.391 ± 0.006
5.181LeuGly: 5.181 ± 1.067
3.986LeuHis: 3.986 ± 0.431
2.391LeuIle: 2.391 ± 0.638
5.58LeuLys: 5.58 ± 1.673
7.971LeuLeu: 7.971 ± 0.862
3.189LeuMet: 3.189 ± 0.414
2.391LeuAsn: 2.391 ± 0.006
4.783LeuPro: 4.783 ± 0.644
2.79LeuGln: 2.79 ± 0.204
4.384LeuArg: 4.384 ± 0.411
5.181LeuSer: 5.181 ± 0.434
6.776LeuThr: 6.776 ± 1.492
8.37LeuVal: 8.37 ± 2.509
2.79LeuTrp: 2.79 ± 0.836
1.196LeuTyr: 1.196 ± 0.003
0.0LeuXaa: 0.0 ± 0.0
Met
2.391MetAla: 2.391 ± 0.627
0.399MetCys: 0.399 ± 0.21
1.594MetAsp: 1.594 ± 0.839
0.797MetGlu: 0.797 ± 0.42
0.797MetPhe: 0.797 ± 0.42
0.797MetGly: 0.797 ± 0.213
0.399MetHis: 0.399 ± 0.21
0.797MetIle: 0.797 ± 0.845
1.196MetLys: 1.196 ± 0.629
1.594MetLeu: 1.594 ± 1.058
1.196MetMet: 1.196 ± 0.629
0.797MetAsn: 0.797 ± 0.213
1.993MetPro: 1.993 ± 0.216
0.797MetGln: 0.797 ± 0.213
2.391MetArg: 2.391 ± 0.006
2.391MetSer: 2.391 ± 0.638
1.594MetThr: 1.594 ± 0.207
2.391MetVal: 2.391 ± 0.638
0.399MetTrp: 0.399 ± 0.21
1.196MetTyr: 1.196 ± 0.629
0.0MetXaa: 0.0 ± 0.0
Asn
3.587AsnAla: 3.587 ± 0.009
0.0AsnCys: 0.0 ± 0.0
1.196AsnAsp: 1.196 ± 0.635
1.594AsnGlu: 1.594 ± 0.425
1.196AsnPhe: 1.196 ± 0.629
2.79AsnGly: 2.79 ± 0.204
1.196AsnHis: 1.196 ± 0.629
2.79AsnIle: 2.79 ± 1.061
2.79AsnLys: 2.79 ± 0.836
2.79AsnLeu: 2.79 ± 1.693
0.0AsnMet: 0.0 ± 0.0
3.986AsnAsn: 3.986 ± 1.064
1.196AsnPro: 1.196 ± 0.635
0.797AsnGln: 0.797 ± 0.213
0.399AsnArg: 0.399 ± 0.21
4.783AsnSer: 4.783 ± 1.276
3.986AsnThr: 3.986 ± 0.431
5.58AsnVal: 5.58 ± 0.408
0.399AsnTrp: 0.399 ± 0.423
1.196AsnTyr: 1.196 ± 0.003
0.0AsnXaa: 0.0 ± 0.0
Pro
4.783ProAla: 4.783 ± 0.012
0.399ProCys: 0.399 ± 0.21
1.594ProAsp: 1.594 ± 0.425
1.196ProGlu: 1.196 ± 0.635
1.993ProPhe: 1.993 ± 1.48
2.391ProGly: 2.391 ± 1.271
1.196ProHis: 1.196 ± 0.635
2.391ProIle: 2.391 ± 1.903
1.993ProLys: 1.993 ± 0.417
5.58ProLeu: 5.58 ± 1.04
0.797ProMet: 0.797 ± 0.213
1.196ProAsn: 1.196 ± 1.268
1.993ProPro: 1.993 ± 1.48
2.391ProGln: 2.391 ± 0.627
2.79ProArg: 2.79 ± 0.428
4.384ProSer: 4.384 ± 1.486
3.189ProThr: 3.189 ± 2.116
4.384ProVal: 4.384 ± 0.854
0.797ProTrp: 0.797 ± 0.213
3.189ProTyr: 3.189 ± 1.483
0.0ProXaa: 0.0 ± 0.0
Gln
4.384GlnAla: 4.384 ± 0.221
0.0GlnCys: 0.0 ± 0.0
2.391GlnAsp: 2.391 ± 0.006
1.196GlnGlu: 1.196 ± 0.629
0.399GlnPhe: 0.399 ± 0.423
0.797GlnGly: 0.797 ± 0.213
0.0GlnHis: 0.0 ± 0.0
0.399GlnIle: 0.399 ± 0.423
1.993GlnLys: 1.993 ± 0.417
1.993GlnLeu: 1.993 ± 0.216
0.399GlnMet: 0.399 ± 0.21
0.797GlnAsn: 0.797 ± 0.845
1.594GlnPro: 1.594 ± 0.425
0.797GlnGln: 0.797 ± 0.42
2.391GlnArg: 2.391 ± 1.259
3.189GlnSer: 3.189 ± 0.414
0.0GlnThr: 0.0 ± 0.0
2.79GlnVal: 2.79 ± 0.204
0.797GlnTrp: 0.797 ± 0.213
0.399GlnTyr: 0.399 ± 0.423
0.0GlnXaa: 0.0 ± 0.0
Arg
7.573ArgAla: 7.573 ± 2.09
0.797ArgCys: 0.797 ± 0.42
2.391ArgAsp: 2.391 ± 0.006
3.189ArgGlu: 3.189 ± 1.679
2.391ArgPhe: 2.391 ± 0.638
2.79ArgGly: 2.79 ± 0.204
1.594ArgHis: 1.594 ± 0.839
3.189ArgIle: 3.189 ± 1.046
1.993ArgLys: 1.993 ± 0.417
5.978ArgLeu: 5.978 ± 1.883
1.594ArgMet: 1.594 ± 0.207
3.189ArgAsn: 3.189 ± 0.219
1.196ArgPro: 1.196 ± 0.003
0.399ArgGln: 0.399 ± 0.21
5.181ArgArg: 5.181 ± 2.095
4.783ArgSer: 4.783 ± 0.621
3.189ArgThr: 3.189 ± 0.414
4.783ArgVal: 4.783 ± 1.253
0.399ArgTrp: 0.399 ± 0.423
0.399ArgTyr: 0.399 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
9.566SerAla: 9.566 ± 1.288
0.797SerCys: 0.797 ± 0.213
3.587SerAsp: 3.587 ± 0.641
3.986SerGlu: 3.986 ± 1.466
4.384SerPhe: 4.384 ± 1.043
6.377SerGly: 6.377 ± 1.46
1.196SerHis: 1.196 ± 0.635
3.587SerIle: 3.587 ± 0.624
1.993SerLys: 1.993 ± 0.417
5.58SerLeu: 5.58 ± 0.224
3.189SerMet: 3.189 ± 1.046
4.384SerAsn: 4.384 ± 1.486
2.391SerPro: 2.391 ± 1.271
1.196SerGln: 1.196 ± 0.635
4.384SerArg: 4.384 ± 0.411
6.776SerSer: 6.776 ± 0.227
5.181SerThr: 5.181 ± 4.861
3.587SerVal: 3.587 ± 1.906
0.399SerTrp: 0.399 ± 0.21
4.384SerTyr: 4.384 ± 0.221
0.0SerXaa: 0.0 ± 0.0
Thr
4.783ThrAla: 4.783 ± 2.541
0.0ThrCys: 0.0 ± 0.0
3.189ThrAsp: 3.189 ± 0.219
2.79ThrGlu: 2.79 ± 0.428
4.783ThrPhe: 4.783 ± 0.621
5.181ThrGly: 5.181 ± 3.596
0.399ThrHis: 0.399 ± 0.21
2.79ThrIle: 2.79 ± 1.061
1.594ThrLys: 1.594 ± 0.425
4.384ThrLeu: 4.384 ± 0.221
1.993ThrMet: 1.993 ± 1.48
1.594ThrAsn: 1.594 ± 0.207
2.391ThrPro: 2.391 ± 1.271
0.399ThrGln: 0.399 ± 0.21
2.79ThrArg: 2.79 ± 0.204
6.776ThrSer: 6.776 ± 1.492
3.986ThrThr: 3.986 ± 0.431
8.37ThrVal: 8.37 ± 0.612
0.797ThrTrp: 0.797 ± 0.213
2.79ThrTyr: 2.79 ± 1.061
0.0ThrXaa: 0.0 ± 0.0
Val
8.37ValAla: 8.37 ± 0.02
1.196ValCys: 1.196 ± 0.635
4.783ValAsp: 4.783 ± 0.621
3.189ValGlu: 3.189 ± 1.046
3.189ValPhe: 3.189 ± 1.046
5.978ValGly: 5.978 ± 0.014
1.993ValHis: 1.993 ± 0.216
3.587ValIle: 3.587 ± 0.624
3.189ValLys: 3.189 ± 1.046
9.566ValLeu: 9.566 ± 1.242
1.594ValMet: 1.594 ± 0.839
4.384ValAsn: 4.384 ± 2.308
6.776ValPro: 6.776 ± 2.124
1.993ValGln: 1.993 ± 0.848
7.174ValArg: 7.174 ± 0.615
1.993ValSer: 1.993 ± 0.417
5.58ValThr: 5.58 ± 0.224
8.37ValVal: 8.37 ± 0.653
1.196ValTrp: 1.196 ± 0.003
3.986ValTyr: 3.986 ± 0.834
0.0ValXaa: 0.0 ± 0.0
Trp
1.196TrpAla: 1.196 ± 0.003
0.399TrpCys: 0.399 ± 0.21
0.797TrpAsp: 0.797 ± 0.42
1.196TrpGlu: 1.196 ± 0.003
1.594TrpPhe: 1.594 ± 1.058
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.797TrpIle: 0.797 ± 0.845
0.399TrpLys: 0.399 ± 0.21
0.399TrpLeu: 0.399 ± 0.21
0.399TrpMet: 0.399 ± 0.21
0.797TrpAsn: 0.797 ± 0.42
0.0TrpPro: 0.0 ± 0.0
0.797TrpGln: 0.797 ± 0.213
1.196TrpArg: 1.196 ± 0.635
1.594TrpSer: 1.594 ± 0.207
1.196TrpThr: 1.196 ± 0.629
0.797TrpVal: 0.797 ± 0.42
0.399TrpTrp: 0.399 ± 0.423
1.594TrpTyr: 1.594 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.189TyrAla: 3.189 ± 1.483
0.399TyrCys: 0.399 ± 0.21
1.196TyrAsp: 1.196 ± 0.003
1.196TyrGlu: 1.196 ± 0.003
2.391TyrPhe: 2.391 ± 1.271
3.986TyrGly: 3.986 ± 1.696
0.0TyrHis: 0.0 ± 0.0
1.594TyrIle: 1.594 ± 0.207
3.587TyrLys: 3.587 ± 1.256
1.993TyrLeu: 1.993 ± 0.216
1.196TyrMet: 1.196 ± 0.313
0.399TyrAsn: 0.399 ± 0.423
1.196TyrPro: 1.196 ± 0.635
0.797TyrGln: 0.797 ± 0.213
1.594TyrArg: 1.594 ± 0.839
3.986TyrSer: 3.986 ± 1.064
1.594TyrThr: 1.594 ± 0.839
1.594TyrVal: 1.594 ± 0.207
0.399TyrTrp: 0.399 ± 0.21
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2510 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski