Amino acid dipeptide frequency for Gossypium punctatum mild leaf curl virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
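The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the toy sequences, and the choice to normalise by the total number of overlapping dipeptides are hypothetical and are not taken from Proteome-pI, whose exact normalisation and colour thresholds are not given on this page.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, Asp, ... Tyr).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Overlapping pairs are counted inside each protein only, never across
    protein boundaries. Assumption: frequencies are normalised by the
    total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Made-up toy sequences, not the viral proteome.
toy = ["MSSRA", "SSK"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["SS"], 3), classify(freqs["SS"]))
print(classify(4.875), classify(0.609))  # AlaAla and AlaCys values from the table
```

With this threshold, the AlaAla value of 4.875‰ from the table would count as overrepresented and the AlaCys value of 0.609‰ as underrepresented.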

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency in ‰ (value ± error). Rows give the first residue, columns the second residue.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 4.875 ± 1.827 | 0.609 ± 0.553 | 1.828 ± 0.687 | 1.828 ± 1.093 | 0.609 ± 0.567 | 2.438 ± 0.864 | 0.0 ± 0.0 | 3.047 ± 1.44 | 3.047 ± 1.426 | 5.484 ± 1.417 | 0.0 ± 0.0 | 1.219 ± 0.728 | 0.609 ± 0.553 | 4.266 ± 1.236 | 4.266 ± 1.768 | 3.656 ± 2.181 | 2.438 ± 1.597 | 1.828 ± 0.801 | 1.219 ± 0.601 | 2.438 ± 1.378 | 0.0 ± 0.0 |
| Cys | 0.609 ± 0.567 | 1.828 ± 1.379 | 0.609 ± 0.567 | 0.609 ± 0.553 | 0.609 ± 0.693 | 0.609 ± 0.471 | 0.609 ± 0.63 | 2.438 ± 1.197 | 2.438 ± 1.07 | 0.0 ± 0.0 | 0.609 ± 0.691 | 3.047 ± 1.324 | 2.438 ± 1.371 | 0.0 ± 0.0 | 1.828 ± 0.768 | 2.438 ± 1.445 | 1.219 ± 0.806 | 1.828 ± 0.684 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 2.438 ± 1.033 | 0.0 ± 0.0 | 3.656 ± 1.52 | 1.828 ± 0.749 | 1.219 ± 0.666 | 3.047 ± 1.271 | 1.219 ± 0.683 | 3.656 ± 1.655 | 2.438 ± 0.786 | 3.656 ± 1.582 | 0.609 ± 0.567 | 3.047 ± 0.889 | 3.656 ± 1.395 | 3.047 ± 0.855 | 3.656 ± 1.163 | 4.266 ± 1.442 | 1.828 ± 1.379 | 5.484 ± 1.437 | 1.828 ± 0.999 | 3.047 ± 0.885 | 0.0 ± 0.0 |
| Glu | 3.047 ± 1.084 | 0.0 ± 0.0 | 1.828 ± 0.901 | 4.266 ± 2.409 | 3.047 ± 0.885 | 1.828 ± 0.927 | 0.609 ± 0.693 | 1.828 ± 1.283 | 3.047 ± 1.842 | 3.656 ± 1.549 | 0.0 ± 0.0 | 2.438 ± 1.597 | 2.438 ± 1.276 | 2.438 ± 1.024 | 1.219 ± 1.11 | 3.656 ± 0.65 | 1.828 ± 0.977 | 3.656 ± 1.346 | 0.609 ± 0.63 | 1.828 ± 1.073 | 0.0 ± 0.0 |
| Phe | 1.219 ± 1.134 | 2.438 ± 0.64 | 1.828 ± 0.927 | 1.219 ± 0.806 | 1.219 ± 0.666 | 1.828 ± 1.163 | 0.609 ± 0.471 | 1.219 ± 0.941 | 4.266 ± 2.107 | 4.875 ± 1.708 | 0.609 ± 0.471 | 2.438 ± 0.964 | 1.828 ± 0.768 | 2.438 ± 1.401 | 3.656 ± 0.945 | 1.828 ± 0.739 | 4.875 ± 2.039 | 0.609 ± 0.553 | 0.609 ± 0.555 | 1.828 ± 0.91 | 0.0 ± 0.0 |
| Gly | 2.438 ± 0.98 | 1.828 ± 1.169 | 2.438 ± 1.09 | 1.219 ± 0.78 | 1.219 ± 1.024 | 2.438 ± 0.921 | 1.828 ± 1.205 | 1.219 ± 0.69 | 6.703 ± 1.945 | 4.875 ± 2.167 | 1.219 ± 0.791 | 0.609 ± 0.693 | 5.484 ± 1.808 | 1.828 ± 0.927 | 1.828 ± 0.739 | 6.703 ± 2.319 | 2.438 ± 1.223 | 2.438 ± 1.011 | 0.0 ± 0.0 | 2.438 ± 1.332 | 0.0 ± 0.0 |
| His | 1.828 ± 1.055 | 1.828 ± 0.981 | 2.438 ± 1.203 | 0.0 ± 0.0 | 3.047 ± 1.391 | 2.438 ± 1.332 | 0.609 ± 0.63 | 1.828 ± 0.706 | 1.219 ± 1.023 | 1.219 ± 0.69 | 1.219 ± 0.704 | 1.828 ± 0.999 | 1.828 ± 0.954 | 2.438 ± 1.092 | 3.047 ± 1.444 | 0.609 ± 0.63 | 1.219 ± 1.107 | 2.438 ± 1.739 | 0.0 ± 0.0 | 0.609 ± 0.555 | 0.0 ± 0.0 |
| Ile | 0.609 ± 0.555 | 0.609 ± 0.691 | 3.047 ± 1.796 | 2.438 ± 1.455 | 3.656 ± 2.285 | 3.047 ± 1.229 | 2.438 ± 0.901 | 1.828 ± 0.866 | 4.875 ± 1.347 | 4.266 ± 1.72 | 0.0 ± 0.0 | 3.656 ± 1.09 | 2.438 ± 1.041 | 4.875 ± 1.541 | 4.875 ± 1.1 | 5.484 ± 2.021 | 3.047 ± 1.544 | 2.438 ± 1.039 | 2.438 ± 1.423 | 1.219 ± 0.806 | 0.0 ± 0.0 |
| Lys | 2.438 ± 1.39 | 1.828 ± 0.954 | 3.656 ± 1.122 | 2.438 ± 1.342 | 2.438 ± 0.711 | 4.875 ± 0.766 | 1.219 ± 0.941 | 4.266 ± 1.582 | 0.609 ± 0.553 | 4.875 ± 2.23 | 0.0 ± 0.0 | 5.484 ± 1.723 | 2.438 ± 1.267 | 2.438 ± 1.193 | 1.828 ± 1.23 | 4.875 ± 0.77 | 3.656 ± 1.138 | 3.656 ± 1.767 | 0.609 ± 0.553 | 3.656 ± 1.138 | 0.0 ± 0.0 |
| Leu | 1.828 ± 0.733 | 1.828 ± 1.138 | 5.484 ± 1.87 | 2.438 ± 1.197 | 3.656 ± 1.709 | 4.266 ± 1.52 | 3.656 ± 1.59 | 3.656 ± 1.263 | 5.484 ± 1.462 | 3.047 ± 1.34 | 1.828 ± 1.143 | 3.656 ± 1.612 | 1.219 ± 0.69 | 1.219 ± 1.024 | 4.266 ± 1.34 | 6.703 ± 1.648 | 4.266 ± 1.378 | 4.875 ± 1.717 | 0.0 ± 0.0 | 3.047 ± 1.227 | 0.0 ± 0.0 |
| Met | 0.609 ± 0.553 | 1.219 ± 0.729 | 2.438 ± 1.61 | 1.219 ± 0.658 | 1.219 ± 1.107 | 1.219 ± 0.851 | 0.609 ± 0.567 | 0.609 ± 0.772 | 0.609 ± 0.555 | 2.438 ± 0.786 | 0.609 ± 0.772 | 0.609 ± 0.553 | 1.219 ± 0.652 | 1.219 ± 0.856 | 1.828 ± 0.69 | 1.828 ± 1.637 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.219 ± 0.685 | 2.438 ± 1.2 | 0.0 ± 0.0 |
| Asn | 4.875 ± 1.701 | 1.828 ± 0.739 | 2.438 ± 1.455 | 1.828 ± 0.91 | 1.828 ± 0.749 | 1.219 ± 0.87 | 1.828 ± 1.23 | 2.438 ± 0.64 | 0.609 ± 0.555 | 5.484 ± 1.968 | 1.828 ± 1.004 | 4.266 ± 1.447 | 4.875 ± 1.115 | 1.828 ± 0.706 | 3.047 ± 1.366 | 4.875 ± 1.256 | 4.266 ± 1.427 | 4.266 ± 1.169 | 0.609 ± 0.471 | 3.047 ± 0.863 | 0.0 ± 0.0 |
| Pro | 1.828 ± 0.821 | 2.438 ± 1.052 | 3.047 ± 1.597 | 1.219 ± 0.652 | 1.828 ± 0.801 | 1.219 ± 0.728 | 1.828 ± 0.952 | 4.266 ± 0.792 | 3.656 ± 1.823 | 3.047 ± 1.18 | 2.438 ± 1.657 | 3.656 ± 1.169 | 1.219 ± 0.851 | 2.438 ± 1.065 | 4.875 ± 1.253 | 7.313 ± 2.186 | 1.828 ± 1.138 | 6.094 ± 1.414 | 0.609 ± 0.555 | 1.828 ± 0.687 | 0.0 ± 0.0 |
| Gln | 3.047 ± 1.35 | 0.609 ± 0.555 | 3.047 ± 1.489 | 3.047 ± 0.877 | 3.047 ± 1.437 | 2.438 ± 1.455 | 1.219 ± 1.26 | 1.219 ± 0.69 | 0.609 ± 0.691 | 3.047 ± 1.525 | 1.219 ± 0.652 | 1.219 ± 1.024 | 2.438 ± 2.048 | 4.266 ± 1.599 | 2.438 ± 1.27 | 4.266 ± 0.901 | 2.438 ± 1.403 | 5.484 ± 1.019 | 0.0 ± 0.0 | 2.438 ± 0.711 | 0.0 ± 0.0 |
| Arg | 2.438 ± 1.458 | 1.219 ± 1.024 | 4.266 ± 1.116 | 3.047 ± 1.351 | 1.828 ± 1.183 | 2.438 ± 0.743 | 2.438 ± 1.393 | 5.484 ± 1.692 | 2.438 ± 1.133 | 1.828 ± 1.094 | 1.828 ± 1.163 | 3.656 ± 1.073 | 4.875 ± 0.88 | 2.438 ± 1.3 | 7.922 ± 3.652 | 9.75 ± 1.625 | 1.828 ± 0.93 | 7.313 ± 1.273 | 0.0 ± 0.0 | 3.047 ± 1.045 | 0.0 ± 0.0 |
| Ser | 1.219 ± 0.941 | 1.219 ± 0.797 | 4.875 ± 0.766 | 5.484 ± 1.422 | 5.484 ± 0.985 | 3.656 ± 1.612 | 2.438 ± 0.964 | 6.094 ± 1.362 | 5.484 ± 2.001 | 3.656 ± 1.375 | 3.656 ± 1.585 | 4.875 ± 0.946 | 6.094 ± 1.087 | 2.438 ± 1.093 | 7.922 ± 1.495 | 11.578 ± 3.819 | 7.922 ± 2.165 | 3.656 ± 1.822 | 0.609 ± 0.555 | 4.875 ± 2.174 | 0.0 ± 0.0 |
| Thr | 4.875 ± 1.34 | 1.219 ± 1.134 | 0.609 ± 0.555 | 3.656 ± 1.288 | 1.219 ± 0.875 | 6.094 ± 1.159 | 4.875 ± 2.207 | 2.438 ± 0.849 | 1.828 ± 1.055 | 3.047 ± 1.098 | 1.828 ± 0.93 | 4.266 ± 1.037 | 6.094 ± 2.262 | 2.438 ± 1.162 | 2.438 ± 0.84 | 3.047 ± 2.298 | 3.047 ± 1.841 | 2.438 ± 1.34 | 0.0 ± 0.0 | 1.219 ± 0.652 | 0.0 ± 0.0 |
| Val | 1.219 ± 0.728 | 0.609 ± 0.567 | 4.266 ± 1.145 | 3.047 ± 1.437 | 3.047 ± 1.188 | 3.656 ± 1.575 | 1.828 ± 0.91 | 4.875 ± 1.734 | 5.484 ± 1.444 | 3.656 ± 1.482 | 1.219 ± 1.1 | 4.266 ± 1.251 | 3.656 ± 0.947 | 4.266 ± 1.357 | 3.656 ± 2.327 | 4.875 ± 1.503 | 3.656 ± 2.067 | 2.438 ± 1.039 | 1.219 ± 0.652 | 4.875 ± 1.99 | 0.0 ± 0.0 |
| Trp | 2.438 ± 1.517 | 0.0 ± 0.0 | 1.219 ± 1.023 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.609 ± 0.471 | 0.609 ± 0.553 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.609 ± 0.567 | 0.609 ± 0.553 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.609 ± 0.471 | 1.219 ± 0.683 | 1.219 ± 0.683 | 1.219 ± 0.78 | 0.609 ± 0.567 | 0.0 ± 0.0 | 0.609 ± 0.471 | 0.0 ± 0.0 |
| Tyr | 1.828 ± 1.353 | 0.609 ± 0.567 | 0.609 ± 0.553 | 2.438 ± 1.472 | 1.219 ± 0.78 | 1.828 ± 0.75 | 1.219 ± 0.652 | 4.875 ± 1.789 | 2.438 ± 1.662 | 3.656 ± 1.525 | 1.219 ± 0.772 | 3.047 ± 1.227 | 1.828 ± 1.128 | 0.609 ± 0.553 | 4.266 ± 1.362 | 4.266 ± 1.169 | 3.656 ± 1.067 | 4.875 ± 1.061 | 0.0 ± 0.0 | 1.828 ± 0.69 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 8 proteins (1642 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
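A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names are hypothetical, and it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a single protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so residues inside a protein are never shuffled.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Made-up toy sequences, not the viral proteome.
toy = ["MSSRA", "SSK", "MKV"]
print(dipeptide_per_mille(toy, "SS"), bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is one reason a proteome with only 8 proteins can show errors that are large relative to the values.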

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski