Amino acid dipeptide frequency for Eclipta yellow vein virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
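As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of one-letter protein sequences and reports each pair in per mille. The function name, the choice to count pairs only within a protein (never across protein boundaries), and the mapping of any non-standard letter to X (Xaa) are assumptions made for this example, not a description of how the database itself computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Hypothetical helper: pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), (2,3), ...
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Two toy sequences, not taken from the proteome.
    for pair, freq in sorted(dipeptide_per_mille(["MSSPA", "MAAK"]).items()):
        print(f"{pair}: {freq:.3f}")
```

A protein of length n contributes n - 1 overlapping pairs, so very short sequences add little to the totals.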

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale the expected 0.25% corresponds to 2.5‰.
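The arithmetic behind the expected value and the per mille unit can be sketched as follows. The uniform baseline of 1/400 comes from the text; the function names and the simple greater-than / less-than rule for "over" and "under" are assumptions for illustration, since the exact colouring thresholds used in the table are not stated.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / N_STANDARD**2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if value_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked in red
    if value_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked in blue
    return "expected"


if __name__ == "__main__":
    # Two values taken from the table: SerSer and AlaCys.
    for name, value in [("SerSer", 11.515), ("AlaCys", 0.886)]:
        print(name, per_mille_to_fraction(value), classify(value))
```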

For more information, see the sequence space article on Wikipedia.

Residue order (rows and columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry is listed as dipeptide: frequency ± error (‰), grouped by first residue.
Ala
AlaAla: 7.086 ± 2.595
AlaCys: 0.886 ± 0.645
AlaAsp: 1.771 ± 1.13
AlaGlu: 0.0 ± 0.0
AlaPhe: 0.0 ± 0.0
AlaGly: 2.657 ± 1.413
AlaHis: 1.771 ± 1.844
AlaIle: 2.657 ± 1.227
AlaLys: 4.429 ± 1.176
AlaLeu: 6.2 ± 2.069
AlaMet: 0.0 ± 0.0
AlaAsn: 3.543 ± 1.368
AlaPro: 3.543 ± 2.021
AlaGln: 3.543 ± 1.35
AlaArg: 3.543 ± 1.9
AlaSer: 4.429 ± 2.115
AlaThr: 2.657 ± 2.254
AlaVal: 3.543 ± 1.204
AlaTrp: 1.771 ± 0.946
AlaTyr: 1.771 ± 0.946
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.771 ± 1.844
CysAsp: 0.886 ± 0.751
CysGlu: 1.771 ± 0.801
CysPhe: 1.771 ± 1.539
CysGly: 1.771 ± 1.156
CysHis: 0.0 ± 0.0
CysIle: 2.657 ± 1.358
CysLys: 1.771 ± 0.801
CysLeu: 0.0 ± 0.0
CysMet: 0.886 ± 0.922
CysAsn: 1.771 ± 1.156
CysPro: 1.771 ± 1.844
CysGln: 0.0 ± 0.0
CysArg: 2.657 ± 1.362
CysSer: 3.543 ± 2.293
CysThr: 0.886 ± 0.959
CysVal: 0.886 ± 0.751
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.657 ± 1.245
AspCys: 0.0 ± 0.0
AspAsp: 2.657 ± 1.347
AspGlu: 1.771 ± 0.801
AspPhe: 1.771 ± 1.291
AspGly: 1.771 ± 1.291
AspHis: 1.771 ± 1.539
AspIle: 2.657 ± 0.943
AspLys: 0.886 ± 0.751
AspLeu: 7.972 ± 3.053
AspMet: 0.0 ± 0.0
AspAsn: 3.543 ± 2.17
AspPro: 3.543 ± 2.27
AspGln: 1.771 ± 1.068
AspArg: 1.771 ± 0.801
AspSer: 4.429 ± 1.523
AspThr: 1.771 ± 1.427
AspVal: 5.314 ± 2.169
AspTrp: 1.771 ± 1.156
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.429 ± 1.949
GluCys: 0.886 ± 1.178
GluAsp: 0.0 ± 0.0
GluGlu: 7.086 ± 4.353
GluPhe: 2.657 ± 1.362
GluGly: 4.429 ± 1.849
GluHis: 0.0 ± 0.0
GluIle: 1.771 ± 1.205
GluLys: 1.771 ± 1.291
GluLeu: 3.543 ± 1.414
GluMet: 0.0 ± 0.0
GluAsn: 2.657 ± 1.413
GluPro: 3.543 ± 1.077
GluGln: 3.543 ± 1.644
GluArg: 0.0 ± 0.0
GluSer: 5.314 ± 1.605
GluThr: 2.657 ± 1.13
GluVal: 2.657 ± 1.47
GluTrp: 2.657 ± 1.456
GluTyr: 0.886 ± 0.922
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.886 ± 0.751
PheAsp: 3.543 ± 1.601
PheGlu: 0.886 ± 0.645
PhePhe: 2.657 ± 1.413
PheGly: 0.886 ± 0.751
PheHis: 3.543 ± 1.815
PheIle: 1.771 ± 1.068
PheLys: 2.657 ± 1.899
PheLeu: 6.2 ± 1.578
PheMet: 0.886 ± 0.645
PheAsn: 3.543 ± 2.235
PhePro: 0.886 ± 0.922
PheGln: 3.543 ± 1.933
PheArg: 5.314 ± 1.884
PheSer: 1.771 ± 1.435
PheThr: 2.657 ± 1.296
PheVal: 1.771 ± 1.291
PheTrp: 0.0 ± 0.0
PheTyr: 1.771 ± 1.162
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.429 ± 1.216
GlyCys: 2.657 ± 2.174
GlyAsp: 3.543 ± 2.137
GlyGlu: 3.543 ± 1.204
GlyPhe: 1.771 ± 1.427
GlyGly: 1.771 ± 1.291
GlyHis: 1.771 ± 1.156
GlyIle: 1.771 ± 1.156
GlyLys: 6.2 ± 2.95
GlyLeu: 2.657 ± 1.418
GlyMet: 0.0 ± 0.0
GlyAsn: 0.886 ± 0.959
GlyPro: 3.543 ± 1.815
GlyGln: 2.657 ± 1.482
GlyArg: 0.886 ± 0.645
GlySer: 4.429 ± 1.96
GlyThr: 3.543 ± 1.236
GlyVal: 1.771 ± 2.114
GlyTrp: 0.886 ± 0.751
GlyTyr: 0.886 ± 0.922
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.886 ± 0.751
HisCys: 2.657 ± 2.45
HisAsp: 2.657 ± 1.708
HisGlu: 1.771 ± 1.427
HisPhe: 3.543 ± 1.9
HisGly: 2.657 ± 1.358
HisHis: 1.771 ± 1.122
HisIle: 0.886 ± 0.959
HisLys: 0.886 ± 1.057
HisLeu: 2.657 ± 1.482
HisMet: 0.886 ± 0.751
HisAsn: 3.543 ± 1.452
HisPro: 1.771 ± 0.946
HisGln: 1.771 ± 1.162
HisArg: 2.657 ± 2.174
HisSer: 0.886 ± 0.751
HisThr: 0.0 ± 0.0
HisVal: 3.543 ± 1.342
HisTrp: 0.0 ± 0.0
HisTyr: 0.886 ± 0.751
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.886 ± 0.959
IleCys: 2.657 ± 1.13
IleAsp: 1.771 ± 1.291
IleGlu: 2.657 ± 1.936
IlePhe: 2.657 ± 1.936
IleGly: 0.0 ± 0.0
IleHis: 2.657 ± 1.358
IleIle: 3.543 ± 2.298
IleLys: 4.429 ± 0.885
IleLeu: 4.429 ± 1.257
IleMet: 1.771 ± 1.078
IleAsn: 2.657 ± 1.296
IlePro: 0.886 ± 0.645
IleGln: 5.314 ± 3.284
IleArg: 6.2 ± 1.809
IleSer: 3.543 ± 1.633
IleThr: 3.543 ± 1.836
IleVal: 3.543 ± 2.116
IleTrp: 0.0 ± 0.0
IleTyr: 0.886 ± 0.751
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.886 ± 0.959
LysCys: 1.771 ± 1.205
LysAsp: 3.543 ± 1.933
LysGlu: 3.543 ± 1.815
LysPhe: 2.657 ± 0.943
LysGly: 1.771 ± 1.156
LysHis: 0.886 ± 0.645
LysIle: 4.429 ± 2.198
LysLys: 1.771 ± 0.946
LysLeu: 0.886 ± 1.057
LysMet: 0.886 ± 1.057
LysAsn: 6.2 ± 3.045
LysPro: 3.543 ± 1.544
LysGln: 2.657 ± 1.347
LysArg: 3.543 ± 2.244
LysSer: 5.314 ± 1.602
LysThr: 1.771 ± 0.946
LysVal: 4.429 ± 1.821
LysTrp: 0.0 ± 0.0
LysTyr: 4.429 ± 1.179
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.771 ± 0.967
LeuCys: 1.771 ± 1.291
LeuAsp: 4.429 ± 1.608
LeuGlu: 4.429 ± 1.45
LeuPhe: 1.771 ± 1.539
LeuGly: 6.2 ± 1.893
LeuHis: 1.771 ± 0.946
LeuIle: 4.429 ± 1.731
LeuLys: 6.2 ± 1.734
LeuLeu: 2.657 ± 1.939
LeuMet: 0.886 ± 0.751
LeuAsn: 6.2 ± 0.708
LeuPro: 1.771 ± 1.122
LeuGln: 4.429 ± 1.842
LeuArg: 7.086 ± 3.327
LeuSer: 4.429 ± 1.385
LeuThr: 5.314 ± 1.831
LeuVal: 1.771 ± 0.801
LeuTrp: 0.886 ± 1.057
LeuTyr: 5.314 ± 1.895
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.771 ± 0.801
MetCys: 0.886 ± 0.751
MetAsp: 2.657 ± 2.138
MetGlu: 0.886 ± 1.178
MetPhe: 0.886 ± 1.178
MetGly: 2.657 ± 1.076
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.771 ± 1.162
MetMet: 0.0 ± 0.0
MetAsn: 0.886 ± 0.751
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.886 ± 1.178
MetSer: 0.886 ± 0.751
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 2.657 ± 1.047
MetTyr: 1.771 ± 1.503
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.2 ± 2.64
AsnCys: 1.771 ± 1.156
AsnAsp: 1.771 ± 1.291
AsnGlu: 1.771 ± 1.162
AsnPhe: 3.543 ± 2.259
AsnGly: 2.657 ± 1.347
AsnHis: 2.657 ± 1.503
AsnIle: 1.771 ± 0.801
AsnLys: 0.886 ± 0.959
AsnLeu: 7.086 ± 2.777
AsnMet: 0.886 ± 1.042
AsnAsn: 3.543 ± 2.312
AsnPro: 3.543 ± 1.184
AsnGln: 1.771 ± 1.539
AsnArg: 3.543 ± 1.668
AsnSer: 1.771 ± 1.918
AsnThr: 1.771 ± 1.291
AsnVal: 5.314 ± 2.169
AsnTrp: 2.657 ± 1.076
AsnTyr: 3.543 ± 1.342
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.543 ± 1.108
ProCys: 1.771 ± 1.162
ProAsp: 1.771 ± 1.162
ProGlu: 1.771 ± 0.967
ProPhe: 1.771 ± 0.946
ProGly: 1.771 ± 1.156
ProHis: 4.429 ± 1.757
ProIle: 5.314 ± 2.418
ProLys: 4.429 ± 1.757
ProLeu: 5.314 ± 2.194
ProMet: 2.657 ± 1.664
ProAsn: 2.657 ± 1.227
ProPro: 4.429 ± 1.948
ProGln: 4.429 ± 2.707
ProArg: 1.771 ± 1.122
ProSer: 2.657 ± 3.534
ProThr: 8.857 ± 2.997
ProVal: 2.657 ± 1.413
ProTrp: 0.0 ± 0.0
ProTyr: 1.771 ± 0.801
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.543 ± 1.934
GlnCys: 0.0 ± 0.0
GlnAsp: 4.429 ± 3.682
GlnGlu: 2.657 ± 1.482
GlnPhe: 4.429 ± 2.286
GlnGly: 1.771 ± 1.291
GlnHis: 2.657 ± 2.138
GlnIle: 2.657 ± 1.936
GlnLys: 0.886 ± 0.922
GlnLeu: 3.543 ± 1.322
GlnMet: 0.886 ± 1.178
GlnAsn: 1.771 ± 1.156
GlnPro: 5.314 ± 3.754
GlnGln: 3.543 ± 1.414
GlnArg: 1.771 ± 1.068
GlnSer: 4.429 ± 1.608
GlnThr: 2.657 ± 1.227
GlnVal: 3.543 ± 2.289
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.771 ± 1.503
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.543 ± 2.41
ArgCys: 1.771 ± 0.967
ArgAsp: 2.657 ± 1.047
ArgGlu: 3.543 ± 2.032
ArgPhe: 1.771 ± 0.801
ArgGly: 2.657 ± 0.931
ArgHis: 0.886 ± 0.922
ArgIle: 3.543 ± 1.077
ArgLys: 2.657 ± 1.503
ArgLeu: 3.543 ± 1.585
ArgMet: 0.886 ± 0.751
ArgAsn: 2.657 ± 1.525
ArgPro: 4.429 ± 1.394
ArgGln: 1.771 ± 1.427
ArgArg: 6.2 ± 2.534
ArgSer: 7.086 ± 1.805
ArgThr: 4.429 ± 1.938
ArgVal: 5.314 ± 1.948
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.771 ± 1.162
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.771 ± 1.156
SerCys: 0.886 ± 0.922
SerAsp: 2.657 ± 2.243
SerGlu: 7.086 ± 2.444
SerPhe: 2.657 ± 0.931
SerGly: 4.429 ± 2.338
SerHis: 1.771 ± 1.44
SerIle: 5.314 ± 1.966
SerLys: 7.086 ± 1.481
SerLeu: 2.657 ± 1.482
SerMet: 1.771 ± 1.17
SerAsn: 4.429 ± 2.008
SerPro: 9.743 ± 2.335
SerGln: 1.771 ± 1.068
SerArg: 4.429 ± 1.27
SerSer: 11.515 ± 6.846
SerThr: 5.314 ± 1.841
SerVal: 1.771 ± 1.503
SerTrp: 0.886 ± 0.751
SerTyr: 2.657 ± 1.456
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.429 ± 1.505
ThrCys: 1.771 ± 1.068
ThrAsp: 0.886 ± 0.959
ThrGlu: 0.886 ± 0.959
ThrPhe: 2.657 ± 2.189
ThrGly: 4.429 ± 2.065
ThrHis: 4.429 ± 1.958
ThrIle: 2.657 ± 1.227
ThrLys: 1.771 ± 1.291
ThrLeu: 2.657 ± 1.47
ThrMet: 0.886 ± 0.645
ThrAsn: 1.771 ± 1.503
ThrPro: 5.314 ± 1.18
ThrGln: 1.771 ± 0.946
ThrArg: 2.657 ± 1.076
ThrSer: 6.2 ± 1.663
ThrThr: 2.657 ± 1.47
ThrVal: 5.314 ± 2.955
ThrTrp: 1.771 ± 1.426
ThrTyr: 1.771 ± 1.291
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.886 ± 0.645
ValCys: 0.0 ± 0.0
ValAsp: 3.543 ± 0.99
ValGlu: 2.657 ± 1.958
ValPhe: 3.543 ± 2.347
ValGly: 2.657 ± 1.958
ValHis: 2.657 ± 1.53
ValIle: 4.429 ± 1.505
ValLys: 4.429 ± 1.62
ValLeu: 4.429 ± 1.733
ValMet: 0.886 ± 0.751
ValAsn: 3.543 ± 1.419
ValPro: 4.429 ± 1.188
ValGln: 4.429 ± 1.754
ValArg: 2.657 ± 2.254
ValSer: 3.543 ± 1.549
ValThr: 3.543 ± 2.116
ValVal: 1.771 ± 0.801
ValTrp: 0.886 ± 1.057
ValTyr: 4.429 ± 2.065
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.543 ± 1.815
TrpCys: 0.0 ± 0.0
TrpAsp: 0.886 ± 0.922
TrpGlu: 0.886 ± 1.057
TrpPhe: 0.0 ± 0.0
TrpGly: 0.886 ± 0.645
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.886 ± 1.057
TrpLeu: 0.886 ± 0.959
TrpMet: 0.886 ± 0.751
TrpAsn: 0.886 ± 0.751
TrpPro: 0.0 ± 0.0
TrpGln: 1.771 ± 0.801
TrpArg: 0.886 ± 1.178
TrpSer: 1.771 ± 1.539
TrpThr: 0.886 ± 1.057
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.771 ± 1.068
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.543 ± 1.544
TyrCys: 0.0 ± 0.0
TyrAsp: 1.771 ± 1.162
TyrGlu: 0.886 ± 0.751
TyrPhe: 2.657 ± 0.943
TyrGly: 1.771 ± 0.801
TyrHis: 0.886 ± 0.751
TyrIle: 1.771 ± 1.291
TyrLys: 0.886 ± 0.645
TyrLeu: 4.429 ± 2.338
TyrMet: 1.771 ± 1.108
TyrAsn: 2.657 ± 1.272
TyrPro: 1.771 ± 1.068
TyrGln: 1.771 ± 0.801
TyrArg: 2.657 ± 1.726
TyrSer: 2.657 ± 1.362
TyrThr: 1.771 ± 1.205
TyrVal: 4.429 ± 1.584
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.771 ± 1.435
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1130 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
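One way to read "bootstrapping (×100) at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for this example; the note does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Hypothetical helper: each round draws len(proteins) proteins with
    replacement, so the protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumed error measure: standard deviation across bootstrap rounds.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome.
    toy = ["MSSPA", "MAAK", "SSSS"]
    print(pair_per_mille(toy, "SS"), "±", bootstrap_error(toy, "SS"))
```

With only a handful of proteins, as here, the resamples differ a lot from one another, which is consistent with errors that are large relative to the frequencies themselves.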

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski