Amino acid dipepetide frequency for Banana streak UI virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.344AlaAla: 2.344 ± 1.177
0.0AlaCys: 0.0 ± 0.0
3.282AlaAsp: 3.282 ± 1.993
7.032AlaGlu: 7.032 ± 0.717
4.688AlaPhe: 4.688 ± 2.354
0.938AlaGly: 0.938 ± 0.471
1.406AlaHis: 1.406 ± 3.15
6.564AlaIle: 6.564 ± 3.296
5.626AlaLys: 5.626 ± 3.651
6.564AlaLeu: 6.564 ± 3.54
1.406AlaMet: 1.406 ± 0.88
1.406AlaAsn: 1.406 ± 0.706
2.813AlaPro: 2.813 ± 1.413
3.282AlaGln: 3.282 ± 2.532
3.751AlaArg: 3.751 ± 1.884
2.344AlaSer: 2.344 ± 1.177
3.282AlaThr: 3.282 ± 1.648
1.875AlaVal: 1.875 ± 0.942
0.938AlaTrp: 0.938 ± 1.49
0.938AlaTyr: 0.938 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.469CysAla: 0.469 ± 0.235
0.469CysCys: 0.469 ± 0.235
0.469CysAsp: 0.469 ± 0.235
0.938CysGlu: 0.938 ± 0.471
0.938CysPhe: 0.938 ± 0.471
1.406CysGly: 1.406 ± 0.706
1.406CysHis: 1.406 ± 0.706
0.469CysIle: 0.469 ± 0.235
1.406CysLys: 1.406 ± 0.706
0.0CysLeu: 0.0 ± 0.0
0.469CysMet: 0.469 ± 0.235
0.938CysAsn: 0.938 ± 0.471
0.0CysPro: 0.0 ± 0.0
0.469CysGln: 0.469 ± 0.235
1.406CysArg: 1.406 ± 0.706
0.0CysSer: 0.0 ± 0.0
1.875CysThr: 1.875 ± 0.942
0.469CysVal: 0.469 ± 0.235
0.0CysTrp: 0.0 ± 0.0
2.344CysTyr: 2.344 ± 1.112
0.0CysXaa: 0.0 ± 0.0
Asp
1.875AspAla: 1.875 ± 0.942
1.406AspCys: 1.406 ± 0.706
3.282AspAsp: 3.282 ± 1.648
6.564AspGlu: 6.564 ± 3.296
3.751AspPhe: 3.751 ± 1.884
1.406AspGly: 1.406 ± 0.706
0.938AspHis: 0.938 ± 0.471
3.282AspIle: 3.282 ± 1.806
1.875AspLys: 1.875 ± 0.942
5.626AspLeu: 5.626 ± 3.977
0.938AspMet: 0.938 ± 0.471
3.282AspAsn: 3.282 ± 1.648
2.344AspPro: 2.344 ± 1.177
3.751AspGln: 3.751 ± 4.387
0.469AspArg: 0.469 ± 0.235
3.751AspSer: 3.751 ± 1.777
1.875AspThr: 1.875 ± 0.942
1.406AspVal: 1.406 ± 3.15
1.875AspTrp: 1.875 ± 0.942
3.282AspTyr: 3.282 ± 1.993
0.0AspXaa: 0.0 ± 0.0
Glu
8.439GluAla: 8.439 ± 2.7
0.938GluCys: 0.938 ± 0.471
7.501GluAsp: 7.501 ± 3.554
14.534GluGlu: 14.534 ± 4.19
1.406GluPhe: 1.406 ± 0.706
3.751GluGly: 3.751 ± 1.884
2.344GluHis: 2.344 ± 1.112
7.032GluIle: 7.032 ± 2.272
8.908GluLys: 8.908 ± 3.173
7.97GluLeu: 7.97 ± 1.081
1.406GluMet: 1.406 ± 0.706
3.282GluAsn: 3.282 ± 1.806
1.406GluPro: 1.406 ± 2.188
5.626GluGln: 5.626 ± 3.616
3.751GluArg: 3.751 ± 1.118
2.813GluSer: 2.813 ± 1.063
3.751GluThr: 3.751 ± 1.884
6.564GluVal: 6.564 ± 2.13
1.875GluTrp: 1.875 ± 1.205
3.282GluTyr: 3.282 ± 1.806
0.0GluXaa: 0.0 ± 0.0
Phe
0.469PheAla: 0.469 ± 0.235
0.469PheCys: 0.469 ± 0.235
1.406PheAsp: 1.406 ± 0.706
3.282PheGlu: 3.282 ± 1.648
0.938PhePhe: 0.938 ± 0.471
1.875PheGly: 1.875 ± 2.059
1.406PheHis: 1.406 ± 0.706
3.751PheIle: 3.751 ± 1.118
0.938PheLys: 0.938 ± 0.471
2.813PheLeu: 2.813 ± 1.063
2.813PheMet: 2.813 ± 1.242
1.406PheAsn: 1.406 ± 0.706
1.875PhePro: 1.875 ± 0.942
2.813PheGln: 2.813 ± 1.413
0.469PheArg: 0.469 ± 0.235
0.938PheSer: 0.938 ± 1.49
1.406PheThr: 1.406 ± 0.706
0.469PheVal: 0.469 ± 2.493
0.938PheTrp: 0.938 ± 0.471
1.406PheTyr: 1.406 ± 0.706
0.0PheXaa: 0.0 ± 0.0
Gly
2.813GlyAla: 2.813 ± 1.413
0.469GlyCys: 0.469 ± 0.235
2.344GlyAsp: 2.344 ± 1.95
3.751GlyGlu: 3.751 ± 1.118
1.875GlyPhe: 1.875 ± 2.059
2.813GlyGly: 2.813 ± 1.413
0.938GlyHis: 0.938 ± 0.471
3.282GlyIle: 3.282 ± 1.065
4.688GlyLys: 4.688 ± 1.812
3.282GlyLeu: 3.282 ± 1.065
0.469GlyMet: 0.469 ± 1.204
1.875GlyAsn: 1.875 ± 0.942
1.406GlyPro: 1.406 ± 0.706
2.344GlyGln: 2.344 ± 1.112
3.282GlyArg: 3.282 ± 1.648
2.344GlySer: 2.344 ± 1.177
5.626GlyThr: 5.626 ± 1.962
2.344GlyVal: 2.344 ± 1.95
1.406GlyTrp: 1.406 ± 0.706
3.751GlyTyr: 3.751 ± 1.118
0.0GlyXaa: 0.0 ± 0.0
His
1.875HisAla: 1.875 ± 0.942
0.469HisCys: 0.469 ± 0.235
0.469HisAsp: 0.469 ± 0.235
0.469HisGlu: 0.469 ± 0.235
0.469HisPhe: 0.469 ± 0.235
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
3.282HisIle: 3.282 ± 1.648
1.406HisLys: 1.406 ± 0.706
2.813HisLeu: 2.813 ± 1.413
0.938HisMet: 0.938 ± 1.49
0.938HisAsn: 0.938 ± 3.328
0.0HisPro: 0.0 ± 0.0
1.875HisGln: 1.875 ± 1.205
0.938HisArg: 0.938 ± 1.49
0.0HisSer: 0.0 ± 0.0
0.938HisThr: 0.938 ± 3.12
0.469HisVal: 0.469 ± 0.235
0.469HisTrp: 0.469 ± 0.235
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.875IleAla: 1.875 ± 0.942
2.813IleCys: 2.813 ± 1.413
4.688IleAsp: 4.688 ± 2.354
8.439IleGlu: 8.439 ± 3.189
1.875IlePhe: 1.875 ± 0.942
4.219IleGly: 4.219 ± 1.215
1.406IleHis: 1.406 ± 0.706
4.219IleIle: 4.219 ± 1.563
9.845IleLys: 9.845 ± 4.944
4.688IleLeu: 4.688 ± 1.812
0.938IleMet: 0.938 ± 0.471
2.813IleAsn: 2.813 ± 1.413
1.875IlePro: 1.875 ± 0.942
5.626IleGln: 5.626 ± 3.651
2.813IleArg: 2.813 ± 1.063
5.157IleSer: 5.157 ± 3.735
5.157IleThr: 5.157 ± 2.462
3.751IleVal: 3.751 ± 1.884
0.0IleTrp: 0.0 ± 0.0
1.406IleTyr: 1.406 ± 0.706
0.0IleXaa: 0.0 ± 0.0
Lys
5.626LysAla: 5.626 ± 8.922
1.406LysCys: 1.406 ± 0.706
3.751LysAsp: 3.751 ± 1.775
8.439LysGlu: 8.439 ± 2.916
6.095LysPhe: 6.095 ± 1.997
5.626LysGly: 5.626 ± 1.962
1.406LysHis: 1.406 ± 0.706
8.439LysIle: 8.439 ± 0.86
9.845LysLys: 9.845 ± 8.523
9.376LysLeu: 9.376 ± 2.612
3.282LysMet: 3.282 ± 1.648
4.219LysAsn: 4.219 ± 1.215
3.751LysPro: 3.751 ± 1.118
3.282LysGln: 3.282 ± 1.993
4.219LysArg: 4.219 ± 2.119
7.501LysSer: 7.501 ± 3.767
4.688LysThr: 4.688 ± 3.901
5.626LysVal: 5.626 ± 6.315
1.406LysTrp: 1.406 ± 0.706
1.406LysTyr: 1.406 ± 0.706
0.0LysXaa: 0.0 ± 0.0
Leu
6.095LeuAla: 6.095 ± 1.997
1.406LeuCys: 1.406 ± 0.706
4.688LeuAsp: 4.688 ± 2.223
10.314LeuGlu: 10.314 ± 2.329
1.406LeuPhe: 1.406 ± 2.892
5.626LeuGly: 5.626 ± 0.99
1.406LeuHis: 1.406 ± 0.706
3.282LeuIle: 3.282 ± 1.993
12.189LeuLys: 12.189 ± 8.413
7.032LeuLeu: 7.032 ± 1.535
1.875LeuMet: 1.875 ± 1.205
5.626LeuAsn: 5.626 ± 2.229
3.751LeuPro: 3.751 ± 1.884
2.344LeuGln: 2.344 ± 1.112
2.344LeuArg: 2.344 ± 1.112
4.688LeuSer: 4.688 ± 3.901
3.282LeuThr: 3.282 ± 2.532
3.282LeuVal: 3.282 ± 2.532
0.469LeuTrp: 0.469 ± 0.235
1.875LeuTyr: 1.875 ± 0.942
0.0LeuXaa: 0.0 ± 0.0
Met
3.282MetAla: 3.282 ± 1.648
0.0MetCys: 0.0 ± 0.0
1.406MetAsp: 1.406 ± 0.706
2.344MetGlu: 2.344 ± 1.177
0.0MetPhe: 0.0 ± 0.0
0.469MetGly: 0.469 ± 0.235
0.0MetHis: 0.0 ± 0.0
2.344MetIle: 2.344 ± 1.177
4.688MetLys: 4.688 ± 2.354
1.406MetLeu: 1.406 ± 1.334
0.938MetMet: 0.938 ± 0.471
0.938MetAsn: 0.938 ± 0.471
0.469MetPro: 0.469 ± 0.235
0.938MetGln: 0.938 ± 0.471
0.938MetArg: 0.938 ± 1.49
0.938MetSer: 0.938 ± 2.334
2.813MetThr: 2.813 ± 1.063
2.344MetVal: 2.344 ± 1.177
0.469MetTrp: 0.469 ± 0.235
0.938MetTyr: 0.938 ± 0.471
0.0MetXaa: 0.0 ± 0.0
Asn
3.751AsnAla: 3.751 ± 1.884
0.469AsnCys: 0.469 ± 0.235
0.938AsnAsp: 0.938 ± 0.471
1.875AsnGlu: 1.875 ± 0.942
0.938AsnPhe: 0.938 ± 0.471
2.813AsnGly: 2.813 ± 1.413
0.938AsnHis: 0.938 ± 1.49
3.282AsnIle: 3.282 ± 1.065
4.219AsnLys: 4.219 ± 1.215
2.813AsnLeu: 2.813 ± 1.063
0.469AsnMet: 0.469 ± 0.235
2.344AsnAsn: 2.344 ± 1.177
2.813AsnPro: 2.813 ± 1.865
2.344AsnGln: 2.344 ± 1.177
1.406AsnArg: 1.406 ± 0.706
4.688AsnSer: 4.688 ± 2.223
4.219AsnThr: 4.219 ± 4.004
0.469AsnVal: 0.469 ± 0.235
0.469AsnTrp: 0.469 ± 0.235
2.813AsnTyr: 2.813 ± 1.413
0.0AsnXaa: 0.0 ± 0.0
Pro
5.157ProAla: 5.157 ± 2.59
0.0ProCys: 0.0 ± 0.0
1.406ProAsp: 1.406 ± 0.706
3.282ProGlu: 3.282 ± 1.648
0.938ProPhe: 0.938 ± 0.471
1.875ProGly: 1.875 ± 0.942
0.938ProHis: 0.938 ± 0.471
1.406ProIle: 1.406 ± 0.706
4.219ProLys: 4.219 ± 1.215
1.875ProLeu: 1.875 ± 1.205
0.938ProMet: 0.938 ± 0.471
0.469ProAsn: 0.469 ± 0.235
0.938ProPro: 0.938 ± 2.334
1.875ProGln: 1.875 ± 0.942
1.875ProArg: 1.875 ± 0.942
3.282ProSer: 3.282 ± 7.227
1.875ProThr: 1.875 ± 0.942
0.938ProVal: 0.938 ± 0.471
0.469ProTrp: 0.469 ± 0.235
0.938ProTyr: 0.938 ± 0.471
0.0ProXaa: 0.0 ± 0.0
Gln
2.344GlnAla: 2.344 ± 1.177
0.469GlnCys: 0.469 ± 0.235
3.282GlnAsp: 3.282 ± 1.648
5.157GlnGlu: 5.157 ± 1.504
0.469GlnPhe: 0.469 ± 0.235
3.751GlnGly: 3.751 ± 1.118
0.469GlnHis: 0.469 ± 1.664
2.344GlnIle: 2.344 ± 1.112
3.751GlnLys: 3.751 ± 1.118
4.688GlnLeu: 4.688 ± 7.382
2.344GlnMet: 2.344 ± 1.177
2.344GlnAsn: 2.344 ± 1.177
3.282GlnPro: 3.282 ± 1.065
0.469GlnGln: 0.469 ± 0.235
3.282GlnArg: 3.282 ± 2.532
1.875GlnSer: 1.875 ± 2.664
1.875GlnThr: 1.875 ± 2.664
3.282GlnVal: 3.282 ± 1.993
0.469GlnTrp: 0.469 ± 0.235
1.875GlnTyr: 1.875 ± 0.942
0.0GlnXaa: 0.0 ± 0.0
Arg
2.813ArgAla: 2.813 ± 1.063
0.938ArgCys: 0.938 ± 0.471
1.406ArgAsp: 1.406 ± 1.334
0.938ArgGlu: 0.938 ± 1.49
1.406ArgPhe: 1.406 ± 0.706
2.344ArgGly: 2.344 ± 1.112
0.469ArgHis: 0.469 ± 0.235
6.564ArgIle: 6.564 ± 2.067
4.219ArgLys: 4.219 ± 2.119
4.688ArgLeu: 4.688 ± 2.354
1.406ArgMet: 1.406 ± 0.706
0.938ArgAsn: 0.938 ± 0.471
1.875ArgPro: 1.875 ± 1.205
0.0ArgGln: 0.0 ± 0.0
5.157ArgArg: 5.157 ± 2.162
4.688ArgSer: 4.688 ± 5.637
2.344ArgThr: 2.344 ± 1.177
1.406ArgVal: 1.406 ± 0.706
0.938ArgTrp: 0.938 ± 0.471
2.344ArgTyr: 2.344 ± 1.177
0.0ArgXaa: 0.0 ± 0.0
Ser
2.813SerAla: 2.813 ± 1.413
0.469SerCys: 0.469 ± 0.235
3.282SerAsp: 3.282 ± 2.532
7.97SerGlu: 7.97 ± 6.599
0.938SerPhe: 0.938 ± 0.471
3.282SerGly: 3.282 ± 1.806
0.469SerHis: 0.469 ± 1.664
2.813SerIle: 2.813 ± 1.413
5.626SerLys: 5.626 ± 4.428
4.219SerLeu: 4.219 ± 4.003
1.875SerMet: 1.875 ± 0.942
2.344SerAsn: 2.344 ± 2.438
2.344SerPro: 2.344 ± 1.95
3.282SerGln: 3.282 ± 1.993
2.813SerArg: 2.813 ± 1.063
5.626SerSer: 5.626 ± 3.651
5.626SerThr: 5.626 ± 1.962
3.282SerVal: 3.282 ± 1.993
0.469SerTrp: 0.469 ± 0.235
1.406SerTyr: 1.406 ± 0.706
0.0SerXaa: 0.0 ± 0.0
Thr
5.157ThrAla: 5.157 ± 1.874
0.938ThrCys: 0.938 ± 1.49
1.875ThrAsp: 1.875 ± 0.942
7.032ThrGlu: 7.032 ± 5.813
2.344ThrPhe: 2.344 ± 1.177
3.751ThrGly: 3.751 ± 1.775
0.938ThrHis: 0.938 ± 0.471
2.344ThrIle: 2.344 ± 1.95
6.564ThrLys: 6.564 ± 5.952
4.219ThrLeu: 4.219 ± 4.004
1.875ThrMet: 1.875 ± 0.942
2.344ThrAsn: 2.344 ± 1.177
1.406ThrPro: 1.406 ± 0.706
1.875ThrGln: 1.875 ± 0.942
2.344ThrArg: 2.344 ± 1.112
6.095ThrSer: 6.095 ± 3.509
4.219ThrThr: 4.219 ± 4.004
2.813ThrVal: 2.813 ± 1.413
0.469ThrTrp: 0.469 ± 0.235
1.406ThrTyr: 1.406 ± 0.706
0.0ThrXaa: 0.0 ± 0.0
Val
0.938ValAla: 0.938 ± 0.471
2.344ValCys: 2.344 ± 1.177
2.813ValAsp: 2.813 ± 1.865
1.875ValGlu: 1.875 ± 1.205
0.938ValPhe: 0.938 ± 0.471
3.282ValGly: 3.282 ± 1.065
0.938ValHis: 0.938 ± 2.334
3.282ValIle: 3.282 ± 2.532
3.282ValLys: 3.282 ± 4.583
5.157ValLeu: 5.157 ± 2.462
0.938ValMet: 0.938 ± 0.471
2.813ValAsn: 2.813 ± 1.413
0.469ValPro: 0.469 ± 0.235
3.751ValGln: 3.751 ± 1.118
3.282ValArg: 3.282 ± 1.648
1.875ValSer: 1.875 ± 2.664
3.282ValThr: 3.282 ± 1.806
2.344ValVal: 2.344 ± 1.177
0.0ValTrp: 0.0 ± 0.0
1.406ValTyr: 1.406 ± 0.706
0.0ValXaa: 0.0 ± 0.0
Trp
0.469TrpAla: 0.469 ± 0.235
0.0TrpCys: 0.0 ± 0.0
1.406TrpAsp: 1.406 ± 0.706
1.406TrpGlu: 1.406 ± 1.334
0.0TrpPhe: 0.0 ± 0.0
0.469TrpGly: 0.469 ± 0.235
0.0TrpHis: 0.0 ± 0.0
0.469TrpIle: 0.469 ± 0.235
1.406TrpLys: 1.406 ± 1.334
1.406TrpLeu: 1.406 ± 0.706
0.0TrpMet: 0.0 ± 0.0
1.875TrpAsn: 1.875 ± 0.942
0.469TrpPro: 0.469 ± 0.235
0.0TrpGln: 0.0 ± 0.0
1.406TrpArg: 1.406 ± 0.706
0.469TrpSer: 0.469 ± 0.235
1.406TrpThr: 1.406 ± 0.706
0.469TrpVal: 0.469 ± 0.235
0.938TrpTrp: 0.938 ± 0.471
0.938TrpTyr: 0.938 ± 0.471
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.344TyrAla: 2.344 ± 1.177
0.469TyrCys: 0.469 ± 0.235
3.282TyrAsp: 3.282 ± 1.648
0.938TyrGlu: 0.938 ± 0.471
0.469TyrPhe: 0.469 ± 0.235
1.406TyrGly: 1.406 ± 0.706
0.0TyrHis: 0.0 ± 0.0
5.157TyrIle: 5.157 ± 2.59
4.688TyrLys: 4.688 ± 1.347
2.344TyrLeu: 2.344 ± 2.438
1.875TyrMet: 1.875 ± 0.942
1.875TyrAsn: 1.875 ± 0.942
1.406TyrPro: 1.406 ± 0.706
1.875TyrGln: 1.875 ± 0.942
0.938TyrArg: 0.938 ± 1.49
1.875TyrSer: 1.875 ± 0.942
0.469TyrThr: 0.469 ± 2.493
1.406TyrVal: 1.406 ± 0.706
0.938TyrTrp: 0.938 ± 0.471
1.406TyrTyr: 1.406 ± 0.706
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2134 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski