Amino acid dipepetide frequency for Helminthosporium victoriae virus-190S (Hv190SV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.189AlaAla: 16.189 ± 2.182
3.113AlaCys: 3.113 ± 0.349
6.227AlaAsp: 6.227 ± 2.779
4.359AlaGlu: 4.359 ± 0.315
2.491AlaPhe: 2.491 ± 1.671
8.095AlaGly: 8.095 ± 2.395
2.491AlaHis: 2.491 ± 0.938
6.227AlaIle: 6.227 ± 2.438
1.868AlaLys: 1.868 ± 0.384
14.944AlaLeu: 14.944 ± 0.46
0.623AlaMet: 0.623 ± 0.452
1.868AlaAsn: 1.868 ± 0.384
7.472AlaPro: 7.472 ± 4.552
3.113AlaGln: 3.113 ± 0.349
10.585AlaArg: 10.585 ± 0.145
8.717AlaSer: 8.717 ± 1.108
8.095AlaThr: 8.095 ± 1.082
8.717AlaVal: 8.717 ± 1.108
3.113AlaTrp: 3.113 ± 0.349
4.981AlaTyr: 4.981 ± 0.733
0.0AlaXaa: 0.0 ± 0.0
Cys
2.491CysAla: 2.491 ± 0.801
0.0CysCys: 0.0 ± 0.0
0.623CysAsp: 0.623 ± 0.418
0.623CysGlu: 0.623 ± 0.452
0.623CysPhe: 0.623 ± 0.452
0.623CysGly: 0.623 ± 0.452
0.0CysHis: 0.0 ± 0.0
0.623CysIle: 0.623 ± 0.418
0.623CysLys: 0.623 ± 0.418
1.245CysLeu: 1.245 ± 0.835
0.0CysMet: 0.0 ± 0.0
1.245CysAsn: 1.245 ± 0.034
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.245CysArg: 1.245 ± 0.835
1.245CysSer: 1.245 ± 0.034
0.623CysThr: 0.623 ± 0.418
1.868CysVal: 1.868 ± 0.384
0.623CysTrp: 0.623 ± 0.452
0.623CysTyr: 0.623 ± 0.418
0.0CysXaa: 0.0 ± 0.0
Asp
1.868AspAla: 1.868 ± 0.486
0.623AspCys: 0.623 ± 0.418
3.113AspAsp: 3.113 ± 0.349
3.736AspGlu: 3.736 ± 0.767
3.113AspPhe: 3.113 ± 0.349
4.359AspGly: 4.359 ± 3.162
0.623AspHis: 0.623 ± 0.418
1.245AspIle: 1.245 ± 0.034
1.245AspLys: 1.245 ± 0.835
3.736AspLeu: 3.736 ± 0.972
0.623AspMet: 0.623 ± 0.418
1.245AspAsn: 1.245 ± 0.034
3.736AspPro: 3.736 ± 0.102
1.245AspGln: 1.245 ± 0.034
4.359AspArg: 4.359 ± 0.315
2.491AspSer: 2.491 ± 0.938
4.981AspThr: 4.981 ± 1.006
3.113AspVal: 3.113 ± 0.52
0.0AspTrp: 0.0 ± 0.0
1.868AspTyr: 1.868 ± 0.384
0.0AspXaa: 0.0 ± 0.0
Glu
4.981GluAla: 4.981 ± 0.136
0.0GluCys: 0.0 ± 0.0
2.491GluAsp: 2.491 ± 0.938
3.736GluGlu: 3.736 ± 0.102
0.623GluPhe: 0.623 ± 0.418
5.604GluGly: 5.604 ± 1.458
0.623GluHis: 0.623 ± 0.452
1.245GluIle: 1.245 ± 0.034
1.245GluLys: 1.245 ± 0.835
6.227GluLeu: 6.227 ± 1.568
1.245GluMet: 1.245 ± 0.904
1.868GluAsn: 1.868 ± 0.384
1.245GluPro: 1.245 ± 0.034
1.868GluGln: 1.868 ± 0.384
4.359GluArg: 4.359 ± 0.554
1.868GluSer: 1.868 ± 1.253
1.868GluThr: 1.868 ± 0.384
2.491GluVal: 2.491 ± 0.801
1.868GluTrp: 1.868 ± 0.486
1.868GluTyr: 1.868 ± 0.486
0.0GluXaa: 0.0 ± 0.0
Phe
3.736PheAla: 3.736 ± 0.102
0.0PheCys: 0.0 ± 0.0
1.245PheAsp: 1.245 ± 0.034
1.868PheGlu: 1.868 ± 0.486
0.623PhePhe: 0.623 ± 0.452
3.113PheGly: 3.113 ± 0.52
0.623PheHis: 0.623 ± 0.452
1.868PheIle: 1.868 ± 0.486
1.245PheLys: 1.245 ± 0.835
2.491PheLeu: 2.491 ± 0.938
0.623PheMet: 0.623 ± 0.418
1.245PheAsn: 1.245 ± 0.034
3.113PhePro: 3.113 ± 0.52
0.623PheGln: 0.623 ± 0.452
1.868PheArg: 1.868 ± 0.486
2.491PheSer: 2.491 ± 0.938
2.491PheThr: 2.491 ± 0.801
1.245PheVal: 1.245 ± 0.835
1.245PheTrp: 1.245 ± 0.904
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
10.585GlyAla: 10.585 ± 0.145
1.245GlyCys: 1.245 ± 0.835
4.359GlyAsp: 4.359 ± 1.423
5.604GlyGlu: 5.604 ± 1.458
2.491GlyPhe: 2.491 ± 0.068
9.34GlyGly: 9.34 ± 3.299
4.359GlyHis: 4.359 ± 0.315
6.227GlyIle: 6.227 ± 1.04
1.868GlyLys: 1.868 ± 0.384
8.095GlyLeu: 8.095 ± 0.656
1.868GlyMet: 1.868 ± 0.384
5.604GlyAsn: 5.604 ± 2.327
6.227GlyPro: 6.227 ± 1.909
2.491GlyGln: 2.491 ± 0.068
5.604GlyArg: 5.604 ± 0.588
5.604GlySer: 5.604 ± 1.151
3.113GlyThr: 3.113 ± 0.52
5.604GlyVal: 5.604 ± 0.588
0.0GlyTrp: 0.0 ± 0.0
2.491GlyTyr: 2.491 ± 1.671
0.0GlyXaa: 0.0 ± 0.0
His
4.981HisAla: 4.981 ± 1.006
0.0HisCys: 0.0 ± 0.0
0.623HisAsp: 0.623 ± 0.452
2.491HisGlu: 2.491 ± 0.068
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.623HisHis: 0.623 ± 0.452
1.245HisIle: 1.245 ± 0.034
0.0HisLys: 0.0 ± 0.0
3.113HisLeu: 3.113 ± 0.349
0.623HisMet: 0.623 ± 0.418
0.623HisAsn: 0.623 ± 0.452
1.868HisPro: 1.868 ± 0.486
0.0HisGln: 0.0 ± 0.0
1.245HisArg: 1.245 ± 0.034
1.868HisSer: 1.868 ± 0.384
2.491HisThr: 2.491 ± 0.068
1.868HisVal: 1.868 ± 0.384
0.623HisTrp: 0.623 ± 0.452
1.245HisTyr: 1.245 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
4.981IleAla: 4.981 ± 0.136
0.623IleCys: 0.623 ± 0.418
0.623IleAsp: 0.623 ± 0.418
2.491IleGlu: 2.491 ± 0.068
1.245IlePhe: 1.245 ± 0.034
3.736IleGly: 3.736 ± 0.972
2.491IleHis: 2.491 ± 0.938
0.623IleIle: 0.623 ± 0.418
2.491IleLys: 2.491 ± 1.671
2.491IleLeu: 2.491 ± 0.801
0.623IleMet: 0.623 ± 0.418
0.623IleAsn: 0.623 ± 0.418
3.113IlePro: 3.113 ± 0.52
0.0IleGln: 0.0 ± 0.0
3.736IleArg: 3.736 ± 1.636
1.868IleSer: 1.868 ± 0.384
1.868IleThr: 1.868 ± 1.355
4.359IleVal: 4.359 ± 1.423
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.227LysAla: 6.227 ± 3.307
0.0LysCys: 0.0 ± 0.0
1.245LysAsp: 1.245 ± 0.835
0.623LysGlu: 0.623 ± 0.418
0.0LysPhe: 0.0 ± 0.0
3.113LysGly: 3.113 ± 0.349
0.623LysHis: 0.623 ± 0.418
0.0LysIle: 0.0 ± 0.0
3.113LysLys: 3.113 ± 1.219
1.245LysLeu: 1.245 ± 0.835
0.0LysMet: 0.0 ± 0.0
0.623LysAsn: 0.623 ± 0.418
0.623LysPro: 0.623 ± 0.418
0.623LysGln: 0.623 ± 0.418
2.491LysArg: 2.491 ± 1.671
1.868LysSer: 1.868 ± 1.253
0.623LysThr: 0.623 ± 0.418
0.0LysVal: 0.0 ± 0.0
0.623LysTrp: 0.623 ± 0.418
0.623LysTyr: 0.623 ± 0.452
0.0LysXaa: 0.0 ± 0.0
Leu
11.208LeuAla: 11.208 ± 2.046
1.245LeuCys: 1.245 ± 0.034
4.359LeuAsp: 4.359 ± 1.185
3.113LeuGlu: 3.113 ± 1.219
3.736LeuPhe: 3.736 ± 0.102
11.208LeuGly: 11.208 ± 0.562
1.245LeuHis: 1.245 ± 0.034
1.245LeuIle: 1.245 ± 0.034
1.868LeuLys: 1.868 ± 1.253
7.472LeuLeu: 7.472 ± 1.534
1.868LeuMet: 1.868 ± 1.253
4.981LeuAsn: 4.981 ± 1.602
7.472LeuPro: 7.472 ± 0.205
2.491LeuGln: 2.491 ± 0.801
7.472LeuArg: 7.472 ± 0.665
5.604LeuSer: 5.604 ± 1.151
6.849LeuThr: 6.849 ± 0.622
6.849LeuVal: 6.849 ± 1.117
1.245LeuTrp: 1.245 ± 0.034
1.868LeuTyr: 1.868 ± 1.253
0.0LeuXaa: 0.0 ± 0.0
Met
3.113MetAla: 3.113 ± 0.349
0.0MetCys: 0.0 ± 0.0
1.245MetAsp: 1.245 ± 0.835
0.623MetGlu: 0.623 ± 0.452
0.0MetPhe: 0.0 ± 0.0
0.623MetGly: 0.623 ± 0.418
0.623MetHis: 0.623 ± 0.418
1.245MetIle: 1.245 ± 0.034
0.623MetLys: 0.623 ± 0.452
1.868MetLeu: 1.868 ± 1.253
0.0MetMet: 0.0 ± 0.0
0.623MetAsn: 0.623 ± 0.418
1.868MetPro: 1.868 ± 0.384
0.0MetGln: 0.0 ± 0.0
0.623MetArg: 0.623 ± 0.452
2.491MetSer: 2.491 ± 0.801
0.623MetThr: 0.623 ± 0.452
0.623MetVal: 0.623 ± 0.418
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.849AsnAla: 6.849 ± 0.247
0.623AsnCys: 0.623 ± 0.452
1.245AsnAsp: 1.245 ± 0.835
1.868AsnGlu: 1.868 ± 0.486
1.868AsnPhe: 1.868 ± 0.486
4.981AsnGly: 4.981 ± 0.733
0.0AsnHis: 0.0 ± 0.0
1.868AsnIle: 1.868 ± 0.384
1.245AsnLys: 1.245 ± 0.835
1.245AsnLeu: 1.245 ± 0.034
0.623AsnMet: 0.623 ± 0.452
3.736AsnAsn: 3.736 ± 0.102
3.736AsnPro: 3.736 ± 0.102
0.623AsnGln: 0.623 ± 0.452
2.491AsnArg: 2.491 ± 0.938
3.113AsnSer: 3.113 ± 0.349
4.359AsnThr: 4.359 ± 1.185
1.245AsnVal: 1.245 ± 0.904
0.0AsnTrp: 0.0 ± 0.0
0.623AsnTyr: 0.623 ± 0.418
0.0AsnXaa: 0.0 ± 0.0
Pro
9.963ProAla: 9.963 ± 2.881
1.868ProCys: 1.868 ± 0.384
1.245ProAsp: 1.245 ± 0.034
3.736ProGlu: 3.736 ± 0.102
1.868ProPhe: 1.868 ± 1.355
7.472ProGly: 7.472 ± 1.074
2.491ProHis: 2.491 ± 0.068
2.491ProIle: 2.491 ± 0.938
1.245ProLys: 1.245 ± 0.835
6.849ProLeu: 6.849 ± 0.622
0.623ProMet: 0.623 ± 0.418
2.491ProAsn: 2.491 ± 0.068
9.963ProPro: 9.963 ± 2.881
1.868ProGln: 1.868 ± 0.384
2.491ProArg: 2.491 ± 0.938
1.868ProSer: 1.868 ± 0.486
4.359ProThr: 4.359 ± 1.423
3.113ProVal: 3.113 ± 0.52
0.0ProTrp: 0.0 ± 0.0
2.491ProTyr: 2.491 ± 0.068
0.0ProXaa: 0.0 ± 0.0
Gln
2.491GlnAla: 2.491 ± 0.938
0.623GlnCys: 0.623 ± 0.452
0.0GlnAsp: 0.0 ± 0.0
0.623GlnGlu: 0.623 ± 0.418
1.868GlnPhe: 1.868 ± 0.384
2.491GlnGly: 2.491 ± 0.938
0.623GlnHis: 0.623 ± 0.418
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
2.491GlnLeu: 2.491 ± 0.938
0.623GlnMet: 0.623 ± 0.317
0.623GlnAsn: 0.623 ± 0.418
3.113GlnPro: 3.113 ± 1.219
1.245GlnGln: 1.245 ± 0.904
1.868GlnArg: 1.868 ± 1.253
1.245GlnSer: 1.245 ± 0.034
2.491GlnThr: 2.491 ± 1.671
1.245GlnVal: 1.245 ± 0.034
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
9.34ArgAla: 9.34 ± 0.179
1.245ArgCys: 1.245 ± 0.034
3.736ArgAsp: 3.736 ± 0.972
1.245ArgGlu: 1.245 ± 0.835
1.245ArgPhe: 1.245 ± 0.034
7.472ArgGly: 7.472 ± 0.205
3.113ArgHis: 3.113 ± 0.349
0.623ArgIle: 0.623 ± 0.418
0.0ArgLys: 0.0 ± 0.0
8.095ArgLeu: 8.095 ± 1.952
2.491ArgMet: 2.491 ± 0.801
1.245ArgAsn: 1.245 ± 0.034
4.359ArgPro: 4.359 ± 1.423
4.359ArgGln: 4.359 ± 1.423
6.227ArgArg: 6.227 ± 0.171
4.359ArgSer: 4.359 ± 0.315
5.604ArgThr: 5.604 ± 1.458
4.981ArgVal: 4.981 ± 0.136
1.245ArgTrp: 1.245 ± 0.835
3.736ArgTyr: 3.736 ± 0.767
0.0ArgXaa: 0.0 ± 0.0
Ser
6.849SerAla: 6.849 ± 2.361
1.245SerCys: 1.245 ± 0.835
3.736SerAsp: 3.736 ± 0.102
2.491SerGlu: 2.491 ± 0.068
0.623SerPhe: 0.623 ± 0.418
5.604SerGly: 5.604 ± 1.151
2.491SerHis: 2.491 ± 0.068
1.868SerIle: 1.868 ± 0.486
0.623SerLys: 0.623 ± 0.418
5.604SerLeu: 5.604 ± 1.151
0.623SerMet: 0.623 ± 0.665
4.359SerAsn: 4.359 ± 0.315
2.491SerPro: 2.491 ± 0.068
1.868SerGln: 1.868 ± 1.253
3.113SerArg: 3.113 ± 0.349
4.981SerSer: 4.981 ± 1.602
3.736SerThr: 3.736 ± 0.102
6.849SerVal: 6.849 ± 0.622
1.245SerTrp: 1.245 ± 0.034
3.736SerTyr: 3.736 ± 0.767
0.0SerXaa: 0.0 ± 0.0
Thr
6.227ThrAla: 6.227 ± 1.909
0.0ThrCys: 0.0 ± 0.0
4.359ThrAsp: 4.359 ± 1.423
1.245ThrGlu: 1.245 ± 0.034
3.736ThrPhe: 3.736 ± 1.841
4.981ThrGly: 4.981 ± 0.733
0.623ThrHis: 0.623 ± 0.452
4.359ThrIle: 4.359 ± 0.554
1.868ThrLys: 1.868 ± 0.384
6.849ThrLeu: 6.849 ± 3.725
2.491ThrMet: 2.491 ± 0.068
3.736ThrAsn: 3.736 ± 0.102
1.868ThrPro: 1.868 ± 0.384
0.623ThrGln: 0.623 ± 0.418
6.227ThrArg: 6.227 ± 1.04
8.717ThrSer: 8.717 ± 0.239
5.604ThrThr: 5.604 ± 1.458
1.868ThrVal: 1.868 ± 1.355
0.0ThrTrp: 0.0 ± 0.0
2.491ThrTyr: 2.491 ± 0.801
0.0ThrXaa: 0.0 ± 0.0
Val
7.472ValAla: 7.472 ± 0.205
0.623ValCys: 0.623 ± 0.418
2.491ValAsp: 2.491 ± 0.068
6.227ValGlu: 6.227 ± 0.171
2.491ValPhe: 2.491 ± 1.807
6.227ValGly: 6.227 ± 0.699
0.623ValHis: 0.623 ± 0.452
2.491ValIle: 2.491 ± 0.801
1.868ValLys: 1.868 ± 1.253
4.359ValLeu: 4.359 ± 1.423
0.0ValMet: 0.0 ± 0.0
4.359ValAsn: 4.359 ± 0.554
5.604ValPro: 5.604 ± 0.588
0.623ValGln: 0.623 ± 0.418
4.359ValArg: 4.359 ± 0.554
1.868ValSer: 1.868 ± 1.253
4.359ValThr: 4.359 ± 3.162
3.113ValVal: 3.113 ± 1.389
0.0ValTrp: 0.0 ± 0.0
1.245ValTyr: 1.245 ± 0.835
0.0ValXaa: 0.0 ± 0.0
Trp
1.245TrpAla: 1.245 ± 0.835
0.623TrpCys: 0.623 ± 0.418
0.623TrpAsp: 0.623 ± 0.452
0.0TrpGlu: 0.0 ± 0.0
1.245TrpPhe: 1.245 ± 0.904
0.623TrpGly: 0.623 ± 0.452
0.0TrpHis: 0.0 ± 0.0
0.623TrpIle: 0.623 ± 0.452
0.0TrpLys: 0.0 ± 0.0
1.868TrpLeu: 1.868 ± 0.384
0.0TrpMet: 0.0 ± 0.0
0.623TrpAsn: 0.623 ± 0.418
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
3.113TrpArg: 3.113 ± 0.52
0.623TrpSer: 0.623 ± 0.452
1.245TrpThr: 1.245 ± 0.835
0.0TrpVal: 0.0 ± 0.0
0.623TrpTrp: 0.623 ± 0.418
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.491TyrAla: 2.491 ± 1.671
1.245TyrCys: 1.245 ± 0.034
3.736TyrAsp: 3.736 ± 0.767
0.623TyrGlu: 0.623 ± 0.418
1.868TyrPhe: 1.868 ± 0.384
3.113TyrGly: 3.113 ± 1.219
1.245TyrHis: 1.245 ± 0.034
1.868TyrIle: 1.868 ± 0.384
1.245TyrLys: 1.245 ± 0.835
3.113TyrLeu: 3.113 ± 1.219
0.623TyrMet: 0.623 ± 0.418
0.623TyrAsn: 0.623 ± 0.452
0.623TyrPro: 0.623 ± 0.452
0.0TyrGln: 0.0 ± 0.0
1.245TyrArg: 1.245 ± 0.034
1.868TyrSer: 1.868 ± 0.486
2.491TyrThr: 2.491 ± 0.801
1.245TyrVal: 1.245 ± 0.034
0.623TyrTrp: 0.623 ± 0.418
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1607 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski