Amino acid dipepetide frequency for Aedes camptorhynchus negev-like virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.432AlaAla: 1.432 ± 0.483
1.146AlaCys: 1.146 ± 0.237
1.432AlaAsp: 1.432 ± 0.841
3.437AlaGlu: 3.437 ± 0.685
2.005AlaPhe: 2.005 ± 1.099
2.005AlaGly: 2.005 ± 0.693
0.573AlaHis: 0.573 ± 0.551
3.724AlaIle: 3.724 ± 0.841
3.151AlaLys: 3.151 ± 0.782
4.297AlaLeu: 4.297 ± 1.29
0.573AlaMet: 0.573 ± 0.336
2.292AlaAsn: 2.292 ± 0.371
1.146AlaPro: 1.146 ± 0.472
1.432AlaGln: 1.432 ± 0.278
1.146AlaArg: 1.146 ± 0.374
3.437AlaSer: 3.437 ± 1.068
2.005AlaThr: 2.005 ± 1.11
1.432AlaVal: 1.432 ± 1.085
0.0AlaTrp: 0.0 ± 0.0
2.005AlaTyr: 2.005 ± 0.733
0.0AlaXaa: 0.0 ± 0.0
Cys
0.859CysAla: 0.859 ± 0.26
0.573CysCys: 0.573 ± 0.227
1.146CysAsp: 1.146 ± 0.809
1.432CysGlu: 1.432 ± 0.729
1.146CysPhe: 1.146 ± 0.237
1.719CysGly: 1.719 ± 0.674
0.0CysHis: 0.0 ± 0.0
0.573CysIle: 0.573 ± 0.336
1.432CysLys: 1.432 ± 0.608
1.719CysLeu: 1.719 ± 0.678
0.573CysMet: 0.573 ± 0.551
1.719CysAsn: 1.719 ± 0.421
1.146CysPro: 1.146 ± 0.52
0.573CysGln: 0.573 ± 0.315
0.573CysArg: 0.573 ± 0.315
2.292CysSer: 2.292 ± 0.475
0.573CysThr: 0.573 ± 0.227
0.573CysVal: 0.573 ± 0.609
0.0CysTrp: 0.0 ± 0.0
0.573CysTyr: 0.573 ± 0.429
0.0CysXaa: 0.0 ± 0.0
Asp
2.292AspAla: 2.292 ± 0.673
0.859AspCys: 0.859 ± 0.26
1.432AspAsp: 1.432 ± 0.66
3.437AspGlu: 3.437 ± 0.84
5.156AspPhe: 5.156 ± 1.326
2.005AspGly: 2.005 ± 0.619
0.286AspHis: 0.286 ± 0.349
3.437AspIle: 3.437 ± 0.42
3.437AspLys: 3.437 ± 0.712
4.297AspLeu: 4.297 ± 0.849
0.286AspMet: 0.286 ± 0.168
4.583AspAsn: 4.583 ± 1.061
0.286AspPro: 0.286 ± 0.168
2.005AspGln: 2.005 ± 0.518
3.724AspArg: 3.724 ± 1.116
3.724AspSer: 3.724 ± 0.879
2.005AspThr: 2.005 ± 0.795
4.583AspVal: 4.583 ± 0.856
0.286AspTrp: 0.286 ± 0.168
1.719AspTyr: 1.719 ± 0.519
0.0AspXaa: 0.0 ± 0.0
Glu
2.578GluAla: 2.578 ± 0.489
0.573GluCys: 0.573 ± 0.315
2.578GluAsp: 2.578 ± 0.559
4.01GluGlu: 4.01 ± 1.324
4.01GluPhe: 4.01 ± 1.682
2.865GluGly: 2.865 ± 0.901
1.719GluHis: 1.719 ± 0.875
3.724GluIle: 3.724 ± 1.347
4.583GluLys: 4.583 ± 1.588
7.734GluLeu: 7.734 ± 1.693
0.859GluMet: 0.859 ± 0.364
3.151GluAsn: 3.151 ± 0.973
2.005GluPro: 2.005 ± 0.525
1.719GluGln: 1.719 ± 0.757
1.146GluArg: 1.146 ± 0.472
1.432GluSer: 1.432 ± 0.608
3.724GluThr: 3.724 ± 0.873
4.87GluVal: 4.87 ± 1.05
0.286GluTrp: 0.286 ± 0.168
2.292GluTyr: 2.292 ± 0.53
0.0GluXaa: 0.0 ± 0.0
Phe
2.578PheAla: 2.578 ± 1.812
3.151PheCys: 3.151 ± 1.143
4.01PheAsp: 4.01 ± 1.385
4.583PheGlu: 4.583 ± 1.198
4.297PhePhe: 4.297 ± 1.902
4.583PheGly: 4.583 ± 0.819
2.578PheHis: 2.578 ± 1.365
4.01PheIle: 4.01 ± 1.411
3.437PheLys: 3.437 ± 0.842
7.161PheLeu: 7.161 ± 1.901
1.146PheMet: 1.146 ± 0.53
4.01PheAsn: 4.01 ± 0.958
2.292PhePro: 2.292 ± 0.693
0.859PheGln: 0.859 ± 0.26
2.005PheArg: 2.005 ± 1.22
6.875PheSer: 6.875 ± 0.895
5.443PheThr: 5.443 ± 1.408
4.01PheVal: 4.01 ± 0.787
0.0PheTrp: 0.0 ± 0.0
2.005PheTyr: 2.005 ± 0.85
0.0PheXaa: 0.0 ± 0.0
Gly
2.005GlyAla: 2.005 ± 0.693
0.573GlyCys: 0.573 ± 0.227
2.005GlyAsp: 2.005 ± 0.534
2.005GlyGlu: 2.005 ± 0.826
3.151GlyPhe: 3.151 ± 0.472
2.292GlyGly: 2.292 ± 0.546
0.286GlyHis: 0.286 ± 0.607
4.01GlyIle: 4.01 ± 0.668
4.01GlyLys: 4.01 ± 1.279
4.297GlyLeu: 4.297 ± 0.646
0.286GlyMet: 0.286 ± 0.364
3.724GlyAsn: 3.724 ± 1.211
1.432GlyPro: 1.432 ± 0.671
1.146GlyGln: 1.146 ± 0.736
3.437GlyArg: 3.437 ± 0.777
2.005GlySer: 2.005 ± 0.815
4.583GlyThr: 4.583 ± 1.636
4.01GlyVal: 4.01 ± 0.74
0.286GlyTrp: 0.286 ± 0.304
1.432GlyTyr: 1.432 ± 0.605
0.0GlyXaa: 0.0 ± 0.0
His
1.432HisAla: 1.432 ± 0.669
0.0HisCys: 0.0 ± 0.0
1.432HisAsp: 1.432 ± 0.538
0.573HisGlu: 0.573 ± 0.551
0.859HisPhe: 0.859 ± 0.26
0.859HisGly: 0.859 ± 0.543
0.286HisHis: 0.286 ± 0.304
1.719HisIle: 1.719 ± 0.519
1.719HisLys: 1.719 ± 1.405
1.432HisLeu: 1.432 ± 0.278
0.286HisMet: 0.286 ± 0.168
1.719HisAsn: 1.719 ± 0.699
0.859HisPro: 0.859 ± 0.576
0.286HisGln: 0.286 ± 0.304
1.432HisArg: 1.432 ± 0.608
1.146HisSer: 1.146 ± 0.466
0.573HisThr: 0.573 ± 0.227
1.719HisVal: 1.719 ± 1.257
0.0HisTrp: 0.0 ± 0.0
0.573HisTyr: 0.573 ± 0.336
0.0HisXaa: 0.0 ± 0.0
Ile
2.865IleAla: 2.865 ± 1.27
1.719IleCys: 1.719 ± 1.02
4.87IleAsp: 4.87 ± 1.486
2.578IleGlu: 2.578 ± 0.897
6.588IlePhe: 6.588 ± 3.401
4.01IleGly: 4.01 ± 0.858
1.719IleHis: 1.719 ± 1.02
8.021IleIle: 8.021 ± 3.486
5.156IleLys: 5.156 ± 1.402
8.594IleLeu: 8.594 ± 2.606
1.432IleMet: 1.432 ± 0.563
4.583IleAsn: 4.583 ± 1.354
4.87IlePro: 4.87 ± 0.707
1.432IleGln: 1.432 ± 0.794
3.724IleArg: 3.724 ± 0.861
6.302IleSer: 6.302 ± 0.752
3.437IleThr: 3.437 ± 0.934
5.443IleVal: 5.443 ± 0.779
0.286IleTrp: 0.286 ± 0.168
3.437IleTyr: 3.437 ± 0.737
0.0IleXaa: 0.0 ± 0.0
Lys
4.01LysAla: 4.01 ± 0.965
1.146LysCys: 1.146 ± 0.673
3.724LysAsp: 3.724 ± 1.507
4.01LysGlu: 4.01 ± 0.937
5.156LysPhe: 5.156 ± 1.631
2.005LysGly: 2.005 ± 0.805
1.146LysHis: 1.146 ± 1.249
8.88LysIle: 8.88 ± 0.817
4.01LysLys: 4.01 ± 0.787
8.88LysLeu: 8.88 ± 1.686
1.719LysMet: 1.719 ± 0.573
5.443LysAsn: 5.443 ± 1.059
2.578LysPro: 2.578 ± 1.157
1.146LysGln: 1.146 ± 0.504
2.578LysArg: 2.578 ± 1.162
4.87LysSer: 4.87 ± 0.931
5.443LysThr: 5.443 ± 2.24
4.583LysVal: 4.583 ± 1.31
1.146LysTrp: 1.146 ± 0.472
4.01LysTyr: 4.01 ± 1.002
0.0LysXaa: 0.0 ± 0.0
Leu
3.724LeuAla: 3.724 ± 1.94
1.146LeuCys: 1.146 ± 0.472
5.443LeuAsp: 5.443 ± 1.678
6.588LeuGlu: 6.588 ± 2.025
3.724LeuPhe: 3.724 ± 1.815
6.015LeuGly: 6.015 ± 2.159
2.292LeuHis: 2.292 ± 0.371
8.307LeuIle: 8.307 ± 2.773
8.021LeuLys: 8.021 ± 2.09
10.312LeuLeu: 10.312 ± 4.454
1.146LeuMet: 1.146 ± 0.637
6.588LeuAsn: 6.588 ± 1.351
5.443LeuPro: 5.443 ± 1.171
2.865LeuGln: 2.865 ± 0.901
3.724LeuArg: 3.724 ± 1.121
8.594LeuSer: 8.594 ± 1.128
4.583LeuThr: 4.583 ± 1.072
10.026LeuVal: 10.026 ± 1.995
0.573LeuTrp: 0.573 ± 0.336
2.865LeuTyr: 2.865 ± 1.402
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.432MetAsp: 1.432 ± 0.951
0.859MetGlu: 0.859 ± 0.416
1.432MetPhe: 1.432 ± 0.278
0.0MetGly: 0.0 ± 0.0
0.573MetHis: 0.573 ± 0.315
1.719MetIle: 1.719 ± 0.556
0.286MetLys: 0.286 ± 0.349
1.432MetLeu: 1.432 ± 0.518
0.573MetMet: 0.573 ± 0.227
1.146MetAsn: 1.146 ± 0.237
0.286MetPro: 0.286 ± 0.168
0.286MetGln: 0.286 ± 0.349
0.573MetArg: 0.573 ± 0.336
1.432MetSer: 1.432 ± 0.563
1.146MetThr: 1.146 ± 0.374
0.859MetVal: 0.859 ± 0.751
0.286MetTrp: 0.286 ± 0.349
0.859MetTyr: 0.859 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
2.865AsnAla: 2.865 ± 0.428
0.859AsnCys: 0.859 ± 0.657
3.437AsnAsp: 3.437 ± 1.031
3.437AsnGlu: 3.437 ± 1.121
4.87AsnPhe: 4.87 ± 1.309
2.292AsnGly: 2.292 ± 0.436
1.719AsnHis: 1.719 ± 0.757
6.302AsnIle: 6.302 ± 0.767
6.015AsnLys: 6.015 ± 1.332
6.875AsnLeu: 6.875 ± 1.118
1.432AsnMet: 1.432 ± 0.278
4.87AsnAsn: 4.87 ± 0.729
2.005AsnPro: 2.005 ± 1.11
0.573AsnGln: 0.573 ± 0.403
3.437AsnArg: 3.437 ± 1.008
4.87AsnSer: 4.87 ± 0.509
4.01AsnThr: 4.01 ± 2.059
5.729AsnVal: 5.729 ± 1.992
0.573AsnTrp: 0.573 ± 0.315
2.865AsnTyr: 2.865 ± 1.072
0.0AsnXaa: 0.0 ± 0.0
Pro
1.432ProAla: 1.432 ± 0.669
0.573ProCys: 0.573 ± 0.609
1.432ProAsp: 1.432 ± 0.278
1.432ProGlu: 1.432 ± 0.841
2.292ProPhe: 2.292 ± 1.395
0.859ProGly: 0.859 ± 0.364
1.146ProHis: 1.146 ± 0.374
2.865ProIle: 2.865 ± 0.891
2.292ProLys: 2.292 ± 0.944
4.87ProLeu: 4.87 ± 0.592
0.573ProMet: 0.573 ± 0.315
1.146ProAsn: 1.146 ± 0.237
2.005ProPro: 2.005 ± 0.538
1.146ProGln: 1.146 ± 1.409
2.005ProArg: 2.005 ± 0.675
2.005ProSer: 2.005 ± 0.402
3.437ProThr: 3.437 ± 0.836
3.151ProVal: 3.151 ± 0.476
0.286ProTrp: 0.286 ± 0.607
1.719ProTyr: 1.719 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
1.146GlnAla: 1.146 ± 0.585
0.859GlnCys: 0.859 ± 0.364
0.573GlnAsp: 0.573 ± 0.551
1.432GlnGlu: 1.432 ± 0.437
1.719GlnPhe: 1.719 ± 0.421
1.146GlnGly: 1.146 ± 0.629
0.0GlnHis: 0.0 ± 0.0
2.005GlnIle: 2.005 ± 0.534
1.719GlnLys: 1.719 ± 0.728
2.005GlnLeu: 2.005 ± 0.675
0.0GlnMet: 0.0 ± 0.0
0.286GlnAsn: 0.286 ± 0.456
1.146GlnPro: 1.146 ± 0.472
0.573GlnGln: 0.573 ± 0.336
2.005GlnArg: 2.005 ± 0.903
2.865GlnSer: 2.865 ± 2.208
1.146GlnThr: 1.146 ± 0.613
1.146GlnVal: 1.146 ± 0.633
0.286GlnTrp: 0.286 ± 0.349
0.859GlnTyr: 0.859 ± 1.048
0.0GlnXaa: 0.0 ± 0.0
Arg
1.432ArgAla: 1.432 ± 0.608
0.859ArgCys: 0.859 ± 0.26
1.432ArgAsp: 1.432 ± 0.458
1.719ArgGlu: 1.719 ± 0.757
3.151ArgPhe: 3.151 ± 1.105
2.578ArgGly: 2.578 ± 1.162
0.286ArgHis: 0.286 ± 0.168
2.578ArgIle: 2.578 ± 0.563
5.156ArgLys: 5.156 ± 1.374
5.156ArgLeu: 5.156 ± 0.949
1.146ArgMet: 1.146 ± 0.374
6.875ArgAsn: 6.875 ± 0.946
0.286ArgPro: 0.286 ± 0.168
0.859ArgGln: 0.859 ± 0.26
1.432ArgArg: 1.432 ± 0.278
3.437ArgSer: 3.437 ± 1.125
1.146ArgThr: 1.146 ± 0.237
3.151ArgVal: 3.151 ± 0.729
0.0ArgTrp: 0.0 ± 0.0
2.005ArgTyr: 2.005 ± 1.641
0.0ArgXaa: 0.0 ± 0.0
Ser
1.719SerAla: 1.719 ± 0.394
1.146SerCys: 1.146 ± 0.52
2.865SerAsp: 2.865 ± 1.459
4.01SerGlu: 4.01 ± 0.269
5.156SerPhe: 5.156 ± 0.659
4.87SerGly: 4.87 ± 1.24
1.432SerHis: 1.432 ± 0.608
4.01SerIle: 4.01 ± 0.998
8.021SerLys: 8.021 ± 2.496
6.588SerLeu: 6.588 ± 1.783
0.573SerMet: 0.573 ± 0.336
5.156SerAsn: 5.156 ± 1.263
2.292SerPro: 2.292 ± 0.371
2.005SerGln: 2.005 ± 0.733
3.151SerArg: 3.151 ± 1.105
7.734SerSer: 7.734 ± 3.353
4.583SerThr: 4.583 ± 2.136
6.302SerVal: 6.302 ± 1.374
0.286SerTrp: 0.286 ± 0.168
2.578SerTyr: 2.578 ± 0.923
0.0SerXaa: 0.0 ± 0.0
Thr
2.292ThrAla: 2.292 ± 0.714
1.719ThrCys: 1.719 ± 0.674
1.719ThrAsp: 1.719 ± 0.613
2.865ThrGlu: 2.865 ± 0.717
4.583ThrPhe: 4.583 ± 0.575
1.719ThrGly: 1.719 ± 0.841
1.146ThrHis: 1.146 ± 0.585
5.729ThrIle: 5.729 ± 1.346
5.156ThrLys: 5.156 ± 0.894
3.437ThrLeu: 3.437 ± 1.375
0.859ThrMet: 0.859 ± 0.45
4.297ThrAsn: 4.297 ± 1.493
1.432ThrPro: 1.432 ± 1.217
0.859ThrGln: 0.859 ± 0.617
3.437ThrArg: 3.437 ± 1.727
5.156ThrSer: 5.156 ± 0.672
4.87ThrThr: 4.87 ± 1.056
5.729ThrVal: 5.729 ± 1.031
0.859ThrTrp: 0.859 ± 0.364
1.432ThrTyr: 1.432 ± 0.841
0.0ThrXaa: 0.0 ± 0.0
Val
2.578ValAla: 2.578 ± 0.907
1.432ValCys: 1.432 ± 0.278
4.583ValAsp: 4.583 ± 0.954
5.156ValGlu: 5.156 ± 1.064
6.015ValPhe: 6.015 ± 1.988
3.437ValGly: 3.437 ± 0.711
0.573ValHis: 0.573 ± 0.227
6.015ValIle: 6.015 ± 0.907
5.443ValLys: 5.443 ± 0.747
8.594ValLeu: 8.594 ± 0.934
0.573ValMet: 0.573 ± 0.699
4.01ValAsn: 4.01 ± 0.781
3.724ValPro: 3.724 ± 1.347
1.432ValGln: 1.432 ± 0.768
2.865ValArg: 2.865 ± 0.876
4.01ValSer: 4.01 ± 0.904
4.297ValThr: 4.297 ± 1.537
3.724ValVal: 3.724 ± 1.099
0.286ValTrp: 0.286 ± 0.168
4.297ValTyr: 4.297 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
0.573TrpAla: 0.573 ± 0.336
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.286TrpGlu: 0.286 ± 0.168
0.573TrpPhe: 0.573 ± 0.683
0.286TrpGly: 0.286 ± 0.168
0.0TrpHis: 0.0 ± 0.0
0.859TrpIle: 0.859 ± 0.643
0.286TrpLys: 0.286 ± 0.168
0.573TrpLeu: 0.573 ± 0.336
0.0TrpMet: 0.0 ± 0.0
0.286TrpAsn: 0.286 ± 0.168
0.0TrpPro: 0.0 ± 0.0
0.286TrpGln: 0.286 ± 0.168
0.573TrpArg: 0.573 ± 0.315
0.286TrpSer: 0.286 ± 0.168
0.286TrpThr: 0.286 ± 0.349
0.286TrpVal: 0.286 ± 0.349
0.286TrpTrp: 0.286 ± 0.168
0.286TrpTyr: 0.286 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.859TyrAla: 0.859 ± 0.364
0.859TyrCys: 0.859 ± 0.576
4.297TyrAsp: 4.297 ± 1.32
2.005TyrGlu: 2.005 ± 0.534
3.437TyrPhe: 3.437 ± 0.688
1.432TyrGly: 1.432 ± 0.841
1.146TyrHis: 1.146 ± 0.633
2.578TyrIle: 2.578 ± 1.162
3.151TyrLys: 3.151 ± 1.212
3.724TyrLeu: 3.724 ± 0.861
0.859TyrMet: 0.859 ± 0.51
3.151TyrAsn: 3.151 ± 0.544
1.432TyrPro: 1.432 ± 0.407
1.432TyrGln: 1.432 ± 0.538
1.719TyrArg: 1.719 ± 1.036
2.005TyrSer: 2.005 ± 0.805
2.005TyrThr: 2.005 ± 0.374
1.719TyrVal: 1.719 ± 0.342
0.0TyrTrp: 0.0 ± 0.0
2.865TyrTyr: 2.865 ± 0.754
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3492 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski