Amino acid dipepetide frequency for Drosophila immigrans sigmavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.762AlaAla: 2.762 ± 1.359
0.753AlaCys: 0.753 ± 0.336
1.758AlaAsp: 1.758 ± 0.494
2.26AlaGlu: 2.26 ± 1.1
2.26AlaPhe: 2.26 ± 0.297
4.018AlaGly: 4.018 ± 1.107
2.009AlaHis: 2.009 ± 0.484
3.516AlaIle: 3.516 ± 1.98
2.009AlaLys: 2.009 ± 0.663
7.283AlaLeu: 7.283 ± 2.284
0.502AlaMet: 0.502 ± 0.269
2.511AlaAsn: 2.511 ± 0.696
1.507AlaPro: 1.507 ± 1.16
2.511AlaGln: 2.511 ± 0.622
1.507AlaArg: 1.507 ± 0.965
3.767AlaSer: 3.767 ± 0.923
3.014AlaThr: 3.014 ± 0.924
3.014AlaVal: 3.014 ± 0.559
0.502AlaTrp: 0.502 ± 0.3
2.26AlaTyr: 2.26 ± 0.697
0.0AlaXaa: 0.0 ± 0.0
Cys
0.502CysAla: 0.502 ± 0.378
0.0CysCys: 0.0 ± 0.0
1.005CysAsp: 1.005 ± 0.533
0.251CysGlu: 0.251 ± 0.15
0.753CysPhe: 0.753 ± 0.45
0.251CysGly: 0.251 ± 0.295
0.0CysHis: 0.0 ± 0.0
0.753CysIle: 0.753 ± 0.45
1.256CysLys: 1.256 ± 0.514
1.005CysLeu: 1.005 ± 0.311
0.251CysMet: 0.251 ± 0.15
0.502CysAsn: 0.502 ± 0.378
0.753CysPro: 0.753 ± 0.641
0.251CysGln: 0.251 ± 0.328
1.758CysArg: 1.758 ± 0.466
1.005CysSer: 1.005 ± 0.559
0.753CysThr: 0.753 ± 0.416
1.256CysVal: 1.256 ± 0.402
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.762AspAla: 2.762 ± 1.044
0.753AspCys: 0.753 ± 0.621
1.507AspAsp: 1.507 ± 1.088
3.265AspGlu: 3.265 ± 1.539
2.762AspPhe: 2.762 ± 1.009
1.758AspGly: 1.758 ± 0.692
1.256AspHis: 1.256 ± 0.361
2.762AspIle: 2.762 ± 0.321
2.762AspLys: 2.762 ± 0.734
6.529AspLeu: 6.529 ± 0.824
1.758AspMet: 1.758 ± 1.096
2.009AspAsn: 2.009 ± 0.378
4.018AspPro: 4.018 ± 0.877
2.26AspGln: 2.26 ± 0.846
2.511AspArg: 2.511 ± 0.892
3.516AspSer: 3.516 ± 1.064
2.26AspThr: 2.26 ± 0.714
1.256AspVal: 1.256 ± 0.56
1.005AspTrp: 1.005 ± 0.284
2.762AspTyr: 2.762 ± 1.087
0.0AspXaa: 0.0 ± 0.0
Glu
1.256GluAla: 1.256 ± 1.114
0.251GluCys: 0.251 ± 0.295
4.018GluAsp: 4.018 ± 1.421
3.767GluGlu: 3.767 ± 0.746
2.762GluPhe: 2.762 ± 0.935
3.516GluGly: 3.516 ± 0.596
1.005GluHis: 1.005 ± 0.473
6.278GluIle: 6.278 ± 1.304
3.014GluLys: 3.014 ± 1.048
7.032GluLeu: 7.032 ± 1.471
1.507GluMet: 1.507 ± 0.77
2.26GluAsn: 2.26 ± 0.72
2.762GluPro: 2.762 ± 1.501
1.005GluGln: 1.005 ± 0.494
1.256GluArg: 1.256 ± 0.56
3.767GluSer: 3.767 ± 1.408
3.014GluThr: 3.014 ± 1.238
2.762GluVal: 2.762 ± 0.712
1.256GluTrp: 1.256 ± 1.385
1.256GluTyr: 1.256 ± 0.453
0.0GluXaa: 0.0 ± 0.0
Phe
2.009PheAla: 2.009 ± 0.484
0.753PheCys: 0.753 ± 0.648
2.26PheAsp: 2.26 ± 1.178
2.009PheGlu: 2.009 ± 0.636
2.26PhePhe: 2.26 ± 1.043
1.256PheGly: 1.256 ± 0.413
0.753PheHis: 0.753 ± 0.268
2.511PheIle: 2.511 ± 0.765
2.26PheLys: 2.26 ± 0.565
4.52PheLeu: 4.52 ± 1.282
0.753PheMet: 0.753 ± 0.52
2.26PheAsn: 2.26 ± 0.506
3.265PhePro: 3.265 ± 0.798
1.507PheGln: 1.507 ± 0.285
3.516PheArg: 3.516 ± 0.931
3.767PheSer: 3.767 ± 1.051
2.009PheThr: 2.009 ± 0.625
3.014PheVal: 3.014 ± 1.325
0.753PheTrp: 0.753 ± 0.45
0.502PheTyr: 0.502 ± 0.687
0.0PheXaa: 0.0 ± 0.0
Gly
3.014GlyAla: 3.014 ± 1.628
0.502GlyCys: 0.502 ± 0.3
4.018GlyAsp: 4.018 ± 0.954
1.758GlyGlu: 1.758 ± 0.814
2.511GlyPhe: 2.511 ± 1.192
2.511GlyGly: 2.511 ± 0.623
2.26GlyHis: 2.26 ± 1.005
3.014GlyIle: 3.014 ± 0.223
4.018GlyLys: 4.018 ± 0.728
7.032GlyLeu: 7.032 ± 1.318
1.005GlyMet: 1.005 ± 0.403
2.762GlyAsn: 2.762 ± 0.691
1.758GlyPro: 1.758 ± 0.718
2.009GlyGln: 2.009 ± 0.502
1.758GlyArg: 1.758 ± 0.748
5.274GlySer: 5.274 ± 1.806
3.516GlyThr: 3.516 ± 1.758
4.018GlyVal: 4.018 ± 1.832
1.758GlyTrp: 1.758 ± 0.72
2.511GlyTyr: 2.511 ± 0.89
0.0GlyXaa: 0.0 ± 0.0
His
0.753HisAla: 0.753 ± 0.43
0.753HisCys: 0.753 ± 0.45
1.005HisAsp: 1.005 ± 0.442
0.753HisGlu: 0.753 ± 0.338
0.502HisPhe: 0.502 ± 0.3
1.507HisGly: 1.507 ± 0.9
0.753HisHis: 0.753 ± 0.609
1.256HisIle: 1.256 ± 0.359
1.758HisLys: 1.758 ± 0.864
4.018HisLeu: 4.018 ± 0.822
0.502HisMet: 0.502 ± 0.269
1.005HisAsn: 1.005 ± 0.909
1.758HisPro: 1.758 ± 0.434
0.502HisGln: 0.502 ± 0.279
1.507HisArg: 1.507 ± 0.482
3.516HisSer: 3.516 ± 0.708
2.009HisThr: 2.009 ± 0.821
1.005HisVal: 1.005 ± 0.435
0.502HisTrp: 0.502 ± 0.3
2.009HisTyr: 2.009 ± 0.585
0.0HisXaa: 0.0 ± 0.0
Ile
3.014IleAla: 3.014 ± 0.941
1.005IleCys: 1.005 ± 0.6
3.014IleAsp: 3.014 ± 0.844
3.516IleGlu: 3.516 ± 0.556
3.265IlePhe: 3.265 ± 0.537
4.52IleGly: 4.52 ± 0.846
2.26IleHis: 2.26 ± 1.365
4.771IleIle: 4.771 ± 0.966
5.023IleLys: 5.023 ± 1.654
6.529IleLeu: 6.529 ± 1.912
1.507IleMet: 1.507 ± 0.583
3.265IleAsn: 3.265 ± 1.215
5.023IlePro: 5.023 ± 0.888
3.265IleGln: 3.265 ± 1.038
4.269IleArg: 4.269 ± 0.947
4.52IleSer: 4.52 ± 0.995
3.516IleThr: 3.516 ± 0.365
3.014IleVal: 3.014 ± 1.324
0.753IleTrp: 0.753 ± 0.268
2.511IleTyr: 2.511 ± 0.77
0.0IleXaa: 0.0 ± 0.0
Lys
2.009LysAla: 2.009 ± 0.963
1.256LysCys: 1.256 ± 0.469
1.507LysAsp: 1.507 ± 0.535
4.52LysGlu: 4.52 ± 1.179
1.507LysPhe: 1.507 ± 0.632
2.511LysGly: 2.511 ± 0.719
1.256LysHis: 1.256 ± 0.514
3.516LysIle: 3.516 ± 0.978
2.762LysLys: 2.762 ± 1.592
7.283LysLeu: 7.283 ± 0.924
2.009LysMet: 2.009 ± 0.293
3.516LysAsn: 3.516 ± 0.329
2.762LysPro: 2.762 ± 0.835
2.009LysGln: 2.009 ± 0.469
2.511LysArg: 2.511 ± 0.609
3.516LysSer: 3.516 ± 0.808
3.516LysThr: 3.516 ± 0.866
4.771LysVal: 4.771 ± 0.614
1.507LysTrp: 1.507 ± 0.702
2.26LysTyr: 2.26 ± 0.581
0.0LysXaa: 0.0 ± 0.0
Leu
7.534LeuAla: 7.534 ± 1.45
0.251LeuCys: 0.251 ± 0.295
6.278LeuAsp: 6.278 ± 0.571
7.032LeuGlu: 7.032 ± 1.726
4.771LeuPhe: 4.771 ± 1.086
7.283LeuGly: 7.283 ± 0.68
2.26LeuHis: 2.26 ± 1.044
8.036LeuIle: 8.036 ± 1.852
4.52LeuLys: 4.52 ± 1.245
9.041LeuLeu: 9.041 ± 2.075
3.265LeuMet: 3.265 ± 0.565
5.776LeuAsn: 5.776 ± 1.033
5.274LeuPro: 5.274 ± 1.153
3.014LeuGln: 3.014 ± 0.814
7.534LeuArg: 7.534 ± 1.644
8.538LeuSer: 8.538 ± 0.788
6.781LeuThr: 6.781 ± 0.735
4.018LeuVal: 4.018 ± 1.968
2.009LeuTrp: 2.009 ± 0.828
4.52LeuTyr: 4.52 ± 1.256
0.0LeuXaa: 0.0 ± 0.0
Met
1.256MetAla: 1.256 ± 0.442
0.251MetCys: 0.251 ± 0.435
1.005MetAsp: 1.005 ± 0.431
1.507MetGlu: 1.507 ± 0.754
1.758MetPhe: 1.758 ± 1.074
1.005MetGly: 1.005 ± 0.34
0.753MetHis: 0.753 ± 0.338
1.758MetIle: 1.758 ± 0.527
2.26MetLys: 2.26 ± 1.075
2.762MetLeu: 2.762 ± 1.234
1.005MetMet: 1.005 ± 0.311
1.758MetAsn: 1.758 ± 0.534
0.251MetPro: 0.251 ± 0.411
0.753MetGln: 0.753 ± 0.43
1.005MetArg: 1.005 ± 0.442
3.014MetSer: 3.014 ± 0.78
3.014MetThr: 3.014 ± 0.601
0.502MetVal: 0.502 ± 0.687
0.251MetTrp: 0.251 ± 0.15
0.753MetTyr: 0.753 ± 0.416
0.0MetXaa: 0.0 ± 0.0
Asn
3.516AsnAla: 3.516 ± 1.509
0.502AsnCys: 0.502 ± 0.3
2.009AsnAsp: 2.009 ± 0.946
1.507AsnGlu: 1.507 ± 0.342
2.26AsnPhe: 2.26 ± 0.558
2.762AsnGly: 2.762 ± 1.39
1.758AsnHis: 1.758 ± 1.05
3.516AsnIle: 3.516 ± 1.07
1.758AsnLys: 1.758 ± 0.543
7.032AsnLeu: 7.032 ± 2.087
1.507AsnMet: 1.507 ± 0.763
2.26AsnAsn: 2.26 ± 1.005
3.516AsnPro: 3.516 ± 1.213
2.762AsnGln: 2.762 ± 0.857
2.009AsnArg: 2.009 ± 0.968
4.771AsnSer: 4.771 ± 1.375
3.516AsnThr: 3.516 ± 0.638
2.511AsnVal: 2.511 ± 0.483
1.005AsnTrp: 1.005 ± 0.442
1.758AsnTyr: 1.758 ± 0.637
0.0AsnXaa: 0.0 ± 0.0
Pro
3.265ProAla: 3.265 ± 0.343
1.507ProCys: 1.507 ± 0.989
4.269ProAsp: 4.269 ± 1.045
4.52ProGlu: 4.52 ± 1.265
1.256ProPhe: 1.256 ± 0.601
3.014ProGly: 3.014 ± 0.957
1.507ProHis: 1.507 ± 0.285
4.269ProIle: 4.269 ± 0.62
1.758ProLys: 1.758 ± 0.636
3.767ProLeu: 3.767 ± 0.678
1.005ProMet: 1.005 ± 0.931
2.009ProAsn: 2.009 ± 0.727
2.511ProPro: 2.511 ± 0.839
1.256ProGln: 1.256 ± 0.811
2.009ProArg: 2.009 ± 0.453
5.023ProSer: 5.023 ± 1.123
3.265ProThr: 3.265 ± 0.52
4.018ProVal: 4.018 ± 0.945
0.502ProTrp: 0.502 ± 0.269
0.753ProTyr: 0.753 ± 0.338
0.0ProXaa: 0.0 ± 0.0
Gln
1.507GlnAla: 1.507 ± 0.504
0.502GlnCys: 0.502 ± 0.3
1.507GlnAsp: 1.507 ± 1.456
2.762GlnGlu: 2.762 ± 0.927
1.005GlnPhe: 1.005 ± 0.431
3.014GlnGly: 3.014 ± 1.201
0.502GlnHis: 0.502 ± 0.378
2.009GlnIle: 2.009 ± 1.61
2.26GlnLys: 2.26 ± 0.651
2.009GlnLeu: 2.009 ± 0.921
0.251GlnMet: 0.251 ± 0.15
3.516GlnAsn: 3.516 ± 0.742
0.502GlnPro: 0.502 ± 0.377
0.753GlnGln: 0.753 ± 0.49
1.758GlnArg: 1.758 ± 0.778
3.767GlnSer: 3.767 ± 1.023
3.767GlnThr: 3.767 ± 1.025
2.009GlnVal: 2.009 ± 0.624
0.251GlnTrp: 0.251 ± 0.344
1.507GlnTyr: 1.507 ± 0.631
0.0GlnXaa: 0.0 ± 0.0
Arg
1.758ArgAla: 1.758 ± 0.789
0.502ArgCys: 0.502 ± 0.3
3.265ArgAsp: 3.265 ± 1.121
4.269ArgGlu: 4.269 ± 0.855
2.009ArgPhe: 2.009 ± 0.862
2.762ArgGly: 2.762 ± 0.956
2.26ArgHis: 2.26 ± 0.769
1.507ArgIle: 1.507 ± 0.762
1.256ArgLys: 1.256 ± 0.361
5.525ArgLeu: 5.525 ± 1.535
1.758ArgMet: 1.758 ± 0.796
3.516ArgAsn: 3.516 ± 0.976
2.511ArgPro: 2.511 ± 0.664
1.005ArgGln: 1.005 ± 0.442
2.762ArgArg: 2.762 ± 1.405
3.014ArgSer: 3.014 ± 0.899
1.758ArgThr: 1.758 ± 1.05
4.52ArgVal: 4.52 ± 0.671
1.005ArgTrp: 1.005 ± 0.419
2.009ArgTyr: 2.009 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
3.265SerAla: 3.265 ± 0.811
0.753SerCys: 0.753 ± 0.332
4.018SerAsp: 4.018 ± 1.1
4.771SerGlu: 4.771 ± 1.02
3.014SerPhe: 3.014 ± 1.818
5.274SerGly: 5.274 ± 1.855
2.26SerHis: 2.26 ± 0.498
6.781SerIle: 6.781 ± 1.403
4.771SerLys: 4.771 ± 0.756
9.794SerLeu: 9.794 ± 1.969
4.269SerMet: 4.269 ± 1.036
4.771SerAsn: 4.771 ± 1.245
2.762SerPro: 2.762 ± 1.049
4.018SerGln: 4.018 ± 1.761
3.767SerArg: 3.767 ± 1.042
9.794SerSer: 9.794 ± 2.182
5.274SerThr: 5.274 ± 1.153
4.269SerVal: 4.269 ± 1.233
2.762SerTrp: 2.762 ± 1.033
2.511SerTyr: 2.511 ± 0.969
0.0SerXaa: 0.0 ± 0.0
Thr
2.511ThrAla: 2.511 ± 0.522
0.502ThrCys: 0.502 ± 0.279
2.511ThrAsp: 2.511 ± 1.033
2.762ThrGlu: 2.762 ± 1.523
2.511ThrPhe: 2.511 ± 0.757
2.762ThrGly: 2.762 ± 0.707
1.005ThrHis: 1.005 ± 0.34
4.269ThrIle: 4.269 ± 1.046
5.274ThrLys: 5.274 ± 1.004
7.032ThrLeu: 7.032 ± 1.159
1.758ThrMet: 1.758 ± 0.772
2.26ThrAsn: 2.26 ± 0.506
5.274ThrPro: 5.274 ± 2.657
1.507ThrGln: 1.507 ± 0.664
2.009ThrArg: 2.009 ± 0.652
6.027ThrSer: 6.027 ± 0.683
4.018ThrThr: 4.018 ± 1.018
3.265ThrVal: 3.265 ± 0.652
1.758ThrTrp: 1.758 ± 0.827
2.26ThrTyr: 2.26 ± 0.949
0.0ThrXaa: 0.0 ± 0.0
Val
3.014ValAla: 3.014 ± 1.33
1.256ValCys: 1.256 ± 0.402
3.014ValAsp: 3.014 ± 0.415
1.005ValGlu: 1.005 ± 0.435
2.762ValPhe: 2.762 ± 0.676
3.516ValGly: 3.516 ± 0.752
1.256ValHis: 1.256 ± 0.638
4.771ValIle: 4.771 ± 1.043
3.014ValLys: 3.014 ± 0.754
5.525ValLeu: 5.525 ± 0.659
0.502ValMet: 0.502 ± 0.3
4.018ValAsn: 4.018 ± 1.287
3.014ValPro: 3.014 ± 0.701
1.256ValGln: 1.256 ± 0.811
2.511ValArg: 2.511 ± 0.344
6.027ValSer: 6.027 ± 1.438
3.014ValThr: 3.014 ± 0.934
3.767ValVal: 3.767 ± 1.646
1.005ValTrp: 1.005 ± 0.56
2.009ValTyr: 2.009 ± 0.432
0.0ValXaa: 0.0 ± 0.0
Trp
1.256TrpAla: 1.256 ± 0.306
0.251TrpCys: 0.251 ± 0.295
0.753TrpAsp: 0.753 ± 0.268
0.753TrpGlu: 0.753 ± 0.401
1.005TrpPhe: 1.005 ± 0.284
1.758TrpGly: 1.758 ± 0.72
0.753TrpHis: 0.753 ± 0.45
1.256TrpIle: 1.256 ± 0.533
1.256TrpLys: 1.256 ± 0.533
0.753TrpLeu: 0.753 ± 0.599
0.251TrpMet: 0.251 ± 0.15
0.753TrpAsn: 0.753 ± 0.338
0.502TrpPro: 0.502 ± 0.3
0.753TrpGln: 0.753 ± 0.474
1.256TrpArg: 1.256 ± 0.539
2.762TrpSer: 2.762 ± 0.62
1.005TrpThr: 1.005 ± 0.6
1.256TrpVal: 1.256 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.502TrpTyr: 0.502 ± 0.3
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.762TyrAla: 2.762 ± 1.08
0.0TyrCys: 0.0 ± 0.0
0.753TyrAsp: 0.753 ± 0.42
0.753TyrGlu: 0.753 ± 0.377
1.005TyrPhe: 1.005 ± 0.538
2.009TyrGly: 2.009 ± 0.574
1.507TyrHis: 1.507 ± 0.518
2.009TyrIle: 2.009 ± 0.762
3.516TyrLys: 3.516 ± 0.452
3.265TyrLeu: 3.265 ± 1.114
1.005TyrMet: 1.005 ± 0.431
1.507TyrAsn: 1.507 ± 0.482
1.758TyrPro: 1.758 ± 0.842
2.762TyrGln: 2.762 ± 0.491
1.758TyrArg: 1.758 ± 0.504
3.767TyrSer: 3.767 ± 1.233
2.26TyrThr: 2.26 ± 0.917
2.009TyrVal: 2.009 ± 0.963
0.251TyrTrp: 0.251 ± 0.344
1.256TyrTyr: 1.256 ± 0.577
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3983 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski