Amino acid dipepetide frequency for Drosophila obscura sigmavirus 10A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.469AlaAla: 2.469 ± 1.799
0.494AlaCys: 0.494 ± 0.447
2.469AlaAsp: 2.469 ± 1.03
3.209AlaGlu: 3.209 ± 0.544
2.469AlaPhe: 2.469 ± 0.474
2.715AlaGly: 2.715 ± 0.666
0.741AlaHis: 0.741 ± 0.436
4.69AlaIle: 4.69 ± 0.418
1.481AlaLys: 1.481 ± 0.668
6.418AlaLeu: 6.418 ± 1.125
0.494AlaMet: 0.494 ± 0.281
1.975AlaAsn: 1.975 ± 0.477
1.728AlaPro: 1.728 ± 0.763
1.234AlaGln: 1.234 ± 0.28
1.481AlaArg: 1.481 ± 0.581
4.69AlaSer: 4.69 ± 1.071
3.456AlaThr: 3.456 ± 0.709
2.469AlaVal: 2.469 ± 1.086
0.987AlaTrp: 0.987 ± 0.363
1.975AlaTyr: 1.975 ± 1.124
0.0AlaXaa: 0.0 ± 0.0
Cys
0.494CysAla: 0.494 ± 0.281
0.0CysCys: 0.0 ± 0.0
0.494CysAsp: 0.494 ± 0.254
0.741CysGlu: 0.741 ± 0.333
0.247CysPhe: 0.247 ± 0.145
0.741CysGly: 0.741 ± 0.333
0.494CysHis: 0.494 ± 0.5
0.987CysIle: 0.987 ± 0.504
0.494CysLys: 0.494 ± 0.291
1.234CysLeu: 1.234 ± 0.545
0.494CysMet: 0.494 ± 0.254
1.234CysAsn: 1.234 ± 0.378
0.247CysPro: 0.247 ± 0.298
0.494CysGln: 0.494 ± 0.426
0.494CysArg: 0.494 ± 0.281
0.494CysSer: 0.494 ± 0.291
0.741CysThr: 0.741 ± 0.534
1.481CysVal: 1.481 ± 0.342
0.494CysTrp: 0.494 ± 0.291
0.247CysTyr: 0.247 ± 0.298
0.0CysXaa: 0.0 ± 0.0
Asp
2.222AspAla: 2.222 ± 0.908
0.741AspCys: 0.741 ± 0.436
3.456AspAsp: 3.456 ± 1.206
3.456AspGlu: 3.456 ± 0.915
0.741AspPhe: 0.741 ± 0.286
1.975AspGly: 1.975 ± 0.334
0.741AspHis: 0.741 ± 0.436
1.728AspIle: 1.728 ± 0.592
2.962AspLys: 2.962 ± 0.858
4.937AspLeu: 4.937 ± 0.888
0.987AspMet: 0.987 ± 0.444
2.715AspAsn: 2.715 ± 0.861
4.443AspPro: 4.443 ± 0.624
0.987AspGln: 0.987 ± 0.372
2.715AspArg: 2.715 ± 0.478
4.937AspSer: 4.937 ± 1.466
3.95AspThr: 3.95 ± 1.036
1.975AspVal: 1.975 ± 0.677
0.987AspTrp: 0.987 ± 0.277
3.456AspTyr: 3.456 ± 0.964
0.0AspXaa: 0.0 ± 0.0
Glu
3.456GluAla: 3.456 ± 1.855
1.234GluCys: 1.234 ± 0.938
4.196GluAsp: 4.196 ± 0.332
4.937GluGlu: 4.937 ± 2.152
2.715GluPhe: 2.715 ± 0.719
4.443GluGly: 4.443 ± 0.463
1.481GluHis: 1.481 ± 0.45
5.184GluIle: 5.184 ± 1.537
4.443GluLys: 4.443 ± 0.907
6.912GluLeu: 6.912 ± 1.697
0.987GluMet: 0.987 ± 0.359
3.209GluAsn: 3.209 ± 0.807
0.987GluPro: 0.987 ± 0.551
1.481GluGln: 1.481 ± 0.35
2.715GluArg: 2.715 ± 0.675
2.962GluSer: 2.962 ± 0.936
3.456GluThr: 3.456 ± 0.709
3.456GluVal: 3.456 ± 0.789
1.728GluTrp: 1.728 ± 0.816
2.962GluTyr: 2.962 ± 0.741
0.0GluXaa: 0.0 ± 0.0
Phe
1.234PheAla: 1.234 ± 0.883
0.987PheCys: 0.987 ± 0.277
1.481PheAsp: 1.481 ± 0.53
2.962PheGlu: 2.962 ± 0.446
2.222PhePhe: 2.222 ± 1.122
1.234PheGly: 1.234 ± 0.605
1.728PheHis: 1.728 ± 0.512
2.469PheIle: 2.469 ± 0.864
3.209PheLys: 3.209 ± 0.939
4.69PheLeu: 4.69 ± 1.062
0.987PheMet: 0.987 ± 0.377
1.975PheAsn: 1.975 ± 0.716
2.962PhePro: 2.962 ± 0.773
1.234PheGln: 1.234 ± 0.727
3.209PheArg: 3.209 ± 1.292
3.703PheSer: 3.703 ± 1.032
1.728PheThr: 1.728 ± 0.559
2.715PheVal: 2.715 ± 0.678
0.741PheTrp: 0.741 ± 0.386
0.741PheTyr: 0.741 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
2.715GlyAla: 2.715 ± 1.239
0.494GlyCys: 0.494 ± 0.254
3.456GlyAsp: 3.456 ± 0.664
2.469GlyGlu: 2.469 ± 0.515
1.481GlyPhe: 1.481 ± 0.399
2.962GlyGly: 2.962 ± 0.713
1.481GlyHis: 1.481 ± 0.392
3.703GlyIle: 3.703 ± 0.906
2.715GlyLys: 2.715 ± 0.991
7.406GlyLeu: 7.406 ± 2.261
0.247GlyMet: 0.247 ± 0.311
1.728GlyAsn: 1.728 ± 1.13
2.962GlyPro: 2.962 ± 0.89
2.715GlyGln: 2.715 ± 0.477
2.962GlyArg: 2.962 ± 0.244
6.665GlySer: 6.665 ± 1.537
1.481GlyThr: 1.481 ± 0.35
4.443GlyVal: 4.443 ± 1.122
0.987GlyTrp: 0.987 ± 0.581
1.975GlyTyr: 1.975 ± 0.535
0.0GlyXaa: 0.0 ± 0.0
His
0.247HisAla: 0.247 ± 0.145
0.247HisCys: 0.247 ± 0.145
1.975HisAsp: 1.975 ± 0.753
1.234HisGlu: 1.234 ± 0.853
1.234HisPhe: 1.234 ± 0.552
1.481HisGly: 1.481 ± 0.71
0.987HisHis: 0.987 ± 0.691
0.987HisIle: 0.987 ± 0.377
2.222HisLys: 2.222 ± 0.541
2.962HisLeu: 2.962 ± 0.835
0.741HisMet: 0.741 ± 0.291
0.987HisAsn: 0.987 ± 0.444
1.728HisPro: 1.728 ± 0.457
0.494HisGln: 0.494 ± 0.317
1.481HisArg: 1.481 ± 0.443
1.975HisSer: 1.975 ± 0.508
0.987HisThr: 0.987 ± 0.435
0.494HisVal: 0.494 ± 0.291
0.741HisTrp: 0.741 ± 0.386
1.481HisTyr: 1.481 ± 0.35
0.0HisXaa: 0.0 ± 0.0
Ile
3.95IleAla: 3.95 ± 1.635
0.494IleCys: 0.494 ± 0.291
3.95IleAsp: 3.95 ± 0.781
4.69IleGlu: 4.69 ± 0.746
3.209IlePhe: 3.209 ± 0.777
5.184IleGly: 5.184 ± 1.543
1.481IleHis: 1.481 ± 0.53
3.95IleIle: 3.95 ± 0.962
6.171IleLys: 6.171 ± 1.269
7.406IleLeu: 7.406 ± 1.305
1.234IleMet: 1.234 ± 0.727
3.456IleAsn: 3.456 ± 0.752
4.937IlePro: 4.937 ± 1.17
1.975IleGln: 1.975 ± 0.683
5.184IleArg: 5.184 ± 1.03
4.196IleSer: 4.196 ± 0.796
4.196IleThr: 4.196 ± 0.59
3.209IleVal: 3.209 ± 1.092
0.987IleTrp: 0.987 ± 0.356
2.962IleTyr: 2.962 ± 0.512
0.0IleXaa: 0.0 ± 0.0
Lys
1.481LysAla: 1.481 ± 0.35
0.741LysCys: 0.741 ± 0.436
3.456LysAsp: 3.456 ± 0.905
5.678LysGlu: 5.678 ± 1.06
2.469LysPhe: 2.469 ± 0.474
2.962LysGly: 2.962 ± 0.45
1.481LysHis: 1.481 ± 0.436
6.171LysIle: 6.171 ± 1.589
5.184LysLys: 5.184 ± 0.854
5.184LysLeu: 5.184 ± 1.32
3.95LysMet: 3.95 ± 1.404
3.456LysAsn: 3.456 ± 1.112
2.962LysPro: 2.962 ± 0.512
2.962LysGln: 2.962 ± 0.952
2.469LysArg: 2.469 ± 0.817
5.924LysSer: 5.924 ± 1.204
4.196LysThr: 4.196 ± 0.639
3.703LysVal: 3.703 ± 1.202
1.481LysTrp: 1.481 ± 0.418
2.962LysTyr: 2.962 ± 1.15
0.0LysXaa: 0.0 ± 0.0
Leu
7.406LeuAla: 7.406 ± 1.913
1.234LeuCys: 1.234 ± 0.504
4.937LeuAsp: 4.937 ± 0.253
5.184LeuGlu: 5.184 ± 1.214
4.69LeuPhe: 4.69 ± 1.076
6.912LeuGly: 6.912 ± 1.114
2.222LeuHis: 2.222 ± 0.866
8.146LeuIle: 8.146 ± 1.875
5.924LeuLys: 5.924 ± 0.984
8.64LeuLeu: 8.64 ± 1.871
3.95LeuMet: 3.95 ± 1.089
7.159LeuAsn: 7.159 ± 1.176
2.715LeuPro: 2.715 ± 1.59
3.95LeuGln: 3.95 ± 1.235
5.678LeuArg: 5.678 ± 1.265
7.406LeuSer: 7.406 ± 1.695
7.406LeuThr: 7.406 ± 1.424
4.937LeuVal: 4.937 ± 0.998
1.234LeuTrp: 1.234 ± 0.494
2.715LeuTyr: 2.715 ± 0.911
0.0LeuXaa: 0.0 ± 0.0
Met
1.234MetAla: 1.234 ± 0.471
0.247MetCys: 0.247 ± 0.145
1.234MetAsp: 1.234 ± 0.521
1.728MetGlu: 1.728 ± 0.365
1.481MetPhe: 1.481 ± 0.497
1.234MetGly: 1.234 ± 0.503
0.0MetHis: 0.0 ± 0.0
2.469MetIle: 2.469 ± 0.942
2.469MetLys: 2.469 ± 0.605
1.234MetLeu: 1.234 ± 0.727
0.494MetMet: 0.494 ± 0.291
0.987MetAsn: 0.987 ± 0.388
0.247MetPro: 0.247 ± 0.298
0.741MetGln: 0.741 ± 0.352
1.481MetArg: 1.481 ± 0.69
2.715MetSer: 2.715 ± 1.178
0.987MetThr: 0.987 ± 0.772
0.987MetVal: 0.987 ± 0.277
0.0MetTrp: 0.0 ± 0.0
0.987MetTyr: 0.987 ± 0.43
0.0MetXaa: 0.0 ± 0.0
Asn
1.728AsnAla: 1.728 ± 0.827
0.247AsnCys: 0.247 ± 0.311
1.728AsnAsp: 1.728 ± 0.513
2.469AsnGlu: 2.469 ± 0.481
2.469AsnPhe: 2.469 ± 1.013
2.469AsnGly: 2.469 ± 1.111
2.222AsnHis: 2.222 ± 1.037
3.95AsnIle: 3.95 ± 1.28
2.715AsnLys: 2.715 ± 0.566
7.406AsnLeu: 7.406 ± 1.012
0.494AsnMet: 0.494 ± 0.451
2.222AsnAsn: 2.222 ± 0.713
5.678AsnPro: 5.678 ± 1.345
1.975AsnGln: 1.975 ± 0.612
3.95AsnArg: 3.95 ± 1.665
4.937AsnSer: 4.937 ± 0.8
2.222AsnThr: 2.222 ± 0.654
2.962AsnVal: 2.962 ± 0.633
0.741AsnTrp: 0.741 ± 0.286
2.715AsnTyr: 2.715 ± 1.303
0.0AsnXaa: 0.0 ± 0.0
Pro
3.95ProAla: 3.95 ± 0.7
0.247ProCys: 0.247 ± 0.298
4.196ProAsp: 4.196 ± 1.426
3.703ProGlu: 3.703 ± 1.197
0.494ProPhe: 0.494 ± 0.291
2.469ProGly: 2.469 ± 0.947
1.234ProHis: 1.234 ± 0.598
3.703ProIle: 3.703 ± 0.509
2.469ProLys: 2.469 ± 0.634
4.196ProLeu: 4.196 ± 0.92
0.741ProMet: 0.741 ± 0.539
1.975ProAsn: 1.975 ± 0.638
1.728ProPro: 1.728 ± 0.509
1.234ProGln: 1.234 ± 0.497
1.728ProArg: 1.728 ± 0.503
3.703ProSer: 3.703 ± 0.82
3.209ProThr: 3.209 ± 0.735
2.715ProVal: 2.715 ± 0.749
0.494ProTrp: 0.494 ± 0.281
1.234ProTyr: 1.234 ± 0.596
0.0ProXaa: 0.0 ± 0.0
Gln
1.975GlnAla: 1.975 ± 0.432
0.741GlnCys: 0.741 ± 0.352
0.741GlnAsp: 0.741 ± 0.416
2.222GlnGlu: 2.222 ± 0.38
1.728GlnPhe: 1.728 ± 0.678
1.728GlnGly: 1.728 ± 0.599
0.0GlnHis: 0.0 ± 0.0
1.728GlnIle: 1.728 ± 0.497
2.962GlnLys: 2.962 ± 0.724
3.456GlnLeu: 3.456 ± 0.911
0.247GlnMet: 0.247 ± 0.145
2.962GlnAsn: 2.962 ± 0.585
1.481GlnPro: 1.481 ± 0.497
0.987GlnGln: 0.987 ± 0.377
0.741GlnArg: 0.741 ± 0.286
2.469GlnSer: 2.469 ± 0.606
2.469GlnThr: 2.469 ± 0.279
1.975GlnVal: 1.975 ± 0.684
0.494GlnTrp: 0.494 ± 0.281
1.481GlnTyr: 1.481 ± 0.572
0.0GlnXaa: 0.0 ± 0.0
Arg
3.209ArgAla: 3.209 ± 0.888
0.741ArgCys: 0.741 ± 0.436
1.481ArgAsp: 1.481 ± 0.443
3.703ArgGlu: 3.703 ± 0.938
2.715ArgPhe: 2.715 ± 0.965
2.962ArgGly: 2.962 ± 0.849
2.222ArgHis: 2.222 ± 0.829
2.222ArgIle: 2.222 ± 0.599
2.715ArgLys: 2.715 ± 0.698
5.184ArgLeu: 5.184 ± 1.338
1.234ArgMet: 1.234 ± 0.471
3.703ArgAsn: 3.703 ± 0.798
1.481ArgPro: 1.481 ± 0.559
1.728ArgGln: 1.728 ± 0.559
1.481ArgArg: 1.481 ± 0.703
3.209ArgSer: 3.209 ± 1.208
2.715ArgThr: 2.715 ± 0.731
2.715ArgVal: 2.715 ± 0.662
1.481ArgTrp: 1.481 ± 0.35
1.481ArgTyr: 1.481 ± 0.579
0.0ArgXaa: 0.0 ± 0.0
Ser
4.69SerAla: 4.69 ± 1.275
0.494SerCys: 0.494 ± 0.254
3.456SerAsp: 3.456 ± 1.024
5.184SerGlu: 5.184 ± 1.561
2.469SerPhe: 2.469 ± 0.754
3.95SerGly: 3.95 ± 0.4
1.975SerHis: 1.975 ± 0.385
6.171SerIle: 6.171 ± 1.709
5.678SerLys: 5.678 ± 1.277
10.368SerLeu: 10.368 ± 1.476
1.234SerMet: 1.234 ± 0.494
3.703SerAsn: 3.703 ± 0.445
2.962SerPro: 2.962 ± 1.07
1.728SerGln: 1.728 ± 0.758
2.222SerArg: 2.222 ± 0.449
8.64SerSer: 8.64 ± 1.985
6.665SerThr: 6.665 ± 1.036
3.703SerVal: 3.703 ± 0.865
2.715SerTrp: 2.715 ± 0.577
3.209SerTyr: 3.209 ± 0.856
0.0SerXaa: 0.0 ± 0.0
Thr
1.234ThrAla: 1.234 ± 0.449
1.481ThrCys: 1.481 ± 0.422
2.962ThrAsp: 2.962 ± 0.711
3.456ThrGlu: 3.456 ± 1.247
2.962ThrPhe: 2.962 ± 0.773
2.222ThrGly: 2.222 ± 0.566
1.728ThrHis: 1.728 ± 0.922
5.184ThrIle: 5.184 ± 0.649
4.443ThrLys: 4.443 ± 0.946
6.418ThrLeu: 6.418 ± 0.876
2.222ThrMet: 2.222 ± 0.847
3.209ThrAsn: 3.209 ± 0.872
1.728ThrPro: 1.728 ± 0.792
2.469ThrGln: 2.469 ± 0.998
3.456ThrArg: 3.456 ± 0.804
4.196ThrSer: 4.196 ± 0.72
4.443ThrThr: 4.443 ± 0.77
4.443ThrVal: 4.443 ± 1.369
1.481ThrTrp: 1.481 ± 0.377
1.728ThrTyr: 1.728 ± 0.579
0.0ThrXaa: 0.0 ± 0.0
Val
1.481ValAla: 1.481 ± 0.752
0.494ValCys: 0.494 ± 0.426
2.469ValAsp: 2.469 ± 0.79
3.95ValGlu: 3.95 ± 1.379
2.469ValPhe: 2.469 ± 0.315
3.209ValGly: 3.209 ± 0.66
1.234ValHis: 1.234 ± 0.598
6.171ValIle: 6.171 ± 1.551
3.95ValLys: 3.95 ± 1.579
3.95ValLeu: 3.95 ± 0.701
0.987ValMet: 0.987 ± 0.435
4.69ValAsn: 4.69 ± 1.769
2.715ValPro: 2.715 ± 0.744
2.222ValGln: 2.222 ± 0.668
2.222ValArg: 2.222 ± 0.725
3.456ValSer: 3.456 ± 0.921
4.196ValThr: 4.196 ± 1.054
2.715ValVal: 2.715 ± 0.849
0.987ValTrp: 0.987 ± 0.372
2.469ValTyr: 2.469 ± 0.996
0.0ValXaa: 0.0 ± 0.0
Trp
0.987TrpAla: 0.987 ± 0.377
0.494TrpCys: 0.494 ± 0.254
0.494TrpAsp: 0.494 ± 0.291
1.481TrpGlu: 1.481 ± 0.422
1.975TrpPhe: 1.975 ± 0.801
1.728TrpGly: 1.728 ± 0.727
0.494TrpHis: 0.494 ± 0.291
1.728TrpIle: 1.728 ± 0.546
1.234TrpLys: 1.234 ± 0.34
0.741TrpLeu: 0.741 ± 0.342
0.494TrpMet: 0.494 ± 0.242
1.975TrpAsn: 1.975 ± 0.6
0.741TrpPro: 0.741 ± 0.286
0.247TrpGln: 0.247 ± 0.145
0.247TrpArg: 0.247 ± 0.145
1.975TrpSer: 1.975 ± 0.581
0.494TrpThr: 0.494 ± 0.691
1.234TrpVal: 1.234 ± 1.438
0.0TrpTrp: 0.0 ± 0.0
0.247TrpTyr: 0.247 ± 0.34
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.481TyrAla: 1.481 ± 0.497
0.494TyrCys: 0.494 ± 0.269
1.481TyrAsp: 1.481 ± 0.572
0.741TyrGlu: 0.741 ± 0.286
1.975TyrPhe: 1.975 ± 0.477
2.469TyrGly: 2.469 ± 0.538
0.741TyrHis: 0.741 ± 0.352
1.481TyrIle: 1.481 ± 0.471
5.431TyrLys: 5.431 ± 1.444
3.703TyrLeu: 3.703 ± 0.647
0.494TyrMet: 0.494 ± 0.291
1.975TyrAsn: 1.975 ± 0.553
0.987TyrPro: 0.987 ± 0.363
1.728TyrGln: 1.728 ± 0.989
2.469TyrArg: 2.469 ± 0.519
2.962TyrSer: 2.962 ± 0.737
2.469TyrThr: 2.469 ± 1.224
3.703TyrVal: 3.703 ± 0.789
0.247TyrTrp: 0.247 ± 0.34
1.481TyrTyr: 1.481 ± 0.509
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski