Amino acid dipeptide frequency for Drosophila affinis sigmavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and those which are underrepresented are marked in blue.
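The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function name `dipeptide_permille` is hypothetical, sequences are assumed to use one-letter codes, pairs are counted as overlapping windows within each protein, and every non-standard letter is mapped to `X` (Xaa). The page does not state the exact counting convention it uses.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_permille(["ACA", "CA"])  # toy input, not real data
    print(len(freqs), round(freqs["CA"], 3), round(freqs["AC"], 3))
```

With the toy input the three observed pairs are AC, CA and CA, so CA accounts for two thirds of all dipeptides and AC for one third.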

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
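As a small worked example of the unit conversion, the Python sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical, and using the uniform 2.5‰ value as the threshold for "over" and "under" is an assumption; the page does not state which expectation the red and blue marking is based on.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a dipeptide relative to an expected per mille frequency.

    Assumption: the uniform expectation is used as the reference.
    """
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Two values taken from the table: AlaAla and AlaCys.
    for name, value in [("AlaAla", 3.186), ("AlaCys", 0.735)]:
        print(name, permille_to_fraction(value), classify(value))
```

Under this assumption AlaAla (3.186‰, a fraction of about 0.003186) lies above the uniform expectation, while AlaCys (0.735‰) lies below it.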

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.186 ± 0.771
AlaCys: 0.735 ± 0.525
AlaAsp: 2.696 ± 0.858
AlaGlu: 2.941 ± 0.524
AlaPhe: 1.225 ± 0.522
AlaGly: 1.716 ± 0.299
AlaHis: 0.735 ± 0.351
AlaIle: 3.431 ± 1.062
AlaLys: 2.451 ± 0.562
AlaLeu: 5.392 ± 0.951
AlaMet: 1.471 ± 0.487
AlaAsn: 2.206 ± 0.92
AlaPro: 1.471 ± 1.063
AlaGln: 2.206 ± 0.491
AlaArg: 2.206 ± 0.411
AlaSer: 4.167 ± 1.126
AlaThr: 3.676 ± 1.142
AlaVal: 1.716 ± 0.464
AlaTrp: 0.49 ± 0.242
AlaTyr: 2.451 ± 0.674
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.98 ± 0.865
CysCys: 0.49 ± 0.598
CysAsp: 1.225 ± 0.517
CysGlu: 0.49 ± 0.286
CysPhe: 0.98 ± 0.374
CysGly: 0.735 ± 0.429
CysHis: 0.245 ± 0.299
CysIle: 1.225 ± 0.537
CysLys: 1.716 ± 0.644
CysLeu: 2.696 ± 0.882
CysMet: 0.0 ± 0.0
CysAsn: 0.245 ± 0.143
CysPro: 0.98 ± 0.485
CysGln: 0.0 ± 0.0
CysArg: 0.98 ± 0.705
CysSer: 1.716 ± 0.629
CysThr: 0.98 ± 0.578
CysVal: 1.716 ± 0.802
CysTrp: 0.49 ± 0.286
CysTyr: 0.735 ± 0.351
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.471 ± 0.274
AspCys: 0.735 ± 0.299
AspAsp: 3.922 ± 0.78
AspGlu: 1.225 ± 0.407
AspPhe: 3.676 ± 1.482
AspGly: 2.941 ± 1.219
AspHis: 1.716 ± 0.442
AspIle: 3.922 ± 1.012
AspLys: 3.676 ± 0.944
AspLeu: 4.167 ± 1.278
AspMet: 1.961 ± 0.652
AspAsn: 3.676 ± 0.791
AspPro: 5.392 ± 0.992
AspGln: 2.451 ± 0.96
AspArg: 4.167 ± 0.689
AspSer: 3.922 ± 0.558
AspThr: 3.676 ± 1.483
AspVal: 3.186 ± 1.165
AspTrp: 0.98 ± 0.578
AspTyr: 2.696 ± 1.279
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.471 ± 0.364
GluCys: 0.98 ± 0.485
GluAsp: 2.696 ± 0.914
GluGlu: 2.696 ± 0.903
GluPhe: 3.431 ± 0.711
GluGly: 3.431 ± 0.954
GluHis: 1.471 ± 0.954
GluIle: 2.696 ± 0.649
GluLys: 2.206 ± 0.946
GluLeu: 4.412 ± 0.834
GluMet: 1.225 ± 0.441
GluAsn: 2.696 ± 0.895
GluPro: 1.716 ± 0.492
GluGln: 2.696 ± 0.568
GluArg: 2.451 ± 0.419
GluSer: 4.902 ± 1.136
GluThr: 5.147 ± 1.091
GluVal: 3.186 ± 0.563
GluTrp: 0.49 ± 0.353
GluTyr: 1.471 ± 0.723
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.961 ± 0.643
PheCys: 1.225 ± 0.485
PheAsp: 1.471 ± 0.483
PheGlu: 1.961 ± 0.38
PhePhe: 3.186 ± 1.076
PheGly: 2.941 ± 0.951
PheHis: 0.98 ± 0.347
PheIle: 2.941 ± 1.04
PheLys: 4.412 ± 0.976
PheLeu: 4.412 ± 1.517
PheMet: 0.98 ± 0.604
PheAsn: 1.471 ± 0.274
PhePro: 3.186 ± 0.673
PheGln: 1.225 ± 0.509
PheArg: 3.186 ± 1.14
PheSer: 2.941 ± 0.684
PheThr: 3.431 ± 0.751
PheVal: 2.206 ± 0.994
PheTrp: 0.245 ± 0.143
PheTyr: 0.735 ± 0.739
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.941 ± 0.663
GlyCys: 0.49 ± 0.286
GlyAsp: 3.186 ± 1.464
GlyGlu: 3.431 ± 1.338
GlyPhe: 2.451 ± 0.732
GlyGly: 3.922 ± 0.99
GlyHis: 2.941 ± 0.994
GlyIle: 1.961 ± 0.588
GlyLys: 3.922 ± 0.41
GlyLeu: 6.373 ± 1.432
GlyMet: 1.225 ± 0.698
GlyAsn: 1.471 ± 0.651
GlyPro: 2.206 ± 0.757
GlyGln: 2.206 ± 0.614
GlyArg: 2.941 ± 0.739
GlySer: 4.412 ± 1.025
GlyThr: 2.941 ± 1.773
GlyVal: 3.186 ± 0.797
GlyTrp: 1.716 ± 0.519
GlyTyr: 2.696 ± 1.158
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.98 ± 0.499
HisCys: 0.49 ± 0.33
HisAsp: 0.245 ± 0.143
HisGlu: 2.206 ± 1.295
HisPhe: 0.735 ± 0.263
HisGly: 1.716 ± 0.644
HisHis: 1.225 ± 0.962
HisIle: 1.716 ± 0.299
HisLys: 1.471 ± 0.395
HisLeu: 2.941 ± 0.874
HisMet: 0.49 ± 0.445
HisAsn: 0.98 ± 0.411
HisPro: 2.451 ± 0.73
HisGln: 1.716 ± 0.637
HisArg: 1.961 ± 0.794
HisSer: 2.451 ± 0.653
HisThr: 1.471 ± 0.481
HisVal: 1.716 ± 0.464
HisTrp: 0.49 ± 0.286
HisTyr: 1.961 ± 0.657
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.186 ± 0.899
IleCys: 2.941 ± 1.011
IleAsp: 5.637 ± 0.897
IleGlu: 3.922 ± 0.738
IlePhe: 2.451 ± 0.97
IleGly: 5.882 ± 1.476
IleHis: 0.98 ± 0.544
IleIle: 7.598 ± 0.685
IleLys: 3.922 ± 0.761
IleLeu: 6.127 ± 0.434
IleMet: 1.716 ± 0.768
IleAsn: 5.147 ± 1.434
IlePro: 6.127 ± 1.075
IleGln: 2.206 ± 0.919
IleArg: 3.922 ± 1.741
IleSer: 6.127 ± 1.754
IleThr: 4.167 ± 1.096
IleVal: 3.676 ± 0.402
IleTrp: 1.961 ± 0.516
IleTyr: 2.696 ± 0.929
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.451 ± 0.531
LysCys: 1.225 ± 0.326
LysAsp: 3.431 ± 1.252
LysGlu: 2.696 ± 0.353
LysPhe: 1.471 ± 0.493
LysGly: 2.696 ± 0.551
LysHis: 1.471 ± 0.526
LysIle: 4.902 ± 1.474
LysLys: 3.922 ± 0.725
LysLeu: 6.863 ± 1.501
LysMet: 1.471 ± 0.729
LysAsn: 1.961 ± 1.147
LysPro: 3.186 ± 0.595
LysGln: 0.98 ± 0.708
LysArg: 2.451 ± 1.066
LysSer: 5.392 ± 1.207
LysThr: 4.167 ± 0.617
LysVal: 3.676 ± 0.962
LysTrp: 1.961 ± 0.476
LysTyr: 2.451 ± 0.939
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.637 ± 1.209
LeuCys: 1.716 ± 0.644
LeuAsp: 4.657 ± 1.029
LeuGlu: 4.902 ± 0.988
LeuPhe: 4.902 ± 1.485
LeuGly: 5.882 ± 0.586
LeuHis: 1.961 ± 0.929
LeuIle: 7.353 ± 1.858
LeuLys: 3.186 ± 1.379
LeuLeu: 5.637 ± 1.453
LeuMet: 2.206 ± 0.858
LeuAsn: 5.147 ± 1.092
LeuPro: 3.676 ± 0.851
LeuGln: 4.657 ± 0.427
LeuArg: 6.373 ± 1.449
LeuSer: 6.127 ± 1.29
LeuThr: 6.127 ± 0.944
LeuVal: 5.882 ± 0.75
LeuTrp: 0.735 ± 0.351
LeuTyr: 2.206 ± 0.578
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.98 ± 0.411
MetCys: 0.0 ± 0.0
MetAsp: 0.98 ± 0.572
MetGlu: 1.471 ± 0.651
MetPhe: 1.471 ± 0.487
MetGly: 1.471 ± 0.831
MetHis: 0.245 ± 0.395
MetIle: 1.961 ± 0.693
MetLys: 0.98 ± 0.705
MetLeu: 2.941 ± 1.588
MetMet: 0.245 ± 0.143
MetAsn: 0.98 ± 0.294
MetPro: 0.245 ± 0.395
MetGln: 0.245 ± 0.143
MetArg: 1.716 ± 0.647
MetSer: 3.186 ± 0.798
MetThr: 1.471 ± 0.506
MetVal: 2.206 ± 0.92
MetTrp: 0.0 ± 0.0
MetTyr: 0.245 ± 0.299
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.451 ± 0.773
AsnCys: 0.49 ± 0.286
AsnAsp: 3.676 ± 1.142
AsnGlu: 1.471 ± 0.409
AsnPhe: 2.451 ± 0.653
AsnGly: 2.206 ± 0.495
AsnHis: 1.961 ± 0.642
AsnIle: 4.902 ± 0.508
AsnLys: 2.206 ± 0.495
AsnLeu: 4.657 ± 1.626
AsnMet: 0.98 ± 0.347
AsnAsn: 2.451 ± 0.787
AsnPro: 4.902 ± 1.742
AsnGln: 1.716 ± 0.682
AsnArg: 2.941 ± 1.181
AsnSer: 3.431 ± 1.393
AsnThr: 2.696 ± 0.574
AsnVal: 1.225 ± 0.461
AsnTrp: 0.735 ± 0.438
AsnTyr: 2.696 ± 0.845
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.941 ± 0.691
ProCys: 0.735 ± 0.375
ProAsp: 4.167 ± 0.546
ProGlu: 3.922 ± 0.327
ProPhe: 1.961 ± 0.693
ProGly: 3.922 ± 2.777
ProHis: 1.716 ± 0.492
ProIle: 4.167 ± 1.368
ProLys: 3.186 ± 1.579
ProLeu: 4.657 ± 0.768
ProMet: 1.225 ± 0.773
ProAsn: 2.941 ± 1.338
ProPro: 2.941 ± 0.766
ProGln: 0.735 ± 0.299
ProArg: 2.451 ± 0.706
ProSer: 4.167 ± 1.01
ProThr: 2.206 ± 1.063
ProVal: 4.167 ± 1.345
ProTrp: 0.49 ± 0.242
ProTyr: 1.471 ± 0.608
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.225 ± 0.521
GlnCys: 0.98 ± 0.347
GlnAsp: 1.961 ± 0.916
GlnGlu: 2.451 ± 1.499
GlnPhe: 1.716 ± 0.519
GlnGly: 2.696 ± 1.1
GlnHis: 1.225 ± 0.537
GlnIle: 3.922 ± 0.745
GlnLys: 2.696 ± 0.729
GlnLeu: 1.716 ± 0.513
GlnMet: 0.49 ± 0.242
GlnAsn: 0.735 ± 0.684
GlnPro: 1.716 ± 1.07
GlnGln: 0.735 ± 0.527
GlnArg: 1.225 ± 0.366
GlnSer: 3.186 ± 0.582
GlnThr: 1.716 ± 0.858
GlnVal: 3.186 ± 0.934
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.225 ± 0.526
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.941 ± 0.648
ArgCys: 1.716 ± 1.069
ArgAsp: 3.431 ± 0.624
ArgGlu: 2.941 ± 0.785
ArgPhe: 2.696 ± 0.768
ArgGly: 3.922 ± 0.939
ArgHis: 2.206 ± 0.951
ArgIle: 2.696 ± 0.637
ArgLys: 3.676 ± 1.079
ArgLeu: 2.451 ± 0.732
ArgMet: 0.98 ± 1.151
ArgAsn: 3.431 ± 0.675
ArgPro: 2.206 ± 0.495
ArgGln: 1.471 ± 0.703
ArgArg: 2.696 ± 0.845
ArgSer: 2.941 ± 0.934
ArgThr: 2.451 ± 0.905
ArgVal: 3.186 ± 0.18
ArgTrp: 0.49 ± 0.286
ArgTyr: 2.206 ± 0.845
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.922 ± 0.926
SerCys: 0.98 ± 0.347
SerAsp: 5.882 ± 1.792
SerGlu: 4.412 ± 0.758
SerPhe: 2.451 ± 0.444
SerGly: 4.167 ± 1.775
SerHis: 2.451 ± 0.718
SerIle: 7.108 ± 0.72
SerLys: 4.902 ± 0.73
SerLeu: 8.824 ± 1.669
SerMet: 0.98 ± 0.826
SerAsn: 5.147 ± 1.147
SerPro: 2.451 ± 0.428
SerGln: 3.186 ± 0.395
SerArg: 1.225 ± 0.494
SerSer: 7.353 ± 2.203
SerThr: 4.412 ± 0.634
SerVal: 3.922 ± 0.828
SerTrp: 1.716 ± 0.519
SerTyr: 3.922 ± 0.785
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.98 ± 0.532
ThrCys: 0.735 ± 0.429
ThrAsp: 3.431 ± 0.852
ThrGlu: 2.941 ± 1.331
ThrPhe: 2.696 ± 0.857
ThrGly: 2.451 ± 0.646
ThrHis: 2.206 ± 0.643
ThrIle: 7.353 ± 1.651
ThrLys: 2.696 ± 0.925
ThrLeu: 5.637 ± 0.552
ThrMet: 1.471 ± 0.652
ThrAsn: 2.696 ± 0.65
ThrPro: 2.941 ± 1.177
ThrGln: 2.941 ± 0.623
ThrArg: 2.941 ± 0.896
ThrSer: 6.373 ± 1.095
ThrThr: 3.676 ± 1.403
ThrVal: 3.431 ± 1.198
ThrTrp: 1.961 ± 0.54
ThrTyr: 2.206 ± 0.72
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.412 ± 1.194
ValCys: 0.49 ± 0.286
ValAsp: 3.922 ± 0.821
ValGlu: 2.696 ± 0.598
ValPhe: 2.206 ± 0.495
ValGly: 1.471 ± 0.446
ValHis: 1.471 ± 0.685
ValIle: 4.902 ± 0.794
ValLys: 3.676 ± 0.489
ValLeu: 4.167 ± 0.405
ValMet: 1.716 ± 0.768
ValAsn: 3.431 ± 0.608
ValPro: 3.186 ± 0.997
ValGln: 1.225 ± 0.425
ValArg: 2.206 ± 0.713
ValSer: 3.186 ± 0.623
ValThr: 4.412 ± 1.379
ValVal: 0.735 ± 0.525
ValTrp: 0.735 ± 0.263
ValTyr: 3.676 ± 0.792
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.735 ± 0.525
TrpCys: 0.245 ± 0.265
TrpAsp: 0.98 ± 0.572
TrpGlu: 1.471 ± 0.437
TrpPhe: 0.98 ± 0.294
TrpGly: 0.98 ± 0.347
TrpHis: 0.735 ± 0.525
TrpIle: 1.961 ± 0.753
TrpLys: 0.735 ± 0.401
TrpLeu: 0.0 ± 0.0
TrpMet: 0.735 ± 0.299
TrpAsn: 0.49 ± 0.286
TrpPro: 0.49 ± 0.286
TrpGln: 0.49 ± 0.242
TrpArg: 0.98 ± 0.572
TrpSer: 1.716 ± 0.72
TrpThr: 0.98 ± 0.662
TrpVal: 0.735 ± 0.299
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.98 ± 0.427
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.716 ± 0.637
TyrCys: 1.225 ± 1.007
TyrAsp: 1.961 ± 0.434
TyrGlu: 1.225 ± 0.436
TyrPhe: 1.961 ± 0.652
TyrGly: 1.471 ± 0.685
TyrHis: 1.471 ± 0.702
TyrIle: 3.676 ± 1.593
TyrLys: 3.186 ± 0.473
TyrLeu: 4.902 ± 0.754
TyrMet: 1.225 ± 0.527
TyrAsn: 2.941 ± 1.878
TyrPro: 2.696 ± 0.728
TyrGln: 1.716 ± 1.227
TyrArg: 1.716 ± 0.95
TyrSer: 1.961 ± 0.471
TyrThr: 1.961 ± 0.672
TyrVal: 0.98 ± 1.196
TyrTrp: 0.735 ± 0.299
TyrTyr: 1.225 ± 0.73
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4081 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
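One way to picture bootstrapping at the protein level is the Python sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the use of the standard deviation as the reported error, the overlapping-pair counting and the fixed seed are all assumptions made for illustration; the page only states that 100 bootstrap rounds were used.

```python
import random
import statistics


def dipeptide_permille_single(proteins, dipeptide):
    """Per mille frequency of one dipeptide, counting overlapping pairs per protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the standard deviation over the resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["ACACAC", "CCCCAC", "AAAAAA"]  # toy sequences, not real data
    print(dipeptide_permille_single(toy, "AC"), bootstrap_error(toy, "AC"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so with only a few proteins the estimated error can be large relative to the frequency itself.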

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski