Amino acid dipepetide frequency for Drosophila melanogaster birnavirus SW-2009a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.847AlaAla: 4.847 ± 2.978
1.454AlaCys: 1.454 ± 0.441
4.847AlaAsp: 4.847 ± 1.122
0.969AlaGlu: 0.969 ± 0.714
2.908AlaPhe: 2.908 ± 0.485
2.908AlaGly: 2.908 ± 1.366
1.454AlaHis: 1.454 ± 1.087
0.969AlaIle: 0.969 ± 0.714
4.363AlaLys: 4.363 ± 0.758
6.786AlaLeu: 6.786 ± 1.045
2.908AlaMet: 2.908 ± 0.883
3.878AlaAsn: 3.878 ± 0.531
0.485AlaPro: 0.485 ± 0.357
5.332AlaGln: 5.332 ± 0.906
2.424AlaArg: 2.424 ± 1.034
8.24AlaSer: 8.24 ± 0.599
2.908AlaThr: 2.908 ± 0.773
5.332AlaVal: 5.332 ± 1.287
0.0AlaTrp: 0.0 ± 0.0
1.454AlaTyr: 1.454 ± 1.087
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.485CysCys: 0.485 ± 0.357
0.485CysAsp: 0.485 ± 0.335
0.0CysGlu: 0.0 ± 0.0
0.485CysPhe: 0.485 ± 0.357
0.485CysGly: 0.485 ± 0.357
0.0CysHis: 0.0 ± 0.0
1.454CysIle: 1.454 ± 1.071
0.485CysLys: 0.485 ± 0.357
0.485CysLeu: 0.485 ± 1.003
0.969CysMet: 0.969 ± 0.275
0.0CysAsn: 0.0 ± 0.0
0.485CysPro: 0.485 ± 0.357
0.485CysGln: 0.485 ± 1.003
0.485CysArg: 0.485 ± 0.357
0.485CysSer: 0.485 ± 1.003
0.969CysThr: 0.969 ± 0.162
0.969CysVal: 0.969 ± 0.67
0.0CysTrp: 0.0 ± 0.0
0.485CysTyr: 0.485 ± 0.335
0.0CysXaa: 0.0 ± 0.0
Asp
3.878AspAla: 3.878 ± 0.647
0.485AspCys: 0.485 ± 0.357
2.908AspAsp: 2.908 ± 0.485
1.939AspGlu: 1.939 ± 0.323
2.424AspPhe: 2.424 ± 0.561
2.424AspGly: 2.424 ± 1.138
0.0AspHis: 0.0 ± 0.0
1.454AspIle: 1.454 ± 0.441
3.393AspLys: 3.393 ± 1.635
7.271AspLeu: 7.271 ± 1.471
1.454AspMet: 1.454 ± 1.071
2.908AspAsn: 2.908 ± 0.485
3.393AspPro: 3.393 ± 0.697
2.424AspGln: 2.424 ± 0.952
2.424AspArg: 2.424 ± 0.561
3.393AspSer: 3.393 ± 0.697
2.908AspThr: 2.908 ± 0.68
3.393AspVal: 3.393 ± 2.345
0.485AspTrp: 0.485 ± 0.335
4.847AspTyr: 4.847 ± 0.809
0.0AspXaa: 0.0 ± 0.0
Glu
4.363GluAla: 4.363 ± 1.324
0.0GluCys: 0.0 ± 0.0
4.363GluAsp: 4.363 ± 2.104
3.878GluGlu: 3.878 ± 1.78
1.939GluPhe: 1.939 ± 1.427
3.878GluGly: 3.878 ± 2.68
0.485GluHis: 0.485 ± 0.357
3.393GluIle: 3.393 ± 1.087
4.363GluLys: 4.363 ± 1.445
5.332GluLeu: 5.332 ± 0.775
0.969GluMet: 0.969 ± 0.162
3.393GluAsn: 3.393 ± 0.617
1.454GluPro: 1.454 ± 0.387
2.424GluGln: 2.424 ± 2.013
2.908GluArg: 2.908 ± 0.68
3.393GluSer: 3.393 ± 2.754
5.332GluThr: 5.332 ± 1.435
5.332GluVal: 5.332 ± 1.65
0.969GluTrp: 0.969 ± 0.162
1.454GluTyr: 1.454 ± 0.387
0.0GluXaa: 0.0 ± 0.0
Phe
0.485PheAla: 0.485 ± 0.335
0.485PheCys: 0.485 ± 1.003
0.0PheAsp: 0.0 ± 0.0
1.454PheGlu: 1.454 ± 0.441
0.0PhePhe: 0.0 ± 0.0
1.939PheGly: 1.939 ± 0.786
0.485PheHis: 0.485 ± 0.357
1.454PheIle: 1.454 ± 1.071
1.939PheLys: 1.939 ± 0.323
3.393PheLeu: 3.393 ± 2.498
0.969PheMet: 0.969 ± 0.714
0.969PheAsn: 0.969 ± 0.67
4.363PhePro: 4.363 ± 0.758
1.454PheGln: 1.454 ± 1.071
0.485PheArg: 0.485 ± 0.335
1.939PheSer: 1.939 ± 0.786
1.454PheThr: 1.454 ± 1.071
0.485PheVal: 0.485 ± 0.335
0.0PheTrp: 0.0 ± 0.0
2.908PheTyr: 2.908 ± 0.883
0.0PheXaa: 0.0 ± 0.0
Gly
3.393GlyAla: 3.393 ± 0.617
0.969GlyCys: 0.969 ± 0.162
1.939GlyAsp: 1.939 ± 0.786
2.424GlyGlu: 2.424 ± 0.561
1.454GlyPhe: 1.454 ± 0.441
1.939GlyGly: 1.939 ± 0.323
1.454GlyHis: 1.454 ± 1.005
4.847GlyIle: 4.847 ± 2.069
3.878GlyLys: 3.878 ± 0.866
4.847GlyLeu: 4.847 ± 1.471
0.969GlyMet: 0.969 ± 0.67
3.878GlyAsn: 3.878 ± 0.647
4.847GlyPro: 4.847 ± 1.643
3.393GlyGln: 3.393 ± 0.697
1.939GlyArg: 1.939 ± 0.705
2.908GlySer: 2.908 ± 1.366
3.878GlyThr: 3.878 ± 0.866
6.786GlyVal: 6.786 ± 2.174
0.969GlyTrp: 0.969 ± 0.714
2.908GlyTyr: 2.908 ± 0.883
0.0GlyXaa: 0.0 ± 0.0
His
1.454HisAla: 1.454 ± 1.071
0.485HisCys: 0.485 ± 0.335
0.485HisAsp: 0.485 ± 0.335
1.454HisGlu: 1.454 ± 1.071
0.485HisPhe: 0.485 ± 0.335
0.969HisGly: 0.969 ± 0.714
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.969HisLys: 0.969 ± 0.714
1.939HisLeu: 1.939 ± 0.832
0.485HisMet: 0.485 ± 0.357
2.424HisAsn: 2.424 ± 1.034
0.485HisPro: 0.485 ± 0.335
1.454HisGln: 1.454 ± 0.842
1.454HisArg: 1.454 ± 0.441
1.939HisSer: 1.939 ± 1.845
0.969HisThr: 0.969 ± 0.714
0.0HisVal: 0.0 ± 0.0
0.485HisTrp: 0.485 ± 0.357
0.969HisTyr: 0.969 ± 0.162
0.0HisXaa: 0.0 ± 0.0
Ile
4.847IleAla: 4.847 ± 1.77
0.969IleCys: 0.969 ± 0.99
3.878IleAsp: 3.878 ± 1.41
3.878IleGlu: 3.878 ± 0.996
1.454IlePhe: 1.454 ± 0.441
1.939IleGly: 1.939 ± 1.34
2.424IleHis: 2.424 ± 0.561
1.939IleIle: 1.939 ± 0.834
5.817IleLys: 5.817 ± 2.116
3.393IleLeu: 3.393 ± 1.642
1.454IleMet: 1.454 ± 0.441
3.878IleAsn: 3.878 ± 0.866
4.847IlePro: 4.847 ± 0.978
0.485IleGln: 0.485 ± 0.357
5.332IleArg: 5.332 ± 0.906
4.363IleSer: 4.363 ± 0.843
6.302IleThr: 6.302 ± 3.268
4.847IleVal: 4.847 ± 0.809
0.0IleTrp: 0.0 ± 0.0
3.393IleTyr: 3.393 ± 1.087
0.0IleXaa: 0.0 ± 0.0
Lys
3.393LysAla: 3.393 ± 0.828
0.0LysCys: 0.0 ± 0.0
3.393LysAsp: 3.393 ± 1.224
3.393LysGlu: 3.393 ± 0.697
1.454LysPhe: 1.454 ± 1.071
4.847LysGly: 4.847 ± 1.11
0.969LysHis: 0.969 ± 0.714
5.332LysIle: 5.332 ± 0.906
4.847LysLys: 4.847 ± 2.436
6.302LysLeu: 6.302 ± 3.341
1.454LysMet: 1.454 ± 0.441
2.424LysAsn: 2.424 ± 1.034
3.878LysPro: 3.878 ± 2.204
2.908LysGln: 2.908 ± 2.778
2.424LysArg: 2.424 ± 0.561
4.847LysSer: 4.847 ± 0.809
4.363LysThr: 4.363 ± 0.843
0.969LysVal: 0.969 ± 0.162
0.969LysTrp: 0.969 ± 0.162
3.393LysTyr: 3.393 ± 0.617
0.0LysXaa: 0.0 ± 0.0
Leu
9.695LeuAla: 9.695 ± 1.578
0.0LeuCys: 0.0 ± 0.0
3.878LeuAsp: 3.878 ± 1.14
6.786LeuGlu: 6.786 ± 1.792
2.424LeuPhe: 2.424 ± 0.682
7.271LeuGly: 7.271 ± 1.213
1.939LeuHis: 1.939 ± 0.786
3.878LeuIle: 3.878 ± 1.699
4.847LeuLys: 4.847 ± 1.11
5.817LeuLeu: 5.817 ± 1.398
2.424LeuMet: 2.424 ± 0.492
3.878LeuAsn: 3.878 ± 0.581
4.363LeuPro: 4.363 ± 0.843
3.878LeuGln: 3.878 ± 0.996
5.332LeuArg: 5.332 ± 2.712
5.817LeuSer: 5.817 ± 2.309
6.786LeuThr: 6.786 ± 2.327
5.332LeuVal: 5.332 ± 1.791
1.454LeuTrp: 1.454 ± 0.842
2.908LeuTyr: 2.908 ± 0.773
0.0LeuXaa: 0.0 ± 0.0
Met
1.939MetAla: 1.939 ± 0.323
0.969MetCys: 0.969 ± 0.162
2.908MetAsp: 2.908 ± 0.883
3.393MetGlu: 3.393 ± 0.697
0.485MetPhe: 0.485 ± 0.357
1.454MetGly: 1.454 ± 0.441
0.0MetHis: 0.0 ± 0.0
0.969MetIle: 0.969 ± 0.67
2.424MetLys: 2.424 ± 0.952
1.454MetLeu: 1.454 ± 0.387
0.969MetMet: 0.969 ± 0.973
1.454MetAsn: 1.454 ± 0.387
1.939MetPro: 1.939 ± 0.832
1.454MetGln: 1.454 ± 0.441
0.969MetArg: 0.969 ± 0.973
1.454MetSer: 1.454 ± 0.842
0.969MetThr: 0.969 ± 0.973
0.485MetVal: 0.485 ± 0.335
0.485MetTrp: 0.485 ± 0.357
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.393AsnAla: 3.393 ± 0.522
0.969AsnCys: 0.969 ± 0.162
1.454AsnAsp: 1.454 ± 1.005
3.393AsnGlu: 3.393 ± 1.087
0.969AsnPhe: 0.969 ± 0.162
2.424AsnGly: 2.424 ± 1.034
0.969AsnHis: 0.969 ± 0.973
5.332AsnIle: 5.332 ± 0.994
1.454AsnLys: 1.454 ± 0.387
4.847AsnLeu: 4.847 ± 2.45
1.454AsnMet: 1.454 ± 0.441
3.393AsnAsn: 3.393 ± 1.415
3.878AsnPro: 3.878 ± 0.866
2.424AsnGln: 2.424 ± 1.034
3.878AsnArg: 3.878 ± 2.033
1.454AsnSer: 1.454 ± 0.387
4.363AsnThr: 4.363 ± 0.811
1.939AsnVal: 1.939 ± 0.705
0.485AsnTrp: 0.485 ± 0.357
1.939AsnTyr: 1.939 ± 1.34
0.0AsnXaa: 0.0 ± 0.0
Pro
3.393ProAla: 3.393 ± 1.087
0.0ProCys: 0.0 ± 0.0
2.908ProAsp: 2.908 ± 0.883
5.332ProGlu: 5.332 ± 0.906
1.939ProPhe: 1.939 ± 0.323
3.878ProGly: 3.878 ± 1.41
0.969ProHis: 0.969 ± 0.99
1.939ProIle: 1.939 ± 0.834
3.878ProLys: 3.878 ± 2.204
2.908ProLeu: 2.908 ± 0.68
0.969ProMet: 0.969 ± 0.67
3.393ProAsn: 3.393 ± 1.904
8.725ProPro: 8.725 ± 4.662
1.454ProGln: 1.454 ± 1.087
5.332ProArg: 5.332 ± 0.775
4.847ProSer: 4.847 ± 1.471
4.847ProThr: 4.847 ± 0.809
2.908ProVal: 2.908 ± 0.699
0.0ProTrp: 0.0 ± 0.0
1.454ProTyr: 1.454 ± 0.441
0.0ProXaa: 0.0 ± 0.0
Gln
3.393GlnAla: 3.393 ± 1.848
0.969GlnCys: 0.969 ± 0.714
2.424GlnAsp: 2.424 ± 0.952
4.847GlnGlu: 4.847 ± 2.473
0.0GlnPhe: 0.0 ± 0.0
2.424GlnGly: 2.424 ± 0.561
1.454GlnHis: 1.454 ± 0.387
1.454GlnIle: 1.454 ± 1.069
2.424GlnLys: 2.424 ± 0.561
4.363GlnLeu: 4.363 ± 0.365
0.969GlnMet: 0.969 ± 0.67
1.939GlnAsn: 1.939 ± 0.323
0.0GlnPro: 0.0 ± 0.0
0.969GlnGln: 0.969 ± 0.714
2.424GlnArg: 2.424 ± 0.561
5.332GlnSer: 5.332 ± 2.732
2.908GlnThr: 2.908 ± 2.138
2.908GlnVal: 2.908 ± 0.773
0.485GlnTrp: 0.485 ± 0.357
1.939GlnTyr: 1.939 ± 0.705
0.0GlnXaa: 0.0 ± 0.0
Arg
0.485ArgAla: 0.485 ± 1.003
0.485ArgCys: 0.485 ± 0.357
3.878ArgAsp: 3.878 ± 0.647
2.424ArgGlu: 2.424 ± 1.803
2.424ArgPhe: 2.424 ± 0.561
4.363ArgGly: 4.363 ± 1.16
2.424ArgHis: 2.424 ± 1.138
5.332ArgIle: 5.332 ± 0.906
2.908ArgLys: 2.908 ± 0.883
4.363ArgLeu: 4.363 ± 0.758
2.424ArgMet: 2.424 ± 1.492
2.424ArgAsn: 2.424 ± 0.561
1.454ArgPro: 1.454 ± 0.441
2.424ArgGln: 2.424 ± 0.561
1.454ArgArg: 1.454 ± 0.842
2.908ArgSer: 2.908 ± 1.366
3.878ArgThr: 3.878 ± 1.064
1.454ArgVal: 1.454 ± 0.441
0.0ArgTrp: 0.0 ± 0.0
3.393ArgTyr: 3.393 ± 0.876
0.0ArgXaa: 0.0 ± 0.0
Ser
5.332SerAla: 5.332 ± 2.625
0.485SerCys: 0.485 ± 1.003
2.908SerAsp: 2.908 ± 1.159
3.878SerGlu: 3.878 ± 1.663
0.969SerPhe: 0.969 ± 0.162
6.302SerGly: 6.302 ± 1.856
1.454SerHis: 1.454 ± 1.069
5.817SerIle: 5.817 ± 2.732
4.847SerLys: 4.847 ± 3.569
8.24SerLeu: 8.24 ± 1.886
2.908SerMet: 2.908 ± 2.778
1.939SerAsn: 1.939 ± 1.34
3.878SerPro: 3.878 ± 2.814
1.939SerGln: 1.939 ± 0.832
1.939SerArg: 1.939 ± 0.705
6.786SerSer: 6.786 ± 0.447
5.332SerThr: 5.332 ± 1.334
2.424SerVal: 2.424 ± 1.138
1.454SerTrp: 1.454 ± 0.441
3.393SerTyr: 3.393 ± 1.642
0.0SerXaa: 0.0 ± 0.0
Thr
4.847ThrAla: 4.847 ± 2.702
0.0ThrCys: 0.0 ± 0.0
2.908ThrAsp: 2.908 ± 0.485
3.393ThrGlu: 3.393 ± 1.635
2.424ThrPhe: 2.424 ± 1.784
3.878ThrGly: 3.878 ± 1.41
1.939ThrHis: 1.939 ± 1.427
9.21ThrIle: 9.21 ± 0.672
4.847ThrLys: 4.847 ± 1.664
5.817ThrLeu: 5.817 ± 1.36
0.485ThrMet: 0.485 ± 0.335
2.424ThrAsn: 2.424 ± 0.964
6.302ThrPro: 6.302 ± 0.577
4.363ThrGln: 4.363 ± 1.16
3.878ThrArg: 3.878 ± 0.581
6.302ThrSer: 6.302 ± 3.31
11.149ThrThr: 11.149 ± 2.936
1.454ThrVal: 1.454 ± 1.005
0.969ThrTrp: 0.969 ± 0.714
2.424ThrTyr: 2.424 ± 0.561
0.0ThrXaa: 0.0 ± 0.0
Val
2.908ValAla: 2.908 ± 2.01
0.485ValCys: 0.485 ± 0.357
3.878ValAsp: 3.878 ± 0.996
1.454ValGlu: 1.454 ± 0.387
0.969ValPhe: 0.969 ± 0.162
3.393ValGly: 3.393 ± 0.617
0.485ValHis: 0.485 ± 0.335
5.332ValIle: 5.332 ± 1.954
1.939ValLys: 1.939 ± 1.427
5.332ValLeu: 5.332 ± 1.25
0.969ValMet: 0.969 ± 0.162
1.454ValAsn: 1.454 ± 0.441
3.878ValPro: 3.878 ± 2.037
2.908ValGln: 2.908 ± 0.485
3.393ValArg: 3.393 ± 0.617
1.939ValSer: 1.939 ± 1.34
4.847ValThr: 4.847 ± 2.069
4.363ValVal: 4.363 ± 1.738
0.485ValTrp: 0.485 ± 0.335
2.908ValTyr: 2.908 ± 0.699
0.0ValXaa: 0.0 ± 0.0
Trp
0.969TrpAla: 0.969 ± 0.67
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.485TrpPhe: 0.485 ± 0.357
0.485TrpGly: 0.485 ± 0.357
0.0TrpHis: 0.0 ± 0.0
1.939TrpIle: 1.939 ± 0.323
0.0TrpLys: 0.0 ± 0.0
1.454TrpLeu: 1.454 ± 0.441
0.0TrpMet: 0.0 ± 0.0
0.485TrpAsn: 0.485 ± 0.357
0.969TrpPro: 0.969 ± 0.162
0.485TrpGln: 0.485 ± 0.357
0.0TrpArg: 0.0 ± 0.0
1.454TrpSer: 1.454 ± 1.069
0.485TrpThr: 0.485 ± 0.357
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.485TrpTyr: 0.485 ± 0.357
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.454TyrAla: 1.454 ± 0.387
0.0TyrCys: 0.0 ± 0.0
4.363TyrAsp: 4.363 ± 0.718
5.332TyrGlu: 5.332 ± 1.436
0.969TyrPhe: 0.969 ± 0.714
2.424TyrGly: 2.424 ± 1.675
0.0TyrHis: 0.0 ± 0.0
3.393TyrIle: 3.393 ± 0.697
1.939TyrLys: 1.939 ± 0.705
4.847TyrLeu: 4.847 ± 1.122
0.485TyrMet: 0.485 ± 0.335
3.878TyrAsn: 3.878 ± 0.866
1.454TyrPro: 1.454 ± 0.387
0.969TyrGln: 0.969 ± 0.99
2.424TyrArg: 2.424 ± 1.138
2.424TyrSer: 2.424 ± 0.964
4.363TyrThr: 4.363 ± 1.999
1.939TyrVal: 1.939 ± 0.323
0.0TyrTrp: 0.0 ± 0.0
2.424TyrTyr: 2.424 ± 1.516
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2064 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski