Amino acid dipepetide frequency for Drosophila busckii rhabdovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.848AlaAla: 2.848 ± 1.381
0.712AlaCys: 0.712 ± 0.338
1.424AlaAsp: 1.424 ± 0.326
1.899AlaGlu: 1.899 ± 0.874
1.187AlaPhe: 1.187 ± 0.561
4.035AlaGly: 4.035 ± 0.714
1.424AlaHis: 1.424 ± 0.196
3.56AlaIle: 3.56 ± 1.397
2.611AlaLys: 2.611 ± 0.933
6.883AlaLeu: 6.883 ± 1.091
0.712AlaMet: 0.712 ± 0.398
1.899AlaAsn: 1.899 ± 0.51
3.086AlaPro: 3.086 ± 0.769
1.424AlaGln: 1.424 ± 0.709
2.611AlaArg: 2.611 ± 0.899
4.985AlaSer: 4.985 ± 0.676
3.798AlaThr: 3.798 ± 1.317
1.899AlaVal: 1.899 ± 0.502
0.712AlaTrp: 0.712 ± 0.472
0.949AlaTyr: 0.949 ± 0.361
0.0AlaXaa: 0.0 ± 0.0
Cys
0.712CysAla: 0.712 ± 0.431
0.475CysCys: 0.475 ± 0.265
1.899CysAsp: 1.899 ± 0.587
0.712CysGlu: 0.712 ± 0.218
1.187CysPhe: 1.187 ± 0.517
0.475CysGly: 0.475 ± 0.236
0.237CysHis: 0.237 ± 0.133
0.475CysIle: 0.475 ± 0.333
0.949CysLys: 0.949 ± 0.375
1.662CysLeu: 1.662 ± 0.461
0.949CysMet: 0.949 ± 0.361
0.949CysAsn: 0.949 ± 0.517
1.424CysPro: 1.424 ± 0.404
0.475CysGln: 0.475 ± 0.276
1.187CysArg: 1.187 ± 0.256
1.424CysSer: 1.424 ± 0.795
0.949CysThr: 0.949 ± 0.53
0.949CysVal: 0.949 ± 0.686
0.0CysTrp: 0.0 ± 0.0
0.237CysTyr: 0.237 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
0.949AspAla: 0.949 ± 0.53
0.475AspCys: 0.475 ± 0.552
2.611AspAsp: 2.611 ± 0.414
2.374AspGlu: 2.374 ± 1.558
1.424AspPhe: 1.424 ± 0.326
3.086AspGly: 3.086 ± 0.762
0.949AspHis: 0.949 ± 0.484
2.374AspIle: 2.374 ± 0.727
3.56AspLys: 3.56 ± 0.726
5.697AspLeu: 5.697 ± 0.665
1.187AspMet: 1.187 ± 0.47
1.662AspAsn: 1.662 ± 0.5
5.697AspPro: 5.697 ± 1.159
2.374AspGln: 2.374 ± 0.347
2.374AspArg: 2.374 ± 0.707
1.899AspSer: 1.899 ± 0.488
4.747AspThr: 4.747 ± 1.117
1.187AspVal: 1.187 ± 0.334
0.475AspTrp: 0.475 ± 0.265
2.136AspTyr: 2.136 ± 0.58
0.0AspXaa: 0.0 ± 0.0
Glu
4.51GluAla: 4.51 ± 0.623
1.662GluCys: 1.662 ± 0.409
3.56GluAsp: 3.56 ± 1.608
2.848GluGlu: 2.848 ± 0.86
1.899GluPhe: 1.899 ± 0.885
5.222GluGly: 5.222 ± 0.658
0.475GluHis: 0.475 ± 0.623
3.798GluIle: 3.798 ± 0.818
3.086GluLys: 3.086 ± 0.275
4.51GluLeu: 4.51 ± 0.579
2.136GluMet: 2.136 ± 0.654
3.798GluAsn: 3.798 ± 0.787
1.662GluPro: 1.662 ± 1.001
1.899GluGln: 1.899 ± 0.782
3.086GluArg: 3.086 ± 0.748
3.323GluSer: 3.323 ± 0.419
3.56GluThr: 3.56 ± 0.482
3.086GluVal: 3.086 ± 0.304
1.899GluTrp: 1.899 ± 0.912
3.56GluTyr: 3.56 ± 0.639
0.0GluXaa: 0.0 ± 0.0
Phe
1.899PheAla: 1.899 ± 0.538
0.712PheCys: 0.712 ± 0.398
1.187PheAsp: 1.187 ± 0.56
1.899PheGlu: 1.899 ± 0.769
1.899PhePhe: 1.899 ± 0.587
2.374PheGly: 2.374 ± 0.416
1.899PheHis: 1.899 ± 0.587
2.374PheIle: 2.374 ± 0.347
2.136PheLys: 2.136 ± 0.654
4.985PheLeu: 4.985 ± 1.039
0.949PheMet: 0.949 ± 0.334
1.899PheAsn: 1.899 ± 0.812
3.086PhePro: 3.086 ± 0.601
2.611PheGln: 2.611 ± 0.872
2.374PheArg: 2.374 ± 0.586
4.985PheSer: 4.985 ± 1.034
1.187PheThr: 1.187 ± 0.475
1.187PheVal: 1.187 ± 0.4
0.475PheTrp: 0.475 ± 0.236
0.237PheTyr: 0.237 ± 0.311
0.0PheXaa: 0.0 ± 0.0
Gly
3.086GlyAla: 3.086 ± 0.77
1.187GlyCys: 1.187 ± 0.463
1.899GlyAsp: 1.899 ± 0.615
2.611GlyGlu: 2.611 ± 0.415
3.323GlyPhe: 3.323 ± 0.406
2.611GlyGly: 2.611 ± 0.544
1.424GlyHis: 1.424 ± 0.326
4.747GlyIle: 4.747 ± 0.412
2.611GlyLys: 2.611 ± 0.42
4.747GlyLeu: 4.747 ± 0.688
1.187GlyMet: 1.187 ± 0.569
1.662GlyAsn: 1.662 ± 0.879
2.848GlyPro: 2.848 ± 1.768
0.475GlyGln: 0.475 ± 0.265
3.56GlyArg: 3.56 ± 0.624
3.086GlySer: 3.086 ± 0.77
4.747GlyThr: 4.747 ± 0.535
3.56GlyVal: 3.56 ± 0.802
1.899GlyTrp: 1.899 ± 0.541
3.086GlyTyr: 3.086 ± 0.637
0.0GlyXaa: 0.0 ± 0.0
His
1.187HisAla: 1.187 ± 0.345
0.0HisCys: 0.0 ± 0.0
1.187HisAsp: 1.187 ± 0.378
0.949HisGlu: 0.949 ± 0.246
0.949HisPhe: 0.949 ± 0.396
1.424HisGly: 1.424 ± 0.39
1.187HisHis: 1.187 ± 0.345
1.424HisIle: 1.424 ± 0.588
0.475HisLys: 0.475 ± 0.5
3.798HisLeu: 3.798 ± 1.027
0.475HisMet: 0.475 ± 0.497
1.899HisAsn: 1.899 ± 0.782
2.374HisPro: 2.374 ± 0.581
1.424HisGln: 1.424 ± 0.588
1.424HisArg: 1.424 ± 0.795
1.662HisSer: 1.662 ± 0.599
0.237HisThr: 0.237 ± 0.133
1.662HisVal: 1.662 ± 0.477
0.949HisTrp: 0.949 ± 0.245
0.237HisTyr: 0.237 ± 0.133
0.0HisXaa: 0.0 ± 0.0
Ile
3.56IleAla: 3.56 ± 0.707
1.187IleCys: 1.187 ± 0.475
3.086IleAsp: 3.086 ± 1.403
4.747IleGlu: 4.747 ± 0.606
2.136IlePhe: 2.136 ± 0.689
3.56IleGly: 3.56 ± 0.666
1.424IleHis: 1.424 ± 0.676
3.798IleIle: 3.798 ± 0.782
3.56IleLys: 3.56 ± 0.833
6.646IleLeu: 6.646 ± 1.009
1.424IleMet: 1.424 ± 0.518
5.222IleAsn: 5.222 ± 0.605
4.272IlePro: 4.272 ± 0.913
3.323IleGln: 3.323 ± 0.522
5.459IleArg: 5.459 ± 0.734
6.883IleSer: 6.883 ± 0.952
4.51IleThr: 4.51 ± 0.88
2.374IleVal: 2.374 ± 0.553
1.424IleTrp: 1.424 ± 0.692
2.848IleTyr: 2.848 ± 0.386
0.0IleXaa: 0.0 ± 0.0
Lys
2.136LysAla: 2.136 ± 0.658
1.424LysCys: 1.424 ± 0.535
2.611LysAsp: 2.611 ± 0.635
4.272LysGlu: 4.272 ± 0.627
2.848LysPhe: 2.848 ± 0.621
4.035LysGly: 4.035 ± 1.13
0.475LysHis: 0.475 ± 0.276
5.222LysIle: 5.222 ± 1.657
2.611LysLys: 2.611 ± 1.354
4.985LysLeu: 4.985 ± 1.143
2.848LysMet: 2.848 ± 1.266
4.035LysAsn: 4.035 ± 0.463
1.424LysPro: 1.424 ± 0.594
3.086LysGln: 3.086 ± 1.452
2.848LysArg: 2.848 ± 0.798
3.56LysSer: 3.56 ± 0.72
2.848LysThr: 2.848 ± 0.678
3.086LysVal: 3.086 ± 0.467
1.424LysTrp: 1.424 ± 0.972
1.899LysTyr: 1.899 ± 0.591
0.0LysXaa: 0.0 ± 0.0
Leu
3.56LeuAla: 3.56 ± 0.63
0.949LeuCys: 0.949 ± 0.53
5.459LeuAsp: 5.459 ± 0.828
6.883LeuGlu: 6.883 ± 1.335
3.798LeuPhe: 3.798 ± 1.243
4.747LeuGly: 4.747 ± 0.915
1.662LeuHis: 1.662 ± 0.712
7.833LeuIle: 7.833 ± 0.727
6.646LeuLys: 6.646 ± 1.092
9.732LeuLeu: 9.732 ± 1.157
2.611LeuMet: 2.611 ± 0.618
4.747LeuAsn: 4.747 ± 0.35
4.51LeuPro: 4.51 ± 0.566
3.56LeuGln: 3.56 ± 0.649
4.747LeuArg: 4.747 ± 0.95
13.055LeuSer: 13.055 ± 1.128
6.409LeuThr: 6.409 ± 0.671
4.747LeuVal: 4.747 ± 0.334
1.187LeuTrp: 1.187 ± 0.345
3.086LeuTyr: 3.086 ± 0.601
0.0LeuXaa: 0.0 ± 0.0
Met
1.899MetAla: 1.899 ± 0.791
0.712MetCys: 0.712 ± 0.338
0.712MetAsp: 0.712 ± 0.218
2.611MetGlu: 2.611 ± 0.8
0.475MetPhe: 0.475 ± 0.265
0.949MetGly: 0.949 ± 0.472
0.237MetHis: 0.237 ± 0.267
2.374MetIle: 2.374 ± 0.567
3.086MetLys: 3.086 ± 0.487
2.136MetLeu: 2.136 ± 0.728
0.712MetMet: 0.712 ± 0.419
0.475MetAsn: 0.475 ± 0.552
0.475MetPro: 0.475 ± 0.265
0.712MetGln: 0.712 ± 0.302
1.899MetArg: 1.899 ± 0.387
1.899MetSer: 1.899 ± 0.545
1.424MetThr: 1.424 ± 0.196
1.187MetVal: 1.187 ± 0.476
0.712MetTrp: 0.712 ± 0.609
0.712MetTyr: 0.712 ± 0.486
0.0MetXaa: 0.0 ± 0.0
Asn
1.899AsnAla: 1.899 ± 1.533
0.712AsnCys: 0.712 ± 0.302
2.374AsnAsp: 2.374 ± 0.768
1.662AsnGlu: 1.662 ± 0.548
3.086AsnPhe: 3.086 ± 0.862
1.187AsnGly: 1.187 ± 0.563
1.187AsnHis: 1.187 ± 0.657
2.374AsnIle: 2.374 ± 0.59
2.848AsnLys: 2.848 ± 0.606
7.121AsnLeu: 7.121 ± 1.083
0.475AsnMet: 0.475 ± 0.533
2.374AsnAsn: 2.374 ± 0.416
4.985AsnPro: 4.985 ± 1.612
2.374AsnGln: 2.374 ± 0.913
1.424AsnArg: 1.424 ± 0.319
3.086AsnSer: 3.086 ± 0.974
3.086AsnThr: 3.086 ± 0.47
1.424AsnVal: 1.424 ± 0.795
0.712AsnTrp: 0.712 ± 0.624
3.086AsnTyr: 3.086 ± 0.666
0.0AsnXaa: 0.0 ± 0.0
Pro
2.611ProAla: 2.611 ± 0.619
1.662ProCys: 1.662 ± 0.483
2.374ProAsp: 2.374 ± 0.746
4.035ProGlu: 4.035 ± 0.73
2.374ProPhe: 2.374 ± 0.51
2.848ProGly: 2.848 ± 0.691
1.899ProHis: 1.899 ± 0.267
3.56ProIle: 3.56 ± 0.648
3.086ProLys: 3.086 ± 0.76
6.171ProLeu: 6.171 ± 0.879
0.949ProMet: 0.949 ± 0.507
2.374ProAsn: 2.374 ± 1.062
5.934ProPro: 5.934 ± 0.885
2.374ProGln: 2.374 ± 0.759
3.56ProArg: 3.56 ± 1.844
6.409ProSer: 6.409 ± 0.559
4.51ProThr: 4.51 ± 1.026
4.272ProVal: 4.272 ± 0.999
0.475ProTrp: 0.475 ± 0.392
2.611ProTyr: 2.611 ± 0.786
0.0ProXaa: 0.0 ± 0.0
Gln
1.662GlnAla: 1.662 ± 0.7
0.712GlnCys: 0.712 ± 0.218
2.136GlnAsp: 2.136 ± 0.728
3.086GlnGlu: 3.086 ± 1.138
1.899GlnPhe: 1.899 ± 0.438
3.56GlnGly: 3.56 ± 1.06
0.949GlnHis: 0.949 ± 0.375
3.323GlnIle: 3.323 ± 1.269
2.374GlnLys: 2.374 ± 0.965
3.798GlnLeu: 3.798 ± 1.118
1.187GlnMet: 1.187 ± 0.423
0.949GlnAsn: 0.949 ± 0.553
0.949GlnPro: 0.949 ± 0.785
0.949GlnGln: 0.949 ± 0.361
1.662GlnArg: 1.662 ± 0.642
3.323GlnSer: 3.323 ± 0.907
1.899GlnThr: 1.899 ± 0.483
1.662GlnVal: 1.662 ± 0.577
0.949GlnTrp: 0.949 ± 0.53
1.187GlnTyr: 1.187 ± 0.4
0.0GlnXaa: 0.0 ± 0.0
Arg
1.899ArgAla: 1.899 ± 0.538
0.475ArgCys: 0.475 ± 0.265
2.611ArgAsp: 2.611 ± 0.582
5.222ArgGlu: 5.222 ± 0.908
2.611ArgPhe: 2.611 ± 0.754
3.086ArgGly: 3.086 ± 0.773
1.187ArgHis: 1.187 ± 0.388
4.272ArgIle: 4.272 ± 0.695
2.374ArgLys: 2.374 ± 0.703
4.035ArgLeu: 4.035 ± 0.514
1.187ArgMet: 1.187 ± 0.443
3.798ArgAsn: 3.798 ± 0.769
3.323ArgPro: 3.323 ± 0.797
1.424ArgGln: 1.424 ± 0.326
1.899ArgArg: 1.899 ± 0.904
3.798ArgSer: 3.798 ± 0.851
3.798ArgThr: 3.798 ± 0.725
2.611ArgVal: 2.611 ± 1.024
0.712ArgTrp: 0.712 ± 0.338
1.424ArgTyr: 1.424 ± 0.366
0.0ArgXaa: 0.0 ± 0.0
Ser
4.985SerAla: 4.985 ± 0.564
1.424SerCys: 1.424 ± 0.326
4.272SerAsp: 4.272 ± 0.773
3.798SerGlu: 3.798 ± 0.531
3.323SerPhe: 3.323 ± 1.087
4.035SerGly: 4.035 ± 0.714
3.798SerHis: 3.798 ± 0.828
5.697SerIle: 5.697 ± 1.021
4.272SerLys: 4.272 ± 0.56
9.02SerLeu: 9.02 ± 0.917
1.662SerMet: 1.662 ± 1.063
1.899SerAsn: 1.899 ± 1.06
4.272SerPro: 4.272 ± 0.627
4.035SerGln: 4.035 ± 0.69
4.272SerArg: 4.272 ± 1.593
7.358SerSer: 7.358 ± 1.195
5.459SerThr: 5.459 ± 1.045
5.459SerVal: 5.459 ± 0.596
0.949SerTrp: 0.949 ± 0.294
3.56SerTyr: 3.56 ± 0.653
0.0SerXaa: 0.0 ± 0.0
Thr
2.611ThrAla: 2.611 ± 0.645
1.187ThrCys: 1.187 ± 0.545
2.848ThrAsp: 2.848 ± 0.781
4.272ThrGlu: 4.272 ± 0.71
2.374ThrPhe: 2.374 ± 0.569
2.611ThrGly: 2.611 ± 0.779
2.374ThrHis: 2.374 ± 0.487
4.985ThrIle: 4.985 ± 0.847
3.086ThrLys: 3.086 ± 0.776
3.56ThrLeu: 3.56 ± 0.857
2.611ThrMet: 2.611 ± 0.416
3.323ThrAsn: 3.323 ± 0.717
5.697ThrPro: 5.697 ± 0.932
3.086ThrGln: 3.086 ± 0.729
3.086ThrArg: 3.086 ± 0.892
4.985ThrSer: 4.985 ± 0.553
2.611ThrThr: 2.611 ± 0.811
2.611ThrVal: 2.611 ± 1.16
1.662ThrTrp: 1.662 ± 0.528
1.662ThrTyr: 1.662 ± 0.5
0.0ThrXaa: 0.0 ± 0.0
Val
4.51ValAla: 4.51 ± 1.197
0.712ValCys: 0.712 ± 0.218
3.086ValAsp: 3.086 ± 1.205
3.086ValGlu: 3.086 ± 0.781
0.949ValPhe: 0.949 ± 0.294
1.899ValGly: 1.899 ± 0.358
1.187ValHis: 1.187 ± 0.47
5.459ValIle: 5.459 ± 1.231
3.56ValLys: 3.56 ± 1.345
4.272ValLeu: 4.272 ± 0.928
0.712ValMet: 0.712 ± 0.338
1.187ValAsn: 1.187 ± 0.267
4.272ValPro: 4.272 ± 0.622
1.424ValGln: 1.424 ± 0.445
2.611ValArg: 2.611 ± 0.833
4.51ValSer: 4.51 ± 0.853
1.187ValThr: 1.187 ± 0.517
4.51ValVal: 4.51 ± 0.921
0.712ValTrp: 0.712 ± 0.302
1.662ValTyr: 1.662 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.949TrpAla: 0.949 ± 0.678
0.475TrpCys: 0.475 ± 0.211
0.712TrpAsp: 0.712 ± 0.609
1.424TrpGlu: 1.424 ± 0.487
0.949TrpPhe: 0.949 ± 0.36
1.424TrpGly: 1.424 ± 0.605
0.0TrpHis: 0.0 ± 0.0
0.712TrpIle: 0.712 ± 0.218
2.374TrpLys: 2.374 ± 0.678
1.424TrpLeu: 1.424 ± 0.632
0.237TrpMet: 0.237 ± 0.133
1.662TrpAsn: 1.662 ± 0.667
0.949TrpPro: 0.949 ± 0.472
0.0TrpGln: 0.0 ± 0.0
0.712TrpArg: 0.712 ± 0.398
0.949TrpSer: 0.949 ± 0.391
1.187TrpThr: 1.187 ± 0.47
1.424TrpVal: 1.424 ± 0.447
0.475TrpTrp: 0.475 ± 0.211
0.712TrpTyr: 0.712 ± 0.486
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.187TyrAla: 1.187 ± 0.256
0.475TyrCys: 0.475 ± 0.276
1.187TyrAsp: 1.187 ± 0.256
1.187TyrGlu: 1.187 ± 0.921
1.899TyrPhe: 1.899 ± 0.689
1.187TyrGly: 1.187 ± 0.694
0.949TyrHis: 0.949 ± 0.391
3.086TyrIle: 3.086 ± 0.99
2.374TyrLys: 2.374 ± 0.96
4.747TyrLeu: 4.747 ± 0.789
0.949TyrMet: 0.949 ± 0.516
1.662TyrAsn: 1.662 ± 0.5
2.848TyrPro: 2.848 ± 0.621
1.187TyrGln: 1.187 ± 0.845
0.949TyrArg: 0.949 ± 0.294
2.374TyrSer: 2.374 ± 1.192
3.086TyrThr: 3.086 ± 0.476
2.611TyrVal: 2.611 ± 0.6
0.949TyrTrp: 0.949 ± 0.294
1.662TyrTyr: 1.662 ± 0.379
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4214 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski