Amino acid dipeptide frequency for Changping earthworm virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
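The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: the function names, the one-letter residue codes, the normalisation by the number of counted dipeptides, and the use of exactly 2.5‰ as the red/blue threshold are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein (never across proteins)."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # corresponds to red in the table
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # corresponds to blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["MALLS", "LLSA"]  # hypothetical toy sequences, not from this proteome
    freqs = dipeptide_per_mille(toy)
    for dp in sorted(freqs):
        print(dp, round(freqs[dp], 1), classify(freqs[dp]))
```

Applied to the table values, `classify(3.999)` (AlaAla) would return "over" and `classify(0.941)` (AlaTrp) would return "under".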

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
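As a quick worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
# Uniform expectation for 400 dipeptides, in per mille (1/400 = 2.5 per mille).
EXPECTED_PER_MILLE = 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """How many times more (or less) frequent than the uniform expectation."""
    return value / EXPECTED_PER_MILLE


if __name__ == "__main__":
    leu_leu = 10.115  # LeuLeu frequency from the table, in per mille
    print(round(per_mille_to_fraction(leu_leu), 6))  # fraction of all dipeptides
    print(round(per_mille_to_percent(leu_leu), 4))   # same value in percent
    print(round(fold_vs_uniform(leu_leu), 2))        # ratio to 2.5 per mille
```

For LeuLeu this gives a fraction of about 0.0101, i.e. roughly 1%, which is about four times the uniform expectation.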

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.999 ± 0.603
AlaCys: 1.176 ± 0.38
AlaAsp: 3.293 ± 0.382
AlaGlu: 3.293 ± 0.473
AlaPhe: 2.352 ± 0.58
AlaGly: 3.764 ± 1.102
AlaHis: 1.411 ± 0.472
AlaIle: 2.823 ± 0.948
AlaLys: 3.764 ± 1.129
AlaLeu: 4.47 ± 1.176
AlaMet: 1.882 ± 0.411
AlaAsn: 3.529 ± 0.619
AlaPro: 1.882 ± 0.401
AlaGln: 1.882 ± 0.647
AlaArg: 2.823 ± 0.675
AlaSer: 7.763 ± 1.509
AlaThr: 1.882 ± 0.92
AlaVal: 2.588 ± 0.708
AlaTrp: 0.941 ± 0.279
AlaTyr: 1.411 ± 0.263
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.647 ± 0.438
CysCys: 0.706 ± 0.319
CysAsp: 1.411 ± 0.387
CysGlu: 1.176 ± 0.294
CysPhe: 1.882 ± 0.514
CysGly: 0.941 ± 0.483
CysHis: 0.706 ± 0.502
CysIle: 1.411 ± 0.48
CysLys: 0.47 ± 0.222
CysLeu: 1.882 ± 0.587
CysMet: 0.0 ± 0.0
CysAsn: 0.941 ± 0.272
CysPro: 0.941 ± 0.409
CysGln: 0.706 ± 0.3
CysArg: 0.47 ± 0.257
CysSer: 1.882 ± 0.369
CysThr: 1.882 ± 0.636
CysVal: 0.706 ± 0.407
CysTrp: 0.706 ± 0.442
CysTyr: 0.706 ± 0.402
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.352 ± 0.586
AspCys: 1.411 ± 0.313
AspAsp: 3.529 ± 1.07
AspGlu: 2.823 ± 1.021
AspPhe: 3.058 ± 0.401
AspGly: 3.058 ± 1.025
AspHis: 0.47 ± 0.334
AspIle: 2.588 ± 0.683
AspLys: 2.823 ± 0.708
AspLeu: 4.47 ± 1.001
AspMet: 1.647 ± 0.643
AspAsn: 2.117 ± 0.444
AspPro: 2.588 ± 0.953
AspGln: 1.882 ± 0.96
AspArg: 2.588 ± 0.752
AspSer: 3.293 ± 0.528
AspThr: 2.823 ± 0.627
AspVal: 3.293 ± 0.693
AspTrp: 0.941 ± 0.465
AspTyr: 1.411 ± 0.376
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.293 ± 0.442
GluCys: 0.941 ± 0.485
GluAsp: 3.764 ± 1.18
GluGlu: 3.529 ± 1.822
GluPhe: 3.293 ± 1.095
GluGly: 3.529 ± 0.747
GluHis: 0.706 ± 0.502
GluIle: 3.764 ± 0.833
GluLys: 2.588 ± 0.667
GluLeu: 5.175 ± 1.144
GluMet: 1.176 ± 0.386
GluAsn: 1.882 ± 0.48
GluPro: 2.117 ± 0.249
GluGln: 1.647 ± 0.841
GluArg: 1.882 ± 0.603
GluSer: 4.47 ± 1.089
GluThr: 5.646 ± 0.949
GluVal: 3.999 ± 0.436
GluTrp: 1.176 ± 0.367
GluTyr: 2.823 ± 0.699
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.352 ± 0.687
PheCys: 1.411 ± 0.442
PheAsp: 2.117 ± 0.676
PheGlu: 2.823 ± 0.755
PhePhe: 1.176 ± 0.592
PheGly: 1.882 ± 0.481
PheHis: 0.941 ± 0.558
PheIle: 2.352 ± 1.292
PheLys: 1.882 ± 0.438
PheLeu: 5.881 ± 1.029
PheMet: 0.941 ± 0.231
PheAsn: 3.058 ± 0.9
PhePro: 2.352 ± 0.676
PheGln: 1.647 ± 0.527
PheArg: 1.411 ± 0.273
PheSer: 3.764 ± 0.652
PheThr: 2.823 ± 0.587
PheVal: 3.764 ± 0.774
PheTrp: 0.941 ± 0.499
PheTyr: 2.352 ± 0.48
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.293 ± 0.84
GlyCys: 0.706 ± 0.325
GlyAsp: 2.352 ± 0.339
GlyGlu: 2.823 ± 0.827
GlyPhe: 2.352 ± 0.78
GlyGly: 3.764 ± 0.747
GlyHis: 1.882 ± 0.487
GlyIle: 3.058 ± 1.15
GlyLys: 3.058 ± 0.634
GlyLeu: 4.47 ± 0.79
GlyMet: 2.352 ± 0.59
GlyAsn: 1.647 ± 0.406
GlyPro: 2.588 ± 0.799
GlyGln: 2.823 ± 0.865
GlyArg: 2.588 ± 0.763
GlySer: 3.764 ± 1.085
GlyThr: 5.41 ± 0.74
GlyVal: 3.058 ± 0.83
GlyTrp: 0.706 ± 0.455
GlyTyr: 1.882 ± 0.539
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.176 ± 0.529
HisCys: 0.706 ± 0.325
HisAsp: 1.176 ± 0.353
HisGlu: 0.941 ± 0.255
HisPhe: 0.941 ± 0.328
HisGly: 0.941 ± 0.364
HisHis: 0.706 ± 0.301
HisIle: 1.411 ± 0.772
HisLys: 0.941 ± 0.452
HisLeu: 2.823 ± 0.667
HisMet: 0.235 ± 0.313
HisAsn: 1.411 ± 0.488
HisPro: 0.941 ± 0.67
HisGln: 0.706 ± 0.225
HisArg: 1.411 ± 0.638
HisSer: 3.293 ± 0.692
HisThr: 1.647 ± 0.451
HisVal: 1.411 ± 0.508
HisTrp: 0.235 ± 0.178
HisTyr: 0.47 ± 0.335
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.058 ± 0.95
IleCys: 0.706 ± 0.382
IleAsp: 2.117 ± 0.614
IleGlu: 3.529 ± 1.196
IlePhe: 2.117 ± 0.784
IleGly: 2.823 ± 0.943
IleHis: 1.411 ± 0.571
IleIle: 2.823 ± 0.573
IleLys: 4.234 ± 0.792
IleLeu: 7.998 ± 1.416
IleMet: 1.647 ± 0.479
IleAsn: 2.588 ± 0.763
IlePro: 1.647 ± 1.172
IleGln: 1.882 ± 0.782
IleArg: 2.352 ± 0.191
IleSer: 6.116 ± 1.1
IleThr: 3.999 ± 1.007
IleVal: 3.999 ± 0.783
IleTrp: 0.941 ± 0.67
IleTyr: 0.941 ± 0.479
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.764 ± 0.77
LysCys: 2.117 ± 0.699
LysAsp: 2.823 ± 0.831
LysGlu: 2.352 ± 0.381
LysPhe: 3.764 ± 1.121
LysGly: 1.882 ± 0.499
LysHis: 1.176 ± 0.469
LysIle: 3.999 ± 0.653
LysLys: 2.117 ± 0.427
LysLeu: 4.234 ± 0.879
LysMet: 1.882 ± 0.91
LysAsn: 1.882 ± 0.687
LysPro: 3.293 ± 0.514
LysGln: 3.293 ± 1.031
LysArg: 3.764 ± 0.712
LysSer: 5.175 ± 0.48
LysThr: 3.764 ± 0.724
LysVal: 4.47 ± 0.701
LysTrp: 0.941 ± 0.499
LysTyr: 1.882 ± 0.553
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.94 ± 0.697
LeuCys: 1.176 ± 0.436
LeuAsp: 3.529 ± 0.582
LeuGlu: 8.233 ± 0.754
LeuPhe: 3.999 ± 0.777
LeuGly: 5.881 ± 0.842
LeuHis: 2.823 ± 0.638
LeuIle: 4.234 ± 0.602
LeuLys: 4.47 ± 0.97
LeuLeu: 10.115 ± 2.097
LeuMet: 3.999 ± 0.873
LeuAsn: 4.94 ± 0.626
LeuPro: 4.94 ± 0.889
LeuGln: 2.352 ± 0.723
LeuArg: 6.822 ± 1.921
LeuSer: 9.174 ± 1.233
LeuThr: 3.764 ± 1.668
LeuVal: 6.116 ± 1.272
LeuTrp: 2.588 ± 0.937
LeuTyr: 3.293 ± 1.09
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.117 ± 0.651
MetCys: 0.235 ± 0.227
MetAsp: 0.235 ± 0.167
MetGlu: 1.176 ± 0.399
MetPhe: 1.882 ± 0.369
MetGly: 1.411 ± 0.602
MetHis: 0.0 ± 0.0
MetIle: 1.411 ± 0.489
MetLys: 1.176 ± 0.434
MetLeu: 2.352 ± 0.705
MetMet: 0.941 ± 0.364
MetAsn: 1.882 ± 0.754
MetPro: 1.647 ± 0.609
MetGln: 0.706 ± 0.419
MetArg: 1.411 ± 0.654
MetSer: 3.058 ± 0.353
MetThr: 1.411 ± 0.45
MetVal: 2.352 ± 0.676
MetTrp: 0.47 ± 0.336
MetTyr: 0.706 ± 0.356
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.706 ± 0.234
AsnCys: 1.176 ± 0.507
AsnAsp: 3.058 ± 0.947
AsnGlu: 1.882 ± 0.43
AsnPhe: 2.823 ± 0.508
AsnGly: 2.352 ± 0.78
AsnHis: 1.411 ± 0.602
AsnIle: 3.999 ± 0.984
AsnLys: 2.823 ± 0.705
AsnLeu: 3.999 ± 0.902
AsnMet: 0.941 ± 0.518
AsnAsn: 1.647 ± 0.335
AsnPro: 2.352 ± 0.656
AsnGln: 2.352 ± 0.788
AsnArg: 3.058 ± 0.457
AsnSer: 3.529 ± 0.671
AsnThr: 2.117 ± 0.733
AsnVal: 3.529 ± 0.897
AsnTrp: 0.235 ± 0.227
AsnTyr: 1.176 ± 0.468
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.588 ± 0.719
ProCys: 0.47 ± 0.304
ProAsp: 2.352 ± 0.255
ProGlu: 2.117 ± 0.571
ProPhe: 1.647 ± 0.424
ProGly: 2.117 ± 0.947
ProHis: 0.941 ± 0.527
ProIle: 1.882 ± 0.498
ProLys: 3.529 ± 0.802
ProLeu: 5.881 ± 1.452
ProMet: 0.706 ± 0.419
ProAsn: 2.588 ± 0.271
ProPro: 4.705 ± 0.921
ProGln: 1.647 ± 0.546
ProArg: 2.823 ± 0.906
ProSer: 4.705 ± 1.437
ProThr: 4.234 ± 0.31
ProVal: 2.823 ± 0.661
ProTrp: 0.706 ± 0.369
ProTyr: 1.176 ± 0.47
ProXaa: 0.235 ± 0.333
Gln
GlnAla: 1.411 ± 0.786
GlnCys: 1.411 ± 0.579
GlnAsp: 1.411 ± 0.592
GlnGlu: 3.058 ± 1.38
GlnPhe: 1.411 ± 0.588
GlnGly: 0.706 ± 0.364
GlnHis: 1.176 ± 0.386
GlnIle: 3.764 ± 1.012
GlnLys: 2.117 ± 0.612
GlnLeu: 3.999 ± 1.332
GlnMet: 0.706 ± 0.465
GlnAsn: 1.411 ± 0.637
GlnPro: 0.706 ± 0.225
GlnGln: 1.411 ± 0.539
GlnArg: 2.352 ± 0.554
GlnSer: 0.941 ± 0.396
GlnThr: 2.588 ± 0.937
GlnVal: 2.117 ± 1.054
GlnTrp: 1.176 ± 0.503
GlnTyr: 1.176 ± 0.563
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.293 ± 0.727
ArgCys: 1.176 ± 0.462
ArgAsp: 2.117 ± 0.477
ArgGlu: 2.588 ± 0.753
ArgPhe: 1.411 ± 0.678
ArgGly: 2.588 ± 1.082
ArgHis: 1.647 ± 0.463
ArgIle: 2.117 ± 0.789
ArgLys: 5.175 ± 1.099
ArgLeu: 4.47 ± 0.498
ArgMet: 1.411 ± 0.457
ArgAsn: 3.058 ± 0.817
ArgPro: 3.058 ± 0.585
ArgGln: 1.176 ± 0.468
ArgArg: 2.588 ± 0.785
ArgSer: 4.94 ± 0.865
ArgThr: 4.47 ± 1.146
ArgVal: 2.117 ± 0.638
ArgTrp: 0.235 ± 0.333
ArgTyr: 1.647 ± 0.75
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.116 ± 1.116
SerCys: 2.117 ± 0.407
SerAsp: 4.94 ± 0.866
SerGlu: 5.41 ± 1.655
SerPhe: 3.529 ± 0.785
SerGly: 4.94 ± 1.157
SerHis: 2.117 ± 0.549
SerIle: 6.351 ± 1.403
SerLys: 3.999 ± 0.996
SerLeu: 7.292 ± 1.765
SerMet: 2.588 ± 0.923
SerAsn: 2.823 ± 0.655
SerPro: 4.47 ± 1.447
SerGln: 3.529 ± 1.108
SerArg: 5.175 ± 0.364
SerSer: 7.998 ± 1.436
SerThr: 6.116 ± 0.628
SerVal: 5.646 ± 1.43
SerTrp: 1.176 ± 0.457
SerTyr: 3.529 ± 0.459
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.175 ± 0.759
ThrCys: 0.941 ± 0.51
ThrAsp: 3.764 ± 0.949
ThrGlu: 3.529 ± 0.554
ThrPhe: 3.058 ± 0.854
ThrGly: 3.999 ± 0.598
ThrHis: 1.176 ± 0.462
ThrIle: 3.058 ± 0.532
ThrLys: 6.587 ± 0.902
ThrLeu: 7.998 ± 1.094
ThrMet: 1.882 ± 0.696
ThrAsn: 2.588 ± 0.728
ThrPro: 3.529 ± 0.585
ThrGln: 2.117 ± 0.495
ThrArg: 2.588 ± 0.644
ThrSer: 7.057 ± 0.915
ThrThr: 5.646 ± 1.078
ThrVal: 2.823 ± 0.278
ThrTrp: 0.47 ± 0.327
ThrTyr: 1.882 ± 0.578
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.117 ± 0.56
ValCys: 1.176 ± 0.618
ValAsp: 2.823 ± 0.689
ValGlu: 3.999 ± 0.581
ValPhe: 2.823 ± 0.542
ValGly: 4.705 ± 1.076
ValHis: 1.411 ± 0.505
ValIle: 2.823 ± 0.653
ValLys: 3.999 ± 0.674
ValLeu: 6.116 ± 0.75
ValMet: 0.941 ± 0.694
ValAsn: 1.882 ± 0.369
ValPro: 3.764 ± 0.601
ValGln: 0.941 ± 0.485
ValArg: 2.823 ± 0.715
ValSer: 5.646 ± 1.025
ValThr: 6.822 ± 1.299
ValVal: 3.999 ± 0.98
ValTrp: 0.47 ± 0.373
ValTyr: 1.882 ± 0.643
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.588 ± 0.613
TrpCys: 0.235 ± 0.167
TrpAsp: 1.647 ± 0.581
TrpGlu: 0.235 ± 0.167
TrpPhe: 0.235 ± 0.227
TrpGly: 1.176 ± 0.593
TrpHis: 0.47 ± 0.335
TrpIle: 0.941 ± 0.694
TrpLys: 0.706 ± 0.319
TrpLeu: 0.706 ± 0.369
TrpMet: 0.235 ± 0.227
TrpAsn: 0.235 ± 0.167
TrpPro: 0.706 ± 0.319
TrpGln: 0.235 ± 0.167
TrpArg: 0.706 ± 0.3
TrpSer: 1.411 ± 0.595
TrpThr: 0.47 ± 0.266
TrpVal: 1.647 ± 0.66
TrpTrp: 0.235 ± 0.167
TrpTyr: 1.176 ± 0.686
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.941 ± 0.527
TyrCys: 1.176 ± 0.434
TyrAsp: 0.706 ± 0.357
TyrGlu: 1.882 ± 0.6
TyrPhe: 2.117 ± 0.803
TyrGly: 2.117 ± 0.661
TyrHis: 0.941 ± 0.432
TyrIle: 2.117 ± 0.716
TyrLys: 2.352 ± 0.992
TyrLeu: 3.058 ± 1.248
TyrMet: 0.235 ± 0.178
TyrAsn: 3.058 ± 0.718
TyrPro: 1.647 ± 0.575
TyrGln: 2.117 ± 1.016
TyrArg: 1.411 ± 0.502
TyrSer: 1.882 ± 0.72
TyrThr: 2.352 ± 0.848
TyrVal: 0.706 ± 0.311
TyrTrp: 0.706 ± 0.251
TyrTyr: 1.411 ± 0.649
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.235 ± 0.333
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.47 ± 0.666
Statistics based on 7 proteins (4252 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
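A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names, the fixed seed, the one-letter sequence input, and the use of the sample standard deviation as the error measure are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MALLSLL", "MKTAYIA", "LLGSA"]  # hypothetical toy proteome
    value = dipeptide_per_mille(toy, "LL")
    error = bootstrap_error(toy, "LL")
    print(f"LL: {value:.3f} ± {error:.3f}")
```

With only 7 proteins in this proteome, each resample can differ substantially from the original set, which is consistent with the relatively large errors reported for many dipeptides in the table.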

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski