Amino acid dipeptide frequency for Wuhan aphid virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
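As an illustration of the idea above, here is a minimal sketch of how dipeptide frequencies could be counted and compared with the uniform expectation. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing in for Xaa), and the toy sequences are assumptions made for this example only.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (assumption: the table
# itself uses three-letter codes; one-letter codes keep the sketch short).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Dipeptides are overlapping windows of two residues and never span
    two different proteins. Any non-standard residue is mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 20 x 20 = 400 dipeptides, each at 1/400.
n_pairs = len(list(product(STANDARD, repeat=2)))
expected_permille = 1000.0 / n_pairs  # 2.5 per mille = 0.25 %

# Toy input (hypothetical sequences, not taken from the virus proteome).
freqs = dipeptide_permille(["AAA", "AC"])
```

With the toy input, the three dipeptides are AA, AA and AC, so AA makes up two thirds and AC one third of the total.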

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
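The unit conversion can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the comparison against a flat 2.5‰ expectation is an assumption: the page only says that values are marked as more or less frequent than expected, not how the threshold is applied.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value from the table into a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuThr (10.127) and CysCys (0.653).
leu_thr_fraction = permille_to_fraction(10.127)
leu_thr_label = classify(10.127)
cys_cys_label = classify(0.653)
```

Under these assumptions LeuThr would be labelled "over" and CysCys "under".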

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.594 ± 0.785
AlaCys: 0.98 ± 0.474
AlaAsp: 2.94 ± 0.796
AlaGlu: 2.287 ± 0.512
AlaPhe: 4.574 ± 1.995
AlaGly: 6.207 ± 1.108
AlaHis: 1.96 ± 0.643
AlaIle: 2.287 ± 0.565
AlaLys: 4.574 ± 1.194
AlaLeu: 7.514 ± 1.146
AlaMet: 2.614 ± 0.568
AlaAsn: 2.287 ± 0.53
AlaPro: 1.307 ± 0.549
AlaGln: 1.96 ± 0.68
AlaArg: 1.633 ± 0.514
AlaSer: 4.574 ± 0.954
AlaThr: 4.574 ± 1.508
AlaVal: 3.267 ± 1.427
AlaTrp: 1.307 ± 0.702
AlaTyr: 2.94 ± 0.636
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.307 ± 1.064
CysCys: 0.653 ± 0.272
CysAsp: 0.653 ± 0.509
CysGlu: 0.653 ± 0.509
CysPhe: 0.653 ± 0.371
CysGly: 1.96 ± 0.489
CysHis: 0.0 ± 0.0
CysIle: 0.327 ± 0.349
CysLys: 0.98 ± 0.764
CysLeu: 1.96 ± 0.749
CysMet: 0.653 ± 0.347
CysAsn: 0.0 ± 0.0
CysPro: 0.327 ± 0.488
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.96 ± 0.631
CysThr: 0.327 ± 0.349
CysVal: 1.307 ± 0.815
CysTrp: 0.0 ± 0.0
CysTyr: 0.98 ± 0.685
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.267 ± 0.46
AspCys: 0.327 ± 0.488
AspAsp: 2.94 ± 0.734
AspGlu: 5.227 ± 0.837
AspPhe: 1.307 ± 0.954
AspGly: 2.94 ± 0.716
AspHis: 0.327 ± 0.231
AspIle: 4.247 ± 0.837
AspLys: 3.594 ± 0.576
AspLeu: 4.574 ± 1.01
AspMet: 2.287 ± 0.753
AspAsn: 1.96 ± 0.891
AspPro: 2.94 ± 1.67
AspGln: 0.98 ± 0.597
AspArg: 2.614 ± 1.094
AspSer: 1.633 ± 0.768
AspThr: 4.574 ± 1.247
AspVal: 4.574 ± 1.169
AspTrp: 1.633 ± 0.893
AspTyr: 1.633 ± 0.695
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.96 ± 0.946
GluCys: 0.653 ± 0.509
GluAsp: 2.94 ± 0.426
GluGlu: 4.247 ± 1.17
GluPhe: 1.633 ± 0.456
GluGly: 3.594 ± 0.649
GluHis: 1.633 ± 0.689
GluIle: 4.247 ± 0.868
GluLys: 4.9 ± 1.136
GluLeu: 6.207 ± 2.025
GluMet: 0.327 ± 0.255
GluAsn: 4.574 ± 1.329
GluPro: 1.96 ± 0.612
GluGln: 2.287 ± 0.744
GluArg: 2.94 ± 0.976
GluSer: 4.574 ± 1.277
GluThr: 1.96 ± 0.947
GluVal: 2.614 ± 0.816
GluTrp: 0.98 ± 0.764
GluTyr: 1.633 ± 0.623
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.287 ± 0.702
PheCys: 0.327 ± 0.231
PheAsp: 3.594 ± 0.987
PheGlu: 3.267 ± 1.024
PhePhe: 1.96 ± 0.821
PheGly: 4.574 ± 1.351
PheHis: 0.653 ± 0.347
PheIle: 2.287 ± 0.638
PheLys: 2.614 ± 0.506
PheLeu: 2.614 ± 1.237
PheMet: 1.633 ± 0.893
PheAsn: 1.633 ± 0.514
PhePro: 1.307 ± 0.384
PheGln: 1.633 ± 0.471
PheArg: 0.98 ± 0.299
PheSer: 3.594 ± 1.702
PheThr: 0.98 ± 0.597
PheVal: 2.94 ± 0.741
PheTrp: 1.633 ± 0.99
PheTyr: 2.614 ± 0.677
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.574 ± 0.944
GlyCys: 1.307 ± 0.253
GlyAsp: 3.594 ± 0.51
GlyGlu: 2.94 ± 1.219
GlyPhe: 5.227 ± 0.778
GlyGly: 3.92 ± 1.472
GlyHis: 2.287 ± 1.177
GlyIle: 5.554 ± 2.081
GlyLys: 7.187 ± 1.243
GlyLeu: 6.861 ± 1.846
GlyMet: 3.594 ± 0.709
GlyAsn: 3.92 ± 1.591
GlyPro: 1.96 ± 0.343
GlyGln: 0.327 ± 0.231
GlyArg: 5.227 ± 1.154
GlySer: 5.227 ± 1.062
GlyThr: 4.247 ± 1.436
GlyVal: 9.474 ± 1.019
GlyTrp: 1.633 ± 0.723
GlyTyr: 2.614 ± 1.088
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.96 ± 1.041
HisCys: 0.98 ± 0.96
HisAsp: 1.307 ± 0.544
HisGlu: 1.307 ± 0.544
HisPhe: 1.633 ± 0.514
HisGly: 2.287 ± 0.517
HisHis: 0.653 ± 0.347
HisIle: 0.327 ± 0.349
HisLys: 0.653 ± 0.347
HisLeu: 1.633 ± 0.594
HisMet: 0.653 ± 0.347
HisAsn: 0.327 ± 0.255
HisPro: 0.653 ± 0.272
HisGln: 0.98 ± 0.47
HisArg: 0.653 ± 0.482
HisSer: 0.327 ± 0.349
HisThr: 0.327 ± 0.231
HisVal: 1.96 ± 0.625
HisTrp: 0.653 ± 0.463
HisTyr: 1.633 ± 0.768
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.207 ± 1.919
IleCys: 1.307 ± 0.436
IleAsp: 1.96 ± 0.625
IleGlu: 1.307 ± 0.716
IlePhe: 1.96 ± 0.854
IleGly: 5.227 ± 1.659
IleHis: 0.98 ± 0.299
IleIle: 3.594 ± 0.91
IleLys: 4.247 ± 0.952
IleLeu: 4.247 ± 1.731
IleMet: 3.267 ± 1.162
IleAsn: 2.287 ± 0.868
IlePro: 2.94 ± 0.744
IleGln: 0.98 ± 0.299
IleArg: 2.94 ± 1.197
IleSer: 2.614 ± 1.227
IleThr: 4.247 ± 0.908
IleVal: 4.574 ± 1.595
IleTrp: 1.307 ± 0.733
IleTyr: 3.267 ± 0.69
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.267 ± 0.871
LysCys: 1.633 ± 0.713
LysAsp: 5.554 ± 1.2
LysGlu: 6.534 ± 1.524
LysPhe: 1.633 ± 0.678
LysGly: 4.574 ± 1.323
LysHis: 0.98 ± 0.47
LysIle: 2.614 ± 0.925
LysLys: 4.574 ± 0.373
LysLeu: 4.9 ± 1.632
LysMet: 3.594 ± 1.525
LysAsn: 2.287 ± 0.533
LysPro: 2.94 ± 0.595
LysGln: 2.94 ± 0.773
LysArg: 2.614 ± 0.571
LysSer: 1.633 ± 0.628
LysThr: 3.594 ± 1.159
LysVal: 4.247 ± 1.111
LysTrp: 1.307 ± 0.544
LysTyr: 2.94 ± 1.271
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.88 ± 1.683
LeuCys: 0.653 ± 0.343
LeuAsp: 6.534 ± 1.297
LeuGlu: 3.92 ± 0.882
LeuPhe: 4.247 ± 0.793
LeuGly: 6.861 ± 0.681
LeuHis: 1.633 ± 0.477
LeuIle: 3.92 ± 0.993
LeuLys: 4.247 ± 1.434
LeuLeu: 8.494 ± 1.847
LeuMet: 2.94 ± 0.782
LeuAsn: 2.287 ± 0.879
LeuPro: 3.92 ± 0.755
LeuGln: 1.633 ± 1.06
LeuArg: 7.514 ± 1.722
LeuSer: 7.841 ± 1.212
LeuThr: 10.127 ± 1.275
LeuVal: 7.514 ± 2.398
LeuTrp: 1.96 ± 1.228
LeuTyr: 2.614 ± 1.061
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.614 ± 0.77
MetCys: 0.98 ± 0.596
MetAsp: 1.96 ± 0.987
MetGlu: 2.614 ± 0.788
MetPhe: 0.653 ± 0.343
MetGly: 1.96 ± 0.481
MetHis: 0.653 ± 0.491
MetIle: 2.614 ± 0.807
MetLys: 2.287 ± 0.852
MetLeu: 4.247 ± 1.212
MetMet: 1.307 ± 0.895
MetAsn: 0.98 ± 0.474
MetPro: 0.653 ± 0.463
MetGln: 0.327 ± 0.255
MetArg: 2.287 ± 0.698
MetSer: 1.633 ± 0.922
MetThr: 2.614 ± 1.124
MetVal: 3.594 ± 1.14
MetTrp: 0.653 ± 0.347
MetTyr: 1.633 ± 0.835
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.287 ± 0.951
AsnCys: 0.327 ± 0.255
AsnAsp: 1.307 ± 0.436
AsnGlu: 1.96 ± 0.597
AsnPhe: 1.633 ± 0.471
AsnGly: 1.96 ± 0.773
AsnHis: 1.96 ± 0.816
AsnIle: 2.614 ± 0.675
AsnLys: 2.287 ± 0.493
AsnLeu: 4.247 ± 0.869
AsnMet: 0.653 ± 0.604
AsnAsn: 0.653 ± 0.463
AsnPro: 1.96 ± 0.94
AsnGln: 0.0 ± 0.0
AsnArg: 2.614 ± 1.162
AsnSer: 1.307 ± 0.631
AsnThr: 1.96 ± 0.711
AsnVal: 1.96 ± 0.472
AsnTrp: 2.287 ± 0.797
AsnTyr: 1.307 ± 0.479
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.307 ± 0.444
ProCys: 0.0 ± 0.0
ProAsp: 0.327 ± 0.255
ProGlu: 1.96 ± 0.888
ProPhe: 0.653 ± 0.689
ProGly: 4.9 ± 0.916
ProHis: 0.327 ± 0.255
ProIle: 2.94 ± 0.924
ProLys: 1.633 ± 0.314
ProLeu: 4.247 ± 1.279
ProMet: 1.96 ± 0.615
ProAsn: 1.633 ± 0.983
ProPro: 0.98 ± 0.436
ProGln: 0.653 ± 0.371
ProArg: 1.633 ± 1.024
ProSer: 3.92 ± 1.568
ProThr: 3.594 ± 0.944
ProVal: 3.594 ± 0.836
ProTrp: 0.653 ± 0.343
ProTyr: 1.307 ± 0.742
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.633 ± 0.729
GlnCys: 0.653 ± 0.371
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.287 ± 0.636
GlnPhe: 0.98 ± 0.532
GlnGly: 3.267 ± 0.642
GlnHis: 0.327 ± 0.41
GlnIle: 1.307 ± 0.436
GlnLys: 1.633 ± 0.628
GlnLeu: 0.653 ± 0.272
GlnMet: 0.653 ± 0.272
GlnAsn: 0.327 ± 0.349
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.96 ± 0.872
GlnSer: 0.98 ± 0.937
GlnThr: 0.0 ± 0.0
GlnVal: 1.96 ± 0.961
GlnTrp: 0.327 ± 0.488
GlnTyr: 0.653 ± 0.477
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.92 ± 1.732
ArgCys: 0.0 ± 0.0
ArgAsp: 2.287 ± 0.463
ArgGlu: 4.247 ± 2.056
ArgPhe: 1.96 ± 0.437
ArgGly: 4.9 ± 1.355
ArgHis: 0.98 ± 0.47
ArgIle: 2.94 ± 1.235
ArgLys: 4.9 ± 0.879
ArgLeu: 6.534 ± 1.834
ArgMet: 1.633 ± 0.64
ArgAsn: 0.327 ± 0.255
ArgPro: 1.96 ± 0.437
ArgGln: 0.98 ± 0.473
ArgArg: 0.98 ± 0.322
ArgSer: 1.307 ± 0.442
ArgThr: 3.267 ± 0.882
ArgVal: 5.227 ± 0.573
ArgTrp: 0.327 ± 0.255
ArgTyr: 2.614 ± 0.807
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.9 ± 1.218
SerCys: 0.98 ± 0.495
SerAsp: 2.614 ± 0.535
SerGlu: 1.96 ± 0.495
SerPhe: 3.267 ± 1.207
SerGly: 4.247 ± 1.151
SerHis: 2.287 ± 0.53
SerIle: 4.574 ± 0.88
SerLys: 3.92 ± 1.995
SerLeu: 6.207 ± 1.952
SerMet: 1.307 ± 0.544
SerAsn: 3.594 ± 1.203
SerPro: 2.94 ± 0.752
SerGln: 1.307 ± 0.619
SerArg: 2.94 ± 1.061
SerSer: 5.88 ± 3.147
SerThr: 4.574 ± 0.922
SerVal: 3.92 ± 1.732
SerTrp: 1.633 ± 1.469
SerTyr: 1.307 ± 0.466
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.94 ± 0.497
ThrCys: 0.653 ± 0.371
ThrAsp: 3.92 ± 1.12
ThrGlu: 2.614 ± 0.681
ThrPhe: 1.96 ± 0.816
ThrGly: 6.207 ± 1.018
ThrHis: 0.98 ± 0.436
ThrIle: 5.227 ± 1.065
ThrLys: 3.267 ± 1.304
ThrLeu: 5.88 ± 1.081
ThrMet: 1.96 ± 0.83
ThrAsn: 1.96 ± 0.769
ThrPro: 3.92 ± 1.141
ThrGln: 0.327 ± 0.255
ThrArg: 3.594 ± 1.079
ThrSer: 4.574 ± 1.274
ThrThr: 5.88 ± 0.879
ThrVal: 3.594 ± 1.433
ThrTrp: 1.633 ± 0.689
ThrTyr: 3.267 ± 0.319
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.207 ± 1.54
ValCys: 0.98 ± 0.473
ValAsp: 3.594 ± 0.593
ValGlu: 2.614 ± 0.562
ValPhe: 3.92 ± 0.947
ValGly: 7.841 ± 0.662
ValHis: 0.653 ± 0.698
ValIle: 3.594 ± 1.07
ValLys: 3.92 ± 1.285
ValLeu: 7.514 ± 1.631
ValMet: 2.94 ± 0.782
ValAsn: 1.633 ± 0.591
ValPro: 3.594 ± 1.102
ValGln: 0.98 ± 0.473
ValArg: 3.92 ± 1.152
ValSer: 5.554 ± 1.97
ValThr: 4.247 ± 1.151
ValVal: 5.88 ± 1.444
ValTrp: 1.96 ± 0.789
ValTyr: 3.267 ± 0.686
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.327 ± 0.41
TrpCys: 0.653 ± 0.604
TrpAsp: 1.96 ± 1.083
TrpGlu: 1.633 ± 1.024
TrpPhe: 1.633 ± 0.471
TrpGly: 0.98 ± 0.526
TrpHis: 0.0 ± 0.0
TrpIle: 1.96 ± 0.495
TrpLys: 0.98 ± 0.436
TrpLeu: 3.267 ± 0.899
TrpMet: 1.307 ± 0.562
TrpAsn: 0.653 ± 0.398
TrpPro: 0.327 ± 0.255
TrpGln: 0.653 ± 0.343
TrpArg: 1.96 ± 0.869
TrpSer: 1.96 ± 0.444
TrpThr: 0.98 ± 0.322
TrpVal: 0.327 ± 0.302
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.653 ± 0.371
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.94 ± 0.594
TyrCys: 0.653 ± 0.698
TyrAsp: 3.92 ± 1.694
TyrGlu: 2.287 ± 1.074
TyrPhe: 1.633 ± 0.983
TyrGly: 3.267 ± 1.256
TyrHis: 1.633 ± 0.725
TyrIle: 2.287 ± 0.974
TyrLys: 1.96 ± 0.631
TyrLeu: 2.94 ± 0.907
TyrMet: 0.653 ± 0.345
TyrAsn: 1.96 ± 0.676
TyrPro: 1.633 ± 0.56
TyrGln: 0.653 ± 0.689
TyrArg: 2.287 ± 0.952
TyrSer: 3.267 ± 0.627
TyrThr: 2.287 ± 1.316
TyrVal: 2.287 ± 0.99
TyrTrp: 0.327 ± 0.231
TyrTyr: 2.94 ± 1.609
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3062 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
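The page does not describe the bootstrap procedure in more detail. One plausible reading, sketched below, is to resample whole proteins with replacement, recompute the frequency for each replicate, and report the spread of the replicate values. The function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all assumptions for illustration, not the Proteome-pI implementation.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping two-residue windows within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical one-letter sequences).
toy = ["AAAA", "ACAC", "CCCC"]
error_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level rather than the residue level keeps each protein's internal composition intact, which is why a proteome of only 6 proteins can show large errors for some dipeptides.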

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski