Amino acid dipepetide frequency for Air potato virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.879AlaAla: 4.879 ± 1.141
0.929AlaCys: 0.929 ± 0.261
3.253AlaAsp: 3.253 ± 0.928
3.717AlaGlu: 3.717 ± 0.335
2.323AlaPhe: 2.323 ± 0.465
5.576AlaGly: 5.576 ± 1.305
1.394AlaHis: 1.394 ± 0.638
2.556AlaIle: 2.556 ± 0.957
1.859AlaLys: 1.859 ± 0.48
5.112AlaLeu: 5.112 ± 0.643
1.394AlaMet: 1.394 ± 0.493
1.162AlaAsn: 1.162 ± 0.471
4.414AlaPro: 4.414 ± 1.332
3.717AlaGln: 3.717 ± 0.788
3.02AlaArg: 3.02 ± 0.323
4.414AlaSer: 4.414 ± 1.246
4.647AlaThr: 4.647 ± 0.697
3.95AlaVal: 3.95 ± 0.77
1.394AlaTrp: 1.394 ± 0.689
2.091AlaTyr: 2.091 ± 0.623
0.0AlaXaa: 0.0 ± 0.0
Cys
0.465CysAla: 0.465 ± 0.476
0.697CysCys: 0.697 ± 0.233
1.626CysAsp: 1.626 ± 0.498
0.465CysGlu: 0.465 ± 0.221
1.394CysPhe: 1.394 ± 0.548
1.394CysGly: 1.394 ± 0.407
0.232CysHis: 0.232 ± 0.139
1.162CysIle: 1.162 ± 0.855
0.697CysLys: 0.697 ± 0.261
1.626CysLeu: 1.626 ± 0.458
0.232CysMet: 0.232 ± 0.139
0.697CysAsn: 0.697 ± 0.274
0.929CysPro: 0.929 ± 0.557
0.929CysGln: 0.929 ± 0.547
0.697CysArg: 0.697 ± 0.261
2.323CysSer: 2.323 ± 0.84
0.929CysThr: 0.929 ± 0.383
1.162CysVal: 1.162 ± 0.453
0.465CysTrp: 0.465 ± 0.221
1.162CysTyr: 1.162 ± 0.464
0.0CysXaa: 0.0 ± 0.0
Asp
3.253AspAla: 3.253 ± 0.425
2.323AspCys: 2.323 ± 0.457
1.859AspAsp: 1.859 ± 0.44
3.02AspGlu: 3.02 ± 0.582
3.485AspPhe: 3.485 ± 0.551
2.788AspGly: 2.788 ± 0.939
0.465AspHis: 0.465 ± 0.221
4.647AspIle: 4.647 ± 1.247
2.323AspLys: 2.323 ± 0.555
6.041AspLeu: 6.041 ± 1.673
0.465AspMet: 0.465 ± 0.204
1.859AspAsn: 1.859 ± 0.469
2.556AspPro: 2.556 ± 0.845
0.232AspGln: 0.232 ± 0.139
2.323AspArg: 2.323 ± 0.863
2.788AspSer: 2.788 ± 0.883
1.626AspThr: 1.626 ± 0.491
4.182AspVal: 4.182 ± 1.427
0.0AspTrp: 0.0 ± 0.0
1.626AspTyr: 1.626 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
3.253GluAla: 3.253 ± 0.991
0.697GluCys: 0.697 ± 0.233
3.02GluAsp: 3.02 ± 0.576
4.647GluGlu: 4.647 ± 0.706
2.788GluPhe: 2.788 ± 0.685
3.253GluGly: 3.253 ± 0.781
1.394GluHis: 1.394 ± 0.419
3.95GluIle: 3.95 ± 0.794
3.253GluLys: 3.253 ± 0.501
5.112GluLeu: 5.112 ± 0.79
1.162GluMet: 1.162 ± 0.432
3.02GluAsn: 3.02 ± 0.696
2.091GluPro: 2.091 ± 0.397
2.788GluGln: 2.788 ± 0.873
4.879GluArg: 4.879 ± 0.718
2.091GluSer: 2.091 ± 0.802
2.788GluThr: 2.788 ± 0.461
4.647GluVal: 4.647 ± 0.533
1.626GluTrp: 1.626 ± 0.444
1.626GluTyr: 1.626 ± 0.451
0.0GluXaa: 0.0 ± 0.0
Phe
2.323PheAla: 2.323 ± 0.325
1.394PheCys: 1.394 ± 0.406
2.788PheAsp: 2.788 ± 0.753
2.091PheGlu: 2.091 ± 0.601
1.859PhePhe: 1.859 ± 0.593
2.091PheGly: 2.091 ± 1.236
0.232PheHis: 0.232 ± 0.139
1.162PheIle: 1.162 ± 0.514
3.485PheLys: 3.485 ± 0.717
6.506PheLeu: 6.506 ± 1.015
0.697PheMet: 0.697 ± 0.384
1.394PheAsn: 1.394 ± 0.536
1.162PhePro: 1.162 ± 0.476
0.929PheGln: 0.929 ± 0.552
3.717PheArg: 3.717 ± 1.109
4.879PheSer: 4.879 ± 2.164
3.02PheThr: 3.02 ± 0.76
4.879PheVal: 4.879 ± 0.937
0.232PheTrp: 0.232 ± 0.332
1.162PheTyr: 1.162 ± 0.805
0.0PheXaa: 0.0 ± 0.0
Gly
4.414GlyAla: 4.414 ± 0.772
1.859GlyCys: 1.859 ± 0.597
3.717GlyAsp: 3.717 ± 0.776
3.485GlyGlu: 3.485 ± 0.516
2.556GlyPhe: 2.556 ± 0.381
4.647GlyGly: 4.647 ± 0.89
1.626GlyHis: 1.626 ± 0.402
3.95GlyIle: 3.95 ± 0.585
5.576GlyLys: 5.576 ± 1.692
5.112GlyLeu: 5.112 ± 0.732
0.697GlyMet: 0.697 ± 0.485
2.091GlyAsn: 2.091 ± 1.103
1.859GlyPro: 1.859 ± 0.852
1.394GlyGln: 1.394 ± 0.466
3.02GlyArg: 3.02 ± 1.069
3.95GlySer: 3.95 ± 0.461
3.253GlyThr: 3.253 ± 0.655
4.879GlyVal: 4.879 ± 1.29
0.929GlyTrp: 0.929 ± 0.349
2.788GlyTyr: 2.788 ± 0.833
0.0GlyXaa: 0.0 ± 0.0
His
1.394HisAla: 1.394 ± 0.629
0.232HisCys: 0.232 ± 0.139
0.465HisAsp: 0.465 ± 0.424
1.394HisGlu: 1.394 ± 0.365
0.465HisPhe: 0.465 ± 0.574
1.394HisGly: 1.394 ± 0.784
0.697HisHis: 0.697 ± 0.418
0.232HisIle: 0.232 ± 0.139
0.697HisLys: 0.697 ± 0.614
3.717HisLeu: 3.717 ± 0.528
0.697HisMet: 0.697 ± 0.261
0.0HisAsn: 0.0 ± 0.0
2.788HisPro: 2.788 ± 0.717
0.697HisGln: 0.697 ± 0.418
0.697HisArg: 0.697 ± 0.418
2.788HisSer: 2.788 ± 1.397
1.162HisThr: 1.162 ± 0.512
1.626HisVal: 1.626 ± 0.665
0.0HisTrp: 0.0 ± 0.0
1.162HisTyr: 1.162 ± 0.345
0.0HisXaa: 0.0 ± 0.0
Ile
2.323IleAla: 2.323 ± 1.038
0.697IleCys: 0.697 ± 0.31
3.485IleAsp: 3.485 ± 1.336
3.02IleGlu: 3.02 ± 0.733
1.626IlePhe: 1.626 ± 0.545
4.414IleGly: 4.414 ± 1.616
1.162IleHis: 1.162 ± 0.481
3.253IleIle: 3.253 ± 1.028
3.02IleLys: 3.02 ± 1.23
5.576IleLeu: 5.576 ± 0.851
1.162IleMet: 1.162 ± 0.464
2.556IleAsn: 2.556 ± 0.602
4.879IlePro: 4.879 ± 0.689
0.929IleGln: 0.929 ± 0.759
1.859IleArg: 1.859 ± 0.394
4.182IleSer: 4.182 ± 1.093
2.091IleThr: 2.091 ± 0.595
3.485IleVal: 3.485 ± 0.874
0.929IleTrp: 0.929 ± 0.349
1.394IleTyr: 1.394 ± 0.778
0.0IleXaa: 0.0 ± 0.0
Lys
3.717LysAla: 3.717 ± 0.7
0.232LysCys: 0.232 ± 0.332
2.788LysAsp: 2.788 ± 0.94
2.788LysGlu: 2.788 ± 0.674
2.788LysPhe: 2.788 ± 1.201
4.647LysGly: 4.647 ± 1.803
1.626LysHis: 1.626 ± 0.772
3.485LysIle: 3.485 ± 0.326
2.556LysLys: 2.556 ± 1.139
4.182LysLeu: 4.182 ± 0.556
2.091LysMet: 2.091 ± 0.509
2.556LysAsn: 2.556 ± 0.422
2.788LysPro: 2.788 ± 1.024
1.859LysGln: 1.859 ± 0.919
4.182LysArg: 4.182 ± 0.631
7.203LysSer: 7.203 ± 1.118
2.788LysThr: 2.788 ± 0.639
3.253LysVal: 3.253 ± 1.011
1.162LysTrp: 1.162 ± 0.349
2.788LysTyr: 2.788 ± 0.608
0.0LysXaa: 0.0 ± 0.0
Leu
6.273LeuAla: 6.273 ± 0.717
2.091LeuCys: 2.091 ± 0.522
3.485LeuAsp: 3.485 ± 0.979
4.879LeuGlu: 4.879 ± 0.874
3.02LeuPhe: 3.02 ± 0.909
5.576LeuGly: 5.576 ± 0.821
3.02LeuHis: 3.02 ± 0.824
4.647LeuIle: 4.647 ± 0.694
8.132LeuLys: 8.132 ± 0.598
12.082LeuLeu: 12.082 ± 1.707
2.323LeuMet: 2.323 ± 0.676
4.879LeuAsn: 4.879 ± 0.751
4.647LeuPro: 4.647 ± 0.79
2.788LeuGln: 2.788 ± 0.999
6.041LeuArg: 6.041 ± 1.187
6.738LeuSer: 6.738 ± 1.751
6.273LeuThr: 6.273 ± 1.113
8.132LeuVal: 8.132 ± 1.809
0.697LeuTrp: 0.697 ± 0.31
3.253LeuTyr: 3.253 ± 1.001
0.0LeuXaa: 0.0 ± 0.0
Met
0.929MetAla: 0.929 ± 0.38
0.697MetCys: 0.697 ± 0.233
0.465MetAsp: 0.465 ± 0.23
1.394MetGlu: 1.394 ± 0.613
1.162MetPhe: 1.162 ± 0.432
0.929MetGly: 0.929 ± 0.261
0.465MetHis: 0.465 ± 0.221
1.162MetIle: 1.162 ± 0.463
1.859MetLys: 1.859 ± 0.784
1.394MetLeu: 1.394 ± 0.522
0.232MetMet: 0.232 ± 0.139
0.929MetAsn: 0.929 ± 0.253
1.394MetPro: 1.394 ± 0.28
0.465MetGln: 0.465 ± 0.301
1.162MetArg: 1.162 ± 0.267
1.394MetSer: 1.394 ± 0.835
2.091MetThr: 2.091 ± 1.34
1.394MetVal: 1.394 ± 0.383
0.0MetTrp: 0.0 ± 0.0
0.232MetTyr: 0.232 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
2.323AsnAla: 2.323 ± 0.804
0.0AsnCys: 0.0 ± 0.0
0.697AsnAsp: 0.697 ± 0.233
2.323AsnGlu: 2.323 ± 0.837
3.253AsnPhe: 3.253 ± 1.027
2.091AsnGly: 2.091 ± 0.461
0.697AsnHis: 0.697 ± 0.39
2.091AsnIle: 2.091 ± 0.378
3.02AsnLys: 3.02 ± 0.492
4.647AsnLeu: 4.647 ± 0.879
0.697AsnMet: 0.697 ± 0.32
0.697AsnAsn: 0.697 ± 0.418
2.556AsnPro: 2.556 ± 0.595
0.697AsnGln: 0.697 ± 0.233
3.253AsnArg: 3.253 ± 0.952
3.485AsnSer: 3.485 ± 0.861
2.788AsnThr: 2.788 ± 1.053
3.02AsnVal: 3.02 ± 1.073
0.232AsnTrp: 0.232 ± 0.139
1.394AsnTyr: 1.394 ± 0.439
0.0AsnXaa: 0.0 ± 0.0
Pro
3.717ProAla: 3.717 ± 0.43
0.465ProCys: 0.465 ± 0.204
2.788ProAsp: 2.788 ± 0.952
3.02ProGlu: 3.02 ± 1.042
1.162ProPhe: 1.162 ± 0.245
2.323ProGly: 2.323 ± 0.665
1.626ProHis: 1.626 ± 0.945
2.091ProIle: 2.091 ± 0.525
2.556ProLys: 2.556 ± 0.654
4.879ProLeu: 4.879 ± 0.824
1.162ProMet: 1.162 ± 0.305
1.859ProAsn: 1.859 ± 0.335
2.788ProPro: 2.788 ± 0.623
2.091ProGln: 2.091 ± 1.193
2.323ProArg: 2.323 ± 0.579
4.182ProSer: 4.182 ± 0.856
5.344ProThr: 5.344 ± 1.173
4.414ProVal: 4.414 ± 0.95
0.232ProTrp: 0.232 ± 0.287
2.323ProTyr: 2.323 ± 0.411
0.0ProXaa: 0.0 ± 0.0
Gln
1.859GlnAla: 1.859 ± 0.722
0.465GlnCys: 0.465 ± 0.23
1.162GlnAsp: 1.162 ± 0.463
1.626GlnGlu: 1.626 ± 0.339
1.626GlnPhe: 1.626 ± 0.474
1.626GlnGly: 1.626 ± 0.489
1.162GlnHis: 1.162 ± 0.376
3.717GlnIle: 3.717 ± 1.374
1.626GlnLys: 1.626 ± 0.444
2.091GlnLeu: 2.091 ± 0.611
0.697GlnMet: 0.697 ± 0.233
1.394GlnAsn: 1.394 ± 0.894
2.091GlnPro: 2.091 ± 0.987
1.162GlnGln: 1.162 ± 0.464
0.465GlnArg: 0.465 ± 0.221
3.02GlnSer: 3.02 ± 0.567
2.091GlnThr: 2.091 ± 0.553
2.323GlnVal: 2.323 ± 0.595
0.232GlnTrp: 0.232 ± 0.139
0.465GlnTyr: 0.465 ± 0.23
0.0GlnXaa: 0.0 ± 0.0
Arg
3.717ArgAla: 3.717 ± 1.002
1.394ArgCys: 1.394 ± 0.218
3.02ArgAsp: 3.02 ± 0.672
4.182ArgGlu: 4.182 ± 0.792
2.091ArgPhe: 2.091 ± 0.84
3.717ArgGly: 3.717 ± 1.117
0.929ArgHis: 0.929 ± 0.557
2.323ArgIle: 2.323 ± 0.489
2.788ArgLys: 2.788 ± 1.397
4.647ArgLeu: 4.647 ± 0.661
1.162ArgMet: 1.162 ± 0.366
1.859ArgAsn: 1.859 ± 0.187
2.323ArgPro: 2.323 ± 1.151
2.323ArgGln: 2.323 ± 0.647
4.182ArgArg: 4.182 ± 0.863
5.112ArgSer: 5.112 ± 0.801
3.253ArgThr: 3.253 ± 0.814
4.879ArgVal: 4.879 ± 1.486
0.929ArgTrp: 0.929 ± 0.375
1.626ArgTyr: 1.626 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
4.879SerAla: 4.879 ± 1.327
1.162SerCys: 1.162 ± 0.425
3.717SerAsp: 3.717 ± 0.602
5.112SerGlu: 5.112 ± 1.692
4.182SerPhe: 4.182 ± 0.716
5.809SerGly: 5.809 ± 0.782
1.859SerHis: 1.859 ± 0.694
3.717SerIle: 3.717 ± 0.692
5.344SerLys: 5.344 ± 1.264
10.455SerLeu: 10.455 ± 1.41
1.626SerMet: 1.626 ± 0.685
2.323SerAsn: 2.323 ± 0.817
3.95SerPro: 3.95 ± 0.655
1.394SerGln: 1.394 ± 0.463
4.182SerArg: 4.182 ± 0.723
9.526SerSer: 9.526 ± 2.152
6.041SerThr: 6.041 ± 1.07
6.738SerVal: 6.738 ± 1.386
1.162SerTrp: 1.162 ± 0.464
2.323SerTyr: 2.323 ± 0.677
0.0SerXaa: 0.0 ± 0.0
Thr
3.253ThrAla: 3.253 ± 0.576
0.929ThrCys: 0.929 ± 0.383
2.323ThrAsp: 2.323 ± 0.888
2.323ThrGlu: 2.323 ± 0.628
4.182ThrPhe: 4.182 ± 0.952
3.02ThrGly: 3.02 ± 0.861
0.929ThrHis: 0.929 ± 0.383
2.091ThrIle: 2.091 ± 0.301
3.02ThrLys: 3.02 ± 0.985
5.112ThrLeu: 5.112 ± 1.263
0.465ThrMet: 0.465 ± 0.204
3.253ThrAsn: 3.253 ± 0.439
3.717ThrPro: 3.717 ± 0.594
2.788ThrGln: 2.788 ± 0.861
2.091ThrArg: 2.091 ± 0.513
6.97ThrSer: 6.97 ± 2.125
4.414ThrThr: 4.414 ± 0.739
6.273ThrVal: 6.273 ± 0.774
0.465ThrTrp: 0.465 ± 0.415
2.788ThrTyr: 2.788 ± 0.516
0.0ThrXaa: 0.0 ± 0.0
Val
6.273ValAla: 6.273 ± 0.929
1.626ValCys: 1.626 ± 0.325
5.112ValAsp: 5.112 ± 1.757
6.738ValGlu: 6.738 ± 0.875
4.647ValPhe: 4.647 ± 1.669
4.414ValGly: 4.414 ± 1.177
0.929ValHis: 0.929 ± 0.409
2.556ValIle: 2.556 ± 1.108
5.112ValLys: 5.112 ± 1.052
6.273ValLeu: 6.273 ± 1.016
1.162ValMet: 1.162 ± 0.576
5.112ValAsn: 5.112 ± 1.14
3.485ValPro: 3.485 ± 0.751
2.323ValGln: 2.323 ± 0.837
5.112ValArg: 5.112 ± 1.15
7.203ValSer: 7.203 ± 1.528
3.717ValThr: 3.717 ± 0.985
9.294ValVal: 9.294 ± 1.516
0.232ValTrp: 0.232 ± 0.139
1.626ValTyr: 1.626 ± 0.68
0.0ValXaa: 0.0 ± 0.0
Trp
0.465TrpAla: 0.465 ± 0.221
0.232TrpCys: 0.232 ± 0.139
0.465TrpAsp: 0.465 ± 0.382
0.0TrpGlu: 0.0 ± 0.0
0.697TrpPhe: 0.697 ± 0.331
0.465TrpGly: 0.465 ± 0.467
0.0TrpHis: 0.0 ± 0.0
1.394TrpIle: 1.394 ± 0.417
0.465TrpLys: 0.465 ± 0.278
1.394TrpLeu: 1.394 ± 0.482
0.0TrpMet: 0.0 ± 0.0
0.232TrpAsn: 0.232 ± 0.139
0.232TrpPro: 0.232 ± 0.276
0.465TrpGln: 0.465 ± 0.278
0.697TrpArg: 0.697 ± 0.233
1.394TrpSer: 1.394 ± 0.589
0.232TrpThr: 0.232 ± 0.139
1.626TrpVal: 1.626 ± 0.408
0.0TrpTrp: 0.0 ± 0.0
0.465TrpTyr: 0.465 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.091TyrAla: 2.091 ± 0.543
1.162TyrCys: 1.162 ± 0.245
1.626TyrAsp: 1.626 ± 0.36
1.859TyrGlu: 1.859 ± 1.375
0.697TyrPhe: 0.697 ± 0.419
1.626TyrGly: 1.626 ± 0.927
1.859TyrHis: 1.859 ± 0.604
1.859TyrIle: 1.859 ± 1.004
2.091TyrLys: 2.091 ± 0.385
3.02TyrLeu: 3.02 ± 0.638
1.394TyrMet: 1.394 ± 0.365
2.323TyrAsn: 2.323 ± 0.254
0.697TyrPro: 0.697 ± 0.333
0.929TyrGln: 0.929 ± 0.765
2.556TyrArg: 2.556 ± 0.795
1.859TyrSer: 1.859 ± 0.414
1.859TyrThr: 1.859 ± 0.318
2.788TyrVal: 2.788 ± 0.774
0.0TyrTrp: 0.0 ± 0.0
1.859TyrTyr: 1.859 ± 0.419
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4305 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski