Amino acid dipepetide frequency for Pepo aphid-borne yellows virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.579AlaAla: 5.579 ± 0.942
3.282AlaCys: 3.282 ± 1.224
2.297AlaAsp: 2.297 ± 0.794
7.22AlaGlu: 7.22 ± 1.068
2.297AlaPhe: 2.297 ± 0.534
4.923AlaGly: 4.923 ± 1.771
0.328AlaHis: 0.328 ± 0.89
2.626AlaIle: 2.626 ± 0.764
2.954AlaLys: 2.954 ± 0.412
6.236AlaLeu: 6.236 ± 1.72
0.985AlaMet: 0.985 ± 0.451
1.969AlaAsn: 1.969 ± 0.533
4.595AlaPro: 4.595 ± 1.027
3.282AlaGln: 3.282 ± 0.714
4.595AlaArg: 4.595 ± 1.275
6.564AlaSer: 6.564 ± 1.254
2.954AlaThr: 2.954 ± 0.869
5.579AlaVal: 5.579 ± 1.257
2.297AlaTrp: 2.297 ± 0.581
1.313AlaTyr: 1.313 ± 0.583
0.0AlaXaa: 0.0 ± 0.0
Cys
1.313CysAla: 1.313 ± 0.632
0.0CysCys: 0.0 ± 0.0
2.297CysAsp: 2.297 ± 0.68
0.656CysGlu: 0.656 ± 0.503
0.328CysPhe: 0.328 ± 0.482
2.297CysGly: 2.297 ± 0.811
0.0CysHis: 0.0 ± 0.0
0.656CysIle: 0.656 ± 0.455
1.969CysLys: 1.969 ± 0.717
2.626CysLeu: 2.626 ± 1.199
1.313CysMet: 1.313 ± 0.613
0.0CysAsn: 0.0 ± 0.0
1.313CysPro: 1.313 ± 0.581
1.313CysGln: 1.313 ± 0.381
0.328CysArg: 0.328 ± 0.228
0.985CysSer: 0.985 ± 0.541
2.297CysThr: 2.297 ± 1.044
0.985CysVal: 0.985 ± 0.458
0.656CysTrp: 0.656 ± 0.307
0.328CysTyr: 0.328 ± 0.228
0.0CysXaa: 0.0 ± 0.0
Asp
4.595AspAla: 4.595 ± 0.847
0.985AspCys: 0.985 ± 0.458
3.282AspAsp: 3.282 ± 1.244
4.923AspGlu: 4.923 ± 0.948
1.313AspPhe: 1.313 ± 0.485
5.251AspGly: 5.251 ± 1.552
0.328AspHis: 0.328 ± 0.557
2.297AspIle: 2.297 ± 0.974
0.985AspLys: 0.985 ± 0.508
3.282AspLeu: 3.282 ± 1.032
0.328AspMet: 0.328 ± 0.401
3.938AspAsn: 3.938 ± 0.943
3.938AspPro: 3.938 ± 1.285
1.969AspGln: 1.969 ± 0.479
2.626AspArg: 2.626 ± 0.84
2.297AspSer: 2.297 ± 0.482
1.313AspThr: 1.313 ± 0.495
1.313AspVal: 1.313 ± 0.613
0.656AspTrp: 0.656 ± 0.282
1.641AspTyr: 1.641 ± 0.968
0.0AspXaa: 0.0 ± 0.0
Glu
2.954GluAla: 2.954 ± 0.543
0.985GluCys: 0.985 ± 0.429
5.251GluAsp: 5.251 ± 1.788
5.251GluGlu: 5.251 ± 1.498
2.626GluPhe: 2.626 ± 0.963
5.251GluGly: 5.251 ± 1.427
0.656GluHis: 0.656 ± 0.577
1.969GluIle: 1.969 ± 0.669
4.266GluLys: 4.266 ± 1.781
3.938GluLeu: 3.938 ± 1.1
0.656GluMet: 0.656 ± 0.503
3.282GluAsn: 3.282 ± 1.371
1.641GluPro: 1.641 ± 0.695
0.656GluGln: 0.656 ± 0.455
3.282GluArg: 3.282 ± 1.336
3.61GluSer: 3.61 ± 0.817
3.938GluThr: 3.938 ± 0.542
3.282GluVal: 3.282 ± 0.657
1.969GluTrp: 1.969 ± 0.461
4.266GluTyr: 4.266 ± 0.98
0.0GluXaa: 0.0 ± 0.0
Phe
1.313PheAla: 1.313 ± 0.868
0.328PheCys: 0.328 ± 0.228
2.297PheAsp: 2.297 ± 0.483
4.266PheGlu: 4.266 ± 1.28
1.969PhePhe: 1.969 ± 0.92
2.297PheGly: 2.297 ± 1.003
1.313PheHis: 1.313 ± 0.624
2.954PheIle: 2.954 ± 0.808
0.985PheLys: 0.985 ± 0.451
5.579PheLeu: 5.579 ± 2.023
0.0PheMet: 0.0 ± 0.0
2.626PheAsn: 2.626 ± 0.92
1.313PhePro: 1.313 ± 0.485
3.61PheGln: 3.61 ± 0.841
3.282PheArg: 3.282 ± 0.464
4.266PheSer: 4.266 ± 0.755
1.313PheThr: 1.313 ± 0.371
4.266PheVal: 4.266 ± 0.671
0.328PheTrp: 0.328 ± 0.274
0.328PheTyr: 0.328 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
4.595GlyAla: 4.595 ± 1.277
1.313GlyCys: 1.313 ± 0.403
1.969GlyAsp: 1.969 ± 0.625
3.282GlyGlu: 3.282 ± 1.188
1.969GlyPhe: 1.969 ± 1.875
6.564GlyGly: 6.564 ± 1.491
2.954GlyHis: 2.954 ± 0.599
2.954GlyIle: 2.954 ± 1.003
5.907GlyLys: 5.907 ± 1.976
4.595GlyLeu: 4.595 ± 1.306
1.313GlyMet: 1.313 ± 0.926
3.61GlyAsn: 3.61 ± 1.331
4.266GlyPro: 4.266 ± 0.818
2.297GlyGln: 2.297 ± 0.925
4.595GlyArg: 4.595 ± 1.737
10.502GlySer: 10.502 ± 3.384
3.61GlyThr: 3.61 ± 0.421
4.266GlyVal: 4.266 ± 1.044
1.641GlyTrp: 1.641 ± 0.447
1.641GlyTyr: 1.641 ± 0.516
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.969HisCys: 1.969 ± 0.465
0.985HisAsp: 0.985 ± 0.451
1.313HisGlu: 1.313 ± 0.583
1.641HisPhe: 1.641 ± 0.543
0.985HisGly: 0.985 ± 1.058
0.656HisHis: 0.656 ± 0.459
0.328HisIle: 0.328 ± 0.274
0.656HisLys: 0.656 ± 0.455
1.641HisLeu: 1.641 ± 1.336
0.328HisMet: 0.328 ± 0.286
0.0HisAsn: 0.0 ± 0.0
0.656HisPro: 0.656 ± 0.503
0.985HisGln: 0.985 ± 0.438
0.656HisArg: 0.656 ± 0.503
2.297HisSer: 2.297 ± 1.876
0.656HisThr: 0.656 ± 0.307
1.313HisVal: 1.313 ± 0.868
0.328HisTrp: 0.328 ± 0.286
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.266IleAla: 4.266 ± 1.078
0.328IleCys: 0.328 ± 0.274
1.641IleAsp: 1.641 ± 0.739
2.626IleGlu: 2.626 ± 0.559
2.297IlePhe: 2.297 ± 1.289
1.313IleGly: 1.313 ± 0.884
0.328IleHis: 0.328 ± 0.228
1.641IleIle: 1.641 ± 0.516
1.313IleLys: 1.313 ± 0.467
3.282IleLeu: 3.282 ± 1.009
0.985IleMet: 0.985 ± 0.458
2.954IleAsn: 2.954 ± 1.121
4.923IlePro: 4.923 ± 1.257
2.297IleGln: 2.297 ± 0.652
1.641IleArg: 1.641 ± 0.598
3.938IleSer: 3.938 ± 1.709
4.923IleThr: 4.923 ± 2.184
0.656IleVal: 0.656 ± 0.885
0.328IleTrp: 0.328 ± 0.482
1.641IleTyr: 1.641 ± 0.746
0.0IleXaa: 0.0 ± 0.0
Lys
2.954LysAla: 2.954 ± 0.567
0.985LysCys: 0.985 ± 0.458
2.954LysAsp: 2.954 ± 0.64
2.297LysGlu: 2.297 ± 0.777
3.282LysPhe: 3.282 ± 1.482
3.282LysGly: 3.282 ± 1.083
0.985LysHis: 0.985 ± 0.651
2.626LysIle: 2.626 ± 1.798
2.626LysLys: 2.626 ± 0.945
6.564LysLeu: 6.564 ± 1.399
1.641LysMet: 1.641 ± 0.555
2.626LysAsn: 2.626 ± 0.917
2.626LysPro: 2.626 ± 0.744
1.641LysGln: 1.641 ± 0.746
1.641LysArg: 1.641 ± 0.516
4.595LysSer: 4.595 ± 0.928
3.282LysThr: 3.282 ± 1.226
0.985LysVal: 0.985 ± 0.433
1.313LysTrp: 1.313 ± 0.766
0.328LysTyr: 0.328 ± 0.274
0.328LysXaa: 0.328 ± 0.274
Leu
7.877LeuAla: 7.877 ± 2.24
1.969LeuCys: 1.969 ± 0.916
3.938LeuAsp: 3.938 ± 0.395
4.595LeuGlu: 4.595 ± 0.907
4.923LeuPhe: 4.923 ± 1.862
5.251LeuGly: 5.251 ± 1.419
1.969LeuHis: 1.969 ± 1.809
3.938LeuIle: 3.938 ± 0.963
4.923LeuLys: 4.923 ± 1.02
7.22LeuLeu: 7.22 ± 1.941
0.985LeuMet: 0.985 ± 0.521
2.954LeuAsn: 2.954 ± 0.606
3.938LeuPro: 3.938 ± 1.245
4.266LeuGln: 4.266 ± 0.621
5.579LeuArg: 5.579 ± 1.116
9.846LeuSer: 9.846 ± 2.128
6.236LeuThr: 6.236 ± 1.956
3.61LeuVal: 3.61 ± 1.293
2.626LeuTrp: 2.626 ± 0.956
2.954LeuTyr: 2.954 ± 1.253
0.0LeuXaa: 0.0 ± 0.0
Met
1.313MetAla: 1.313 ± 0.485
0.0MetCys: 0.0 ± 0.0
1.313MetAsp: 1.313 ± 1.101
0.656MetGlu: 0.656 ± 0.588
0.0MetPhe: 0.0 ± 0.0
0.328MetGly: 0.328 ± 0.228
0.0MetHis: 0.0 ± 0.0
0.328MetIle: 0.328 ± 0.286
1.313MetLys: 1.313 ± 0.613
2.297MetLeu: 2.297 ± 0.811
0.656MetMet: 0.656 ± 0.573
3.61MetAsn: 3.61 ± 0.753
0.0MetPro: 0.0 ± 0.0
0.328MetGln: 0.328 ± 0.286
0.328MetArg: 0.328 ± 0.482
1.313MetSer: 1.313 ± 0.718
0.656MetThr: 0.656 ± 0.307
2.626MetVal: 2.626 ± 0.886
0.0MetTrp: 0.0 ± 0.0
0.328MetTyr: 0.328 ± 0.274
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.641AsnCys: 1.641 ± 0.746
1.313AsnAsp: 1.313 ± 0.565
1.313AsnGlu: 1.313 ± 0.834
4.923AsnPhe: 4.923 ± 1.451
5.579AsnGly: 5.579 ± 1.866
0.0AsnHis: 0.0 ± 0.0
4.266AsnIle: 4.266 ± 1.413
1.969AsnLys: 1.969 ± 0.585
4.595AsnLeu: 4.595 ± 1.054
0.656AsnMet: 0.656 ± 0.307
1.641AsnAsn: 1.641 ± 0.365
3.61AsnPro: 3.61 ± 1.04
0.656AsnGln: 0.656 ± 0.455
2.954AsnArg: 2.954 ± 0.9
4.923AsnSer: 4.923 ± 0.956
2.626AsnThr: 2.626 ± 0.972
1.969AsnVal: 1.969 ± 0.481
0.0AsnTrp: 0.0 ± 0.0
1.641AsnTyr: 1.641 ± 0.763
0.0AsnXaa: 0.0 ± 0.0
Pro
6.564ProAla: 6.564 ± 0.914
1.313ProCys: 1.313 ± 0.655
3.938ProAsp: 3.938 ± 1.193
3.938ProGlu: 3.938 ± 1.245
2.297ProPhe: 2.297 ± 1.044
5.251ProGly: 5.251 ± 1.295
1.641ProHis: 1.641 ± 0.501
2.297ProIle: 2.297 ± 0.854
2.297ProLys: 2.297 ± 1.108
3.61ProLeu: 3.61 ± 1.272
1.313ProMet: 1.313 ± 0.613
1.313ProAsn: 1.313 ± 0.495
8.533ProPro: 8.533 ± 1.762
3.282ProGln: 3.282 ± 0.926
3.938ProArg: 3.938 ± 2.15
6.236ProSer: 6.236 ± 1.893
2.297ProThr: 2.297 ± 0.494
1.313ProVal: 1.313 ± 0.655
0.656ProTrp: 0.656 ± 0.503
0.985ProTyr: 0.985 ± 0.433
0.0ProXaa: 0.0 ± 0.0
Gln
2.297GlnAla: 2.297 ± 0.655
1.641GlnCys: 1.641 ± 0.504
1.313GlnAsp: 1.313 ± 0.495
3.61GlnGlu: 3.61 ± 1.085
1.969GlnPhe: 1.969 ± 1.495
2.626GlnGly: 2.626 ± 0.485
0.985GlnHis: 0.985 ± 1.058
0.656GlnIle: 0.656 ± 0.307
2.954GlnLys: 2.954 ± 0.709
2.626GlnLeu: 2.626 ± 1.051
0.0GlnMet: 0.0 ± 0.0
2.297GlnAsn: 2.297 ± 0.672
2.626GlnPro: 2.626 ± 0.835
1.641GlnGln: 1.641 ± 0.803
2.626GlnArg: 2.626 ± 0.968
3.282GlnSer: 3.282 ± 0.792
0.656GlnThr: 0.656 ± 0.307
0.656GlnVal: 0.656 ± 0.512
1.313GlnTrp: 1.313 ± 0.547
1.313GlnTyr: 1.313 ± 0.468
0.0GlnXaa: 0.0 ± 0.0
Arg
5.251ArgAla: 5.251 ± 1.33
0.328ArgCys: 0.328 ± 0.274
1.641ArgAsp: 1.641 ± 0.763
1.641ArgGlu: 1.641 ± 0.978
1.641ArgPhe: 1.641 ± 0.549
4.595ArgGly: 4.595 ± 1.614
0.656ArgHis: 0.656 ± 0.524
1.969ArgIle: 1.969 ± 0.855
1.641ArgLys: 1.641 ± 0.584
7.548ArgLeu: 7.548 ± 2.741
0.985ArgMet: 0.985 ± 0.502
2.954ArgAsn: 2.954 ± 0.844
3.61ArgPro: 3.61 ± 1.106
2.954ArgGln: 2.954 ± 1.141
11.815ArgArg: 11.815 ± 5.726
5.251ArgSer: 5.251 ± 2.558
2.297ArgThr: 2.297 ± 1.075
3.938ArgVal: 3.938 ± 1.456
1.969ArgTrp: 1.969 ± 0.893
1.313ArgTyr: 1.313 ± 0.918
0.0ArgXaa: 0.0 ± 0.0
Ser
7.22SerAla: 7.22 ± 2.649
0.985SerCys: 0.985 ± 0.458
3.938SerAsp: 3.938 ± 0.978
3.61SerGlu: 3.61 ± 1.278
4.595SerPhe: 4.595 ± 0.716
9.846SerGly: 9.846 ± 1.455
1.313SerHis: 1.313 ± 0.485
5.251SerIle: 5.251 ± 2.362
3.61SerLys: 3.61 ± 0.533
9.846SerLeu: 9.846 ± 1.435
1.969SerMet: 1.969 ± 0.653
1.969SerAsn: 1.969 ± 0.483
5.907SerPro: 5.907 ± 1.394
2.297SerGln: 2.297 ± 0.672
5.251SerArg: 5.251 ± 1.776
14.44SerSer: 14.44 ± 4.098
5.907SerThr: 5.907 ± 1.089
6.892SerVal: 6.892 ± 1.285
2.954SerTrp: 2.954 ± 1.061
2.626SerTyr: 2.626 ± 0.634
0.0SerXaa: 0.0 ± 0.0
Thr
6.236ThrAla: 6.236 ± 1.653
0.656ThrCys: 0.656 ± 0.503
2.297ThrAsp: 2.297 ± 1.004
2.626ThrGlu: 2.626 ± 0.764
1.969ThrPhe: 1.969 ± 1.275
2.954ThrGly: 2.954 ± 0.831
0.0ThrHis: 0.0 ± 0.0
2.954ThrIle: 2.954 ± 0.606
1.641ThrLys: 1.641 ± 0.615
5.251ThrLeu: 5.251 ± 2.17
1.641ThrMet: 1.641 ± 0.995
2.626ThrAsn: 2.626 ± 1.227
2.626ThrPro: 2.626 ± 0.481
0.328ThrGln: 0.328 ± 0.557
3.61ThrArg: 3.61 ± 0.734
4.266ThrSer: 4.266 ± 1.17
2.954ThrThr: 2.954 ± 0.901
4.595ThrVal: 4.595 ± 1.613
0.0ThrTrp: 0.0 ± 0.0
2.626ThrTyr: 2.626 ± 1.035
0.0ThrXaa: 0.0 ± 0.0
Val
4.923ValAla: 4.923 ± 1.266
0.985ValCys: 0.985 ± 0.458
1.969ValAsp: 1.969 ± 1.02
2.297ValGlu: 2.297 ± 0.77
2.297ValPhe: 2.297 ± 0.534
2.626ValGly: 2.626 ± 1.852
0.985ValHis: 0.985 ± 0.913
1.969ValIle: 1.969 ± 0.916
3.282ValLys: 3.282 ± 0.699
5.251ValLeu: 5.251 ± 1.523
0.328ValMet: 0.328 ± 0.228
2.626ValAsn: 2.626 ± 1.259
5.251ValPro: 5.251 ± 0.848
2.626ValGln: 2.626 ± 0.783
3.938ValArg: 3.938 ± 1.297
5.579ValSer: 5.579 ± 1.07
0.985ValThr: 0.985 ± 0.548
3.282ValVal: 3.282 ± 1.163
1.313ValTrp: 1.313 ± 0.613
1.313ValTyr: 1.313 ± 0.868
0.0ValXaa: 0.0 ± 0.0
Trp
1.313TrpAla: 1.313 ± 0.869
0.985TrpCys: 0.985 ± 0.458
0.985TrpAsp: 0.985 ± 0.433
1.641TrpGlu: 1.641 ± 0.483
0.328TrpPhe: 0.328 ± 0.274
1.641TrpGly: 1.641 ± 0.516
0.985TrpHis: 0.985 ± 0.451
0.656TrpIle: 0.656 ± 0.307
0.656TrpLys: 0.656 ± 0.307
2.297TrpLeu: 2.297 ± 0.925
0.328TrpMet: 0.328 ± 0.228
1.641TrpAsn: 1.641 ± 0.569
0.985TrpPro: 0.985 ± 0.458
0.0TrpGln: 0.0 ± 0.0
0.656TrpArg: 0.656 ± 0.455
2.626TrpSer: 2.626 ± 0.928
1.313TrpThr: 1.313 ± 0.835
1.313TrpVal: 1.313 ± 0.613
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.313TyrAla: 1.313 ± 0.485
0.985TyrCys: 0.985 ± 0.946
1.641TyrAsp: 1.641 ± 0.549
1.969TyrGlu: 1.969 ± 0.855
1.969TyrPhe: 1.969 ± 0.716
1.313TyrGly: 1.313 ± 0.868
1.313TyrHis: 1.313 ± 0.381
0.985TyrIle: 0.985 ± 0.508
3.61TyrLys: 3.61 ± 1.33
1.313TyrLeu: 1.313 ± 1.76
0.656TyrMet: 0.656 ± 0.282
1.641TyrAsn: 1.641 ± 0.578
0.656TyrPro: 0.656 ± 0.282
0.656TyrGln: 0.656 ± 0.778
0.656TyrArg: 0.656 ± 0.455
3.61TyrSer: 3.61 ± 0.563
1.641TyrThr: 1.641 ± 0.739
0.656TyrVal: 0.656 ± 0.307
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.328XaaVal: 0.328 ± 0.274
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3048 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski