Amino acid dipepetide frequency for Pneumovirus dog/Bari/100-12/ITA/2012

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.158AlaAla: 4.158 ± 2.125
1.313AlaCys: 1.313 ± 0.889
2.626AlaAsp: 2.626 ± 0.627
4.376AlaGlu: 4.376 ± 1.528
1.313AlaPhe: 1.313 ± 0.659
5.47AlaGly: 5.47 ± 1.979
0.875AlaHis: 0.875 ± 0.292
3.72AlaIle: 3.72 ± 1.157
3.063AlaLys: 3.063 ± 0.844
5.47AlaLeu: 5.47 ± 1.239
1.751AlaMet: 1.751 ± 0.692
2.407AlaAsn: 2.407 ± 0.566
2.626AlaPro: 2.626 ± 1.086
0.875AlaGln: 0.875 ± 0.447
2.845AlaArg: 2.845 ± 1.164
1.969AlaSer: 1.969 ± 0.509
1.969AlaThr: 1.969 ± 0.658
4.376AlaVal: 4.376 ± 1.787
0.438AlaTrp: 0.438 ± 0.521
1.313AlaTyr: 1.313 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.875CysAla: 0.875 ± 0.578
0.656CysCys: 0.656 ± 0.42
1.094CysAsp: 1.094 ± 0.612
0.875CysGlu: 0.875 ± 0.448
1.094CysPhe: 1.094 ± 0.51
0.875CysGly: 0.875 ± 0.423
1.094CysHis: 1.094 ± 0.723
1.094CysIle: 1.094 ± 0.579
2.188CysLys: 2.188 ± 0.69
1.532CysLeu: 1.532 ± 0.683
0.0CysMet: 0.0 ± 0.0
1.969CysAsn: 1.969 ± 1.034
1.094CysPro: 1.094 ± 0.476
0.438CysGln: 0.438 ± 0.252
0.438CysArg: 0.438 ± 0.274
2.407CysSer: 2.407 ± 0.794
1.751CysThr: 1.751 ± 0.432
1.751CysVal: 1.751 ± 0.812
0.219CysTrp: 0.219 ± 0.246
0.875CysTyr: 0.875 ± 0.366
0.0CysXaa: 0.0 ± 0.0
Asp
3.282AspAla: 3.282 ± 1.544
1.094AspCys: 1.094 ± 1.099
4.158AspAsp: 4.158 ± 1.057
2.188AspGlu: 2.188 ± 0.753
1.532AspPhe: 1.532 ± 0.397
1.313AspGly: 1.313 ± 0.721
1.094AspHis: 1.094 ± 0.503
4.158AspIle: 4.158 ± 1.035
3.939AspLys: 3.939 ± 0.799
5.908AspLeu: 5.908 ± 0.791
1.313AspMet: 1.313 ± 0.457
2.407AspAsn: 2.407 ± 0.428
2.407AspPro: 2.407 ± 0.559
1.313AspGln: 1.313 ± 0.775
1.751AspArg: 1.751 ± 0.967
3.282AspSer: 3.282 ± 0.863
4.376AspThr: 4.376 ± 0.725
3.501AspVal: 3.501 ± 1.279
0.438AspTrp: 0.438 ± 0.363
2.626AspTyr: 2.626 ± 0.831
0.0AspXaa: 0.0 ± 0.0
Glu
3.063GluAla: 3.063 ± 1.009
1.094GluCys: 1.094 ± 0.309
2.188GluAsp: 2.188 ± 1.308
3.72GluGlu: 3.72 ± 2.707
2.626GluPhe: 2.626 ± 0.819
1.751GluGly: 1.751 ± 0.869
1.532GluHis: 1.532 ± 0.39
3.282GluIle: 3.282 ± 0.642
3.939GluLys: 3.939 ± 1.571
6.565GluLeu: 6.565 ± 0.98
1.313GluMet: 1.313 ± 0.465
1.969GluAsn: 1.969 ± 0.482
1.532GluPro: 1.532 ± 0.837
1.969GluGln: 1.969 ± 0.707
2.845GluArg: 2.845 ± 0.924
3.501GluSer: 3.501 ± 0.81
3.063GluThr: 3.063 ± 0.986
3.282GluVal: 3.282 ± 0.627
0.875GluTrp: 0.875 ± 0.351
1.532GluTyr: 1.532 ± 0.587
0.0GluXaa: 0.0 ± 0.0
Phe
0.875PheAla: 0.875 ± 0.533
0.875PheCys: 0.875 ± 0.538
1.532PheAsp: 1.532 ± 0.556
1.751PheGlu: 1.751 ± 0.669
1.094PhePhe: 1.094 ± 0.683
1.094PheGly: 1.094 ± 0.328
1.532PheHis: 1.532 ± 0.59
3.063PheIle: 3.063 ± 0.697
1.532PheLys: 1.532 ± 0.431
4.595PheLeu: 4.595 ± 1.055
0.875PheMet: 0.875 ± 0.375
2.845PheAsn: 2.845 ± 0.821
1.969PhePro: 1.969 ± 0.537
1.532PheGln: 1.532 ± 0.404
1.751PheArg: 1.751 ± 0.892
2.407PheSer: 2.407 ± 0.613
2.188PheThr: 2.188 ± 0.729
1.969PheVal: 1.969 ± 0.839
0.219PheTrp: 0.219 ± 0.137
1.969PheTyr: 1.969 ± 0.408
0.0PheXaa: 0.0 ± 0.0
Gly
2.407GlyAla: 2.407 ± 0.882
1.532GlyCys: 1.532 ± 0.751
1.751GlyAsp: 1.751 ± 0.614
2.407GlyGlu: 2.407 ± 0.393
1.969GlyPhe: 1.969 ± 0.538
2.845GlyGly: 2.845 ± 0.705
1.969GlyHis: 1.969 ± 0.524
2.407GlyIle: 2.407 ± 0.597
1.969GlyLys: 1.969 ± 0.46
7.221GlyLeu: 7.221 ± 1.802
1.313GlyMet: 1.313 ± 0.692
1.751GlyAsn: 1.751 ± 0.386
1.751GlyPro: 1.751 ± 0.664
1.313GlyGln: 1.313 ± 0.789
2.407GlyArg: 2.407 ± 0.633
3.939GlySer: 3.939 ± 0.649
1.969GlyThr: 1.969 ± 0.61
5.252GlyVal: 5.252 ± 0.796
0.656GlyTrp: 0.656 ± 0.461
2.188GlyTyr: 2.188 ± 0.548
0.0GlyXaa: 0.0 ± 0.0
His
1.094HisAla: 1.094 ± 0.495
0.438HisCys: 0.438 ± 0.274
1.313HisAsp: 1.313 ± 0.623
1.313HisGlu: 1.313 ± 0.337
0.875HisPhe: 0.875 ± 0.548
1.094HisGly: 1.094 ± 0.535
1.094HisHis: 1.094 ± 0.536
1.751HisIle: 1.751 ± 0.541
1.969HisLys: 1.969 ± 0.81
1.751HisLeu: 1.751 ± 0.843
1.532HisMet: 1.532 ± 0.768
2.188HisAsn: 2.188 ± 0.954
1.094HisPro: 1.094 ± 0.521
0.219HisGln: 0.219 ± 0.137
1.094HisArg: 1.094 ± 0.41
1.313HisSer: 1.313 ± 0.367
0.875HisThr: 0.875 ± 0.323
0.875HisVal: 0.875 ± 0.533
1.313HisTrp: 1.313 ± 0.617
0.875HisTyr: 0.875 ± 0.501
0.0HisXaa: 0.0 ± 0.0
Ile
3.282IleAla: 3.282 ± 0.612
1.313IleCys: 1.313 ± 0.735
3.501IleAsp: 3.501 ± 0.831
4.595IleGlu: 4.595 ± 0.993
1.969IlePhe: 1.969 ± 0.68
3.501IleGly: 3.501 ± 0.7
1.532IleHis: 1.532 ± 0.833
5.908IleIle: 5.908 ± 1.426
3.282IleLys: 3.282 ± 0.927
7.659IleLeu: 7.659 ± 0.948
2.845IleMet: 2.845 ± 0.821
5.47IleAsn: 5.47 ± 0.75
2.407IlePro: 2.407 ± 0.562
2.626IleGln: 2.626 ± 0.609
4.814IleArg: 4.814 ± 1.059
5.47IleSer: 5.47 ± 1.319
5.033IleThr: 5.033 ± 0.951
2.845IleVal: 2.845 ± 0.868
0.438IleTrp: 0.438 ± 0.274
0.656IleTyr: 0.656 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
3.939LysAla: 3.939 ± 0.789
0.875LysCys: 0.875 ± 0.326
3.501LysAsp: 3.501 ± 1.165
3.72LysGlu: 3.72 ± 1.058
3.063LysPhe: 3.063 ± 1.362
2.626LysGly: 2.626 ± 0.678
2.188LysHis: 2.188 ± 0.655
3.939LysIle: 3.939 ± 0.916
3.72LysLys: 3.72 ± 0.72
6.783LysLeu: 6.783 ± 1.13
0.875LysMet: 0.875 ± 0.482
2.845LysAsn: 2.845 ± 0.853
3.72LysPro: 3.72 ± 1.33
1.969LysGln: 1.969 ± 0.557
3.063LysArg: 3.063 ± 0.994
5.47LysSer: 5.47 ± 1.958
3.063LysThr: 3.063 ± 0.732
4.814LysVal: 4.814 ± 1.264
0.438LysTrp: 0.438 ± 0.274
2.845LysTyr: 2.845 ± 0.659
0.0LysXaa: 0.0 ± 0.0
Leu
6.127LeuAla: 6.127 ± 1.535
0.875LeuCys: 0.875 ± 0.478
5.908LeuAsp: 5.908 ± 0.87
5.033LeuGlu: 5.033 ± 1.484
3.282LeuPhe: 3.282 ± 0.838
5.252LeuGly: 5.252 ± 0.763
3.501LeuHis: 3.501 ± 1.118
7.659LeuIle: 7.659 ± 0.934
10.066LeuLys: 10.066 ± 1.724
9.19LeuLeu: 9.19 ± 1.722
1.751LeuMet: 1.751 ± 0.896
6.346LeuAsn: 6.346 ± 1.208
4.595LeuPro: 4.595 ± 0.635
3.939LeuGln: 3.939 ± 1.37
4.814LeuArg: 4.814 ± 0.93
12.254LeuSer: 12.254 ± 2.229
9.628LeuThr: 9.628 ± 1.24
4.595LeuVal: 4.595 ± 1.042
0.219LeuTrp: 0.219 ± 0.137
2.845LeuTyr: 2.845 ± 0.894
0.0LeuXaa: 0.0 ± 0.0
Met
1.532MetAla: 1.532 ± 0.661
0.875MetCys: 0.875 ± 0.5
1.532MetAsp: 1.532 ± 0.557
1.313MetGlu: 1.313 ± 0.945
1.094MetPhe: 1.094 ± 0.366
1.532MetGly: 1.532 ± 0.77
0.0MetHis: 0.0 ± 0.0
1.751MetIle: 1.751 ± 0.416
1.313MetLys: 1.313 ± 0.485
3.501MetLeu: 3.501 ± 0.679
0.875MetMet: 0.875 ± 0.41
0.656MetAsn: 0.656 ± 0.483
0.875MetPro: 0.875 ± 0.431
1.751MetGln: 1.751 ± 0.442
0.656MetArg: 0.656 ± 0.285
3.063MetSer: 3.063 ± 0.745
1.751MetThr: 1.751 ± 0.662
1.532MetVal: 1.532 ± 0.581
0.219MetTrp: 0.219 ± 0.137
1.094MetTyr: 1.094 ± 0.468
0.0MetXaa: 0.0 ± 0.0
Asn
3.282AsnAla: 3.282 ± 0.911
1.751AsnCys: 1.751 ± 0.857
1.751AsnAsp: 1.751 ± 0.343
1.532AsnGlu: 1.532 ± 0.575
2.845AsnPhe: 2.845 ± 0.457
1.969AsnGly: 1.969 ± 1.059
1.094AsnHis: 1.094 ± 0.414
4.595AsnIle: 4.595 ± 0.968
5.033AsnLys: 5.033 ± 0.952
7.44AsnLeu: 7.44 ± 1.643
2.626AsnMet: 2.626 ± 0.572
3.063AsnAsn: 3.063 ± 0.689
1.751AsnPro: 1.751 ± 0.49
2.407AsnGln: 2.407 ± 0.493
3.72AsnArg: 3.72 ± 1.379
4.158AsnSer: 4.158 ± 1.061
3.501AsnThr: 3.501 ± 1.084
3.282AsnVal: 3.282 ± 1.051
0.656AsnTrp: 0.656 ± 0.308
2.626AsnTyr: 2.626 ± 0.697
0.0AsnXaa: 0.0 ± 0.0
Pro
1.969ProAla: 1.969 ± 0.792
0.656ProCys: 0.656 ± 0.348
3.282ProAsp: 3.282 ± 0.658
1.532ProGlu: 1.532 ± 1.172
0.656ProPhe: 0.656 ± 0.35
1.751ProGly: 1.751 ± 0.566
0.438ProHis: 0.438 ± 0.274
1.969ProIle: 1.969 ± 0.485
3.72ProLys: 3.72 ± 0.501
2.626ProLeu: 2.626 ± 0.632
1.532ProMet: 1.532 ± 0.545
3.063ProAsn: 3.063 ± 0.731
3.501ProPro: 3.501 ± 2.091
1.532ProGln: 1.532 ± 0.553
1.532ProArg: 1.532 ± 0.877
3.501ProSer: 3.501 ± 1.073
3.939ProThr: 3.939 ± 1.615
2.626ProVal: 2.626 ± 0.869
0.875ProTrp: 0.875 ± 0.548
2.407ProTyr: 2.407 ± 1.003
0.0ProXaa: 0.0 ± 0.0
Gln
2.626GlnAla: 2.626 ± 0.769
0.875GlnCys: 0.875 ± 0.579
1.532GlnAsp: 1.532 ± 0.59
2.188GlnGlu: 2.188 ± 0.595
1.751GlnPhe: 1.751 ± 0.884
1.751GlnGly: 1.751 ± 0.624
0.875GlnHis: 0.875 ± 0.393
1.532GlnIle: 1.532 ± 0.593
1.532GlnLys: 1.532 ± 0.368
3.72GlnLeu: 3.72 ± 0.735
0.438GlnMet: 0.438 ± 0.25
1.532GlnAsn: 1.532 ± 0.638
0.875GlnPro: 0.875 ± 0.437
1.094GlnGln: 1.094 ± 0.39
1.532GlnArg: 1.532 ± 0.454
2.407GlnSer: 2.407 ± 0.636
1.313GlnThr: 1.313 ± 0.814
2.188GlnVal: 2.188 ± 0.856
0.0GlnTrp: 0.0 ± 0.0
1.532GlnTyr: 1.532 ± 0.433
0.0GlnXaa: 0.0 ± 0.0
Arg
3.282ArgAla: 3.282 ± 1.287
1.094ArgCys: 1.094 ± 0.483
3.72ArgAsp: 3.72 ± 0.854
3.282ArgGlu: 3.282 ± 1.054
1.751ArgPhe: 1.751 ± 0.632
2.407ArgGly: 2.407 ± 0.613
0.656ArgHis: 0.656 ± 0.271
3.063ArgIle: 3.063 ± 0.747
1.969ArgLys: 1.969 ± 0.367
3.939ArgLeu: 3.939 ± 1.034
0.656ArgMet: 0.656 ± 0.508
3.501ArgAsn: 3.501 ± 0.932
1.751ArgPro: 1.751 ± 0.772
2.188ArgGln: 2.188 ± 0.701
2.407ArgArg: 2.407 ± 0.572
3.72ArgSer: 3.72 ± 0.52
3.282ArgThr: 3.282 ± 0.749
2.407ArgVal: 2.407 ± 0.687
0.875ArgTrp: 0.875 ± 0.326
1.969ArgTyr: 1.969 ± 0.814
0.0ArgXaa: 0.0 ± 0.0
Ser
3.063SerAla: 3.063 ± 0.668
2.407SerCys: 2.407 ± 0.979
2.626SerAsp: 2.626 ± 1.017
3.72SerGlu: 3.72 ± 0.929
3.72SerPhe: 3.72 ± 0.883
3.72SerGly: 3.72 ± 0.715
0.438SerHis: 0.438 ± 0.263
4.595SerIle: 4.595 ± 1.055
4.814SerLys: 4.814 ± 1.085
10.722SerLeu: 10.722 ± 2.294
1.532SerMet: 1.532 ± 0.725
6.565SerAsn: 6.565 ± 1.217
2.188SerPro: 2.188 ± 0.504
1.532SerGln: 1.532 ± 0.456
3.72SerArg: 3.72 ± 1.248
8.972SerSer: 8.972 ± 1.551
7.221SerThr: 7.221 ± 1.069
7.002SerVal: 7.002 ± 1.034
0.875SerTrp: 0.875 ± 0.366
3.939SerTyr: 3.939 ± 0.523
0.0SerXaa: 0.0 ± 0.0
Thr
3.939ThrAla: 3.939 ± 1.304
1.094ThrCys: 1.094 ± 0.608
4.376ThrAsp: 4.376 ± 0.97
2.407ThrGlu: 2.407 ± 0.548
1.751ThrPhe: 1.751 ± 0.618
3.72ThrGly: 3.72 ± 0.7
1.532ThrHis: 1.532 ± 0.515
3.72ThrIle: 3.72 ± 0.973
3.72ThrLys: 3.72 ± 0.714
4.814ThrLeu: 4.814 ± 1.214
2.188ThrMet: 2.188 ± 0.477
4.814ThrAsn: 4.814 ± 0.754
2.845ThrPro: 2.845 ± 1.231
2.188ThrGln: 2.188 ± 0.619
4.376ThrArg: 4.376 ± 1.077
6.346ThrSer: 6.346 ± 1.19
4.814ThrThr: 4.814 ± 1.869
4.595ThrVal: 4.595 ± 1.865
0.656ThrTrp: 0.656 ± 0.35
2.188ThrTyr: 2.188 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
2.626ValAla: 2.626 ± 0.581
2.407ValCys: 2.407 ± 0.625
3.282ValAsp: 3.282 ± 0.698
3.501ValGlu: 3.501 ± 1.141
1.969ValPhe: 1.969 ± 0.479
4.158ValGly: 4.158 ± 0.653
1.094ValHis: 1.094 ± 0.463
5.908ValIle: 5.908 ± 1.381
2.845ValLys: 2.845 ± 0.939
7.659ValLeu: 7.659 ± 1.749
2.188ValMet: 2.188 ± 0.942
2.188ValAsn: 2.188 ± 0.418
2.845ValPro: 2.845 ± 0.772
1.751ValGln: 1.751 ± 0.689
1.532ValArg: 1.532 ± 0.555
5.689ValSer: 5.689 ± 1.212
4.158ValThr: 4.158 ± 0.831
5.033ValVal: 5.033 ± 0.97
0.656ValTrp: 0.656 ± 0.393
2.845ValTyr: 2.845 ± 0.693
0.219ValXaa: 0.219 ± 0.275
Trp
0.0TrpAla: 0.0 ± 0.0
0.438TrpCys: 0.438 ± 0.25
0.438TrpAsp: 0.438 ± 0.274
0.219TrpGlu: 0.219 ± 0.28
0.438TrpPhe: 0.438 ± 0.334
0.438TrpGly: 0.438 ± 0.394
0.438TrpHis: 0.438 ± 0.234
1.094TrpIle: 1.094 ± 0.503
1.094TrpLys: 1.094 ± 0.365
1.751TrpLeu: 1.751 ± 0.724
0.219TrpMet: 0.219 ± 0.248
0.438TrpAsn: 0.438 ± 0.459
0.438TrpPro: 0.438 ± 0.263
0.219TrpGln: 0.219 ± 0.371
0.0TrpArg: 0.0 ± 0.0
0.656TrpSer: 0.656 ± 0.347
0.438TrpThr: 0.438 ± 0.274
1.532TrpVal: 1.532 ± 0.575
0.0TrpTrp: 0.0 ± 0.0
0.875TrpTyr: 0.875 ± 0.395
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.532TyrAla: 1.532 ± 0.868
0.656TyrCys: 0.656 ± 0.294
1.969TyrAsp: 1.969 ± 0.446
1.969TyrGlu: 1.969 ± 0.443
0.875TyrPhe: 0.875 ± 0.395
1.751TyrGly: 1.751 ± 0.568
1.094TyrHis: 1.094 ± 0.366
3.939TyrIle: 3.939 ± 0.662
1.751TyrLys: 1.751 ± 0.553
5.033TyrLeu: 5.033 ± 1.108
0.875TyrMet: 0.875 ± 0.404
3.063TyrAsn: 3.063 ± 1.437
2.845TyrPro: 2.845 ± 0.824
0.438TyrGln: 0.438 ± 0.404
2.845TyrArg: 2.845 ± 1.564
2.626TyrSer: 2.626 ± 0.643
1.751TyrThr: 1.751 ± 0.64
1.094TyrVal: 1.094 ± 0.455
1.094TyrTrp: 1.094 ± 0.811
0.875TyrTyr: 0.875 ± 0.408
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.219XaaSer: 0.219 ± 0.275
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (4571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski