Amino acid dipepetide frequency for Influenza A virus (A/Uruguay/716/2007(H3N2))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.481AlaAla: 2.481 ± 0.799
2.481AlaCys: 2.481 ± 1.227
3.309AlaAsp: 3.309 ± 0.664
3.309AlaGlu: 3.309 ± 0.664
1.241AlaPhe: 1.241 ± 0.576
2.895AlaGly: 2.895 ± 1.109
0.0AlaHis: 0.0 ± 0.0
3.722AlaIle: 3.722 ± 0.736
0.827AlaLys: 0.827 ± 0.865
7.444AlaLeu: 7.444 ± 2.171
1.241AlaMet: 1.241 ± 1.298
0.414AlaAsn: 0.414 ± 0.408
2.481AlaPro: 2.481 ± 0.456
2.895AlaGln: 2.895 ± 0.37
0.414AlaArg: 0.414 ± 0.433
5.376AlaSer: 5.376 ± 0.867
4.136AlaThr: 4.136 ± 0.869
2.068AlaVal: 2.068 ± 0.833
1.654AlaTrp: 1.654 ± 0.942
0.827AlaTyr: 0.827 ± 0.409
0.0AlaXaa: 0.0 ± 0.0
Cys
0.414CysAla: 0.414 ± 0.433
0.0CysCys: 0.0 ± 0.0
4.136CysAsp: 4.136 ± 0.586
1.241CysGlu: 1.241 ± 0.576
3.309CysPhe: 3.309 ± 0.436
0.827CysGly: 0.827 ± 0.471
0.827CysHis: 0.827 ± 0.471
6.617CysIle: 6.617 ± 0.872
0.0CysLys: 0.0 ± 0.0
2.481CysLeu: 2.481 ± 0.456
0.414CysMet: 0.414 ± 0.433
2.068CysAsn: 2.068 ± 0.839
1.654CysPro: 1.654 ± 0.218
0.827CysGln: 0.827 ± 0.409
1.241CysArg: 1.241 ± 0.661
1.654CysSer: 1.654 ± 0.942
1.654CysThr: 1.654 ± 0.218
2.895CysVal: 2.895 ± 0.755
0.0CysTrp: 0.0 ± 0.0
2.481CysTyr: 2.481 ± 0.609
0.0CysXaa: 0.0 ± 0.0
Asp
2.068AspAla: 2.068 ± 0.839
0.0AspCys: 0.0 ± 0.0
2.481AspAsp: 2.481 ± 1.055
0.827AspGlu: 0.827 ± 0.409
0.0AspPhe: 0.0 ± 0.0
5.79AspGly: 5.79 ± 0.978
1.654AspHis: 1.654 ± 0.218
4.136AspIle: 4.136 ± 1.533
3.722AspLys: 3.722 ± 0.205
4.963AspLeu: 4.963 ± 2.007
0.827AspMet: 0.827 ± 0.409
4.136AspAsn: 4.136 ± 0.586
3.309AspPro: 3.309 ± 0.664
2.481AspGln: 2.481 ± 1.227
2.068AspArg: 2.068 ± 1.073
4.549AspSer: 4.549 ± 0.706
3.309AspThr: 3.309 ± 1.067
2.895AspVal: 2.895 ± 0.37
0.827AspTrp: 0.827 ± 0.409
0.827AspTyr: 0.827 ± 0.409
0.0AspXaa: 0.0 ± 0.0
Glu
1.654GluAla: 1.654 ± 0.858
2.481GluCys: 2.481 ± 0.609
2.068GluAsp: 2.068 ± 0.822
3.722GluGlu: 3.722 ± 1.978
2.481GluPhe: 2.481 ± 0.456
6.203GluGly: 6.203 ± 1.059
0.0GluHis: 0.0 ± 0.0
4.963GluIle: 4.963 ± 1.687
6.617GluLys: 6.617 ± 1.676
4.549GluLeu: 4.549 ± 0.641
1.241GluMet: 1.241 ± 0.513
4.136GluAsn: 4.136 ± 1.644
1.654GluPro: 1.654 ± 0.942
1.654GluGln: 1.654 ± 1.279
2.068GluArg: 2.068 ± 0.259
2.068GluSer: 2.068 ± 0.444
2.481GluThr: 2.481 ± 1.055
3.722GluVal: 3.722 ± 0.618
0.827GluTrp: 0.827 ± 0.531
1.241GluTyr: 1.241 ± 0.661
0.0GluXaa: 0.0 ± 0.0
Phe
3.722PheAla: 3.722 ± 0.205
0.827PheCys: 0.827 ± 0.471
1.654PheAsp: 1.654 ± 0.218
0.827PheGlu: 0.827 ± 0.409
2.068PhePhe: 2.068 ± 0.444
1.241PheGly: 1.241 ± 0.514
2.068PheHis: 2.068 ± 0.259
3.309PheIle: 3.309 ± 0.436
5.376PheLys: 5.376 ± 0.655
1.241PheLeu: 1.241 ± 0.663
0.827PheMet: 0.827 ± 0.471
2.481PheAsn: 2.481 ± 0.456
0.414PhePro: 0.414 ± 0.311
2.481PheGln: 2.481 ± 1.227
0.827PheArg: 0.827 ± 0.409
3.309PheSer: 3.309 ± 0.436
0.414PheThr: 0.414 ± 0.433
2.068PheVal: 2.068 ± 0.833
0.0PheTrp: 0.0 ± 0.0
0.827PheTyr: 0.827 ± 0.471
0.0PheXaa: 0.0 ± 0.0
Gly
3.722GlyAla: 3.722 ± 1.058
1.241GlyCys: 1.241 ± 0.518
4.549GlyAsp: 4.549 ± 0.625
1.654GlyGlu: 1.654 ± 0.818
4.549GlyPhe: 4.549 ± 1.131
1.654GlyGly: 1.654 ± 0.942
1.654GlyHis: 1.654 ± 0.218
3.309GlyIle: 3.309 ± 0.386
7.031GlyLys: 7.031 ± 0.526
2.895GlyLeu: 2.895 ± 1.684
1.654GlyMet: 1.654 ± 0.818
5.376GlyAsn: 5.376 ± 0.708
0.827GlyPro: 0.827 ± 0.531
1.654GlyGln: 1.654 ± 0.218
4.136GlyArg: 4.136 ± 0.785
7.031GlySer: 7.031 ± 1.207
9.512GlyThr: 9.512 ± 1.091
5.79GlyVal: 5.79 ± 0.396
3.309GlyTrp: 3.309 ± 0.436
3.309GlyTyr: 3.309 ± 0.436
0.0GlyXaa: 0.0 ± 0.0
His
0.827HisAla: 0.827 ± 0.409
0.827HisCys: 0.827 ± 0.471
2.068HisAsp: 2.068 ± 0.458
0.414HisGlu: 0.414 ± 0.433
1.241HisPhe: 1.241 ± 0.661
1.654HisGly: 1.654 ± 0.69
1.654HisHis: 1.654 ± 0.818
0.0HisIle: 0.0 ± 0.0
0.827HisLys: 0.827 ± 0.409
2.068HisLeu: 2.068 ± 0.444
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.241HisPro: 1.241 ± 0.514
2.481HisGln: 2.481 ± 1.227
0.827HisArg: 0.827 ± 0.865
1.654HisSer: 1.654 ± 0.942
1.654HisThr: 1.654 ± 0.218
1.654HisVal: 1.654 ± 0.942
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.963IleAla: 4.963 ± 1.518
3.309IleCys: 3.309 ± 0.436
3.309IleAsp: 3.309 ± 1.636
6.203IleGlu: 6.203 ± 1.059
2.481IlePhe: 2.481 ± 0.456
5.79IleGly: 5.79 ± 1.472
0.0IleHis: 0.0 ± 0.0
2.895IleIle: 2.895 ± 0.755
1.654IleLys: 1.654 ± 0.218
8.271IleLeu: 8.271 ± 0.967
2.895IleMet: 2.895 ± 0.821
5.79IleAsn: 5.79 ± 1.673
2.481IlePro: 2.481 ± 1.227
0.827IleGln: 0.827 ± 0.409
5.79IleArg: 5.79 ± 0.657
4.136IleSer: 4.136 ± 0.586
7.858IleThr: 7.858 ± 0.911
7.858IleVal: 7.858 ± 2.494
1.654IleTrp: 1.654 ± 0.218
2.895IleTyr: 2.895 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
3.722LysAla: 3.722 ± 1.149
2.895LysCys: 2.895 ± 0.54
2.481LysAsp: 2.481 ± 0.609
4.136LysGlu: 4.136 ± 1.086
2.481LysPhe: 2.481 ± 1.227
3.722LysGly: 3.722 ± 0.205
0.414LysHis: 0.414 ± 0.408
4.963LysIle: 4.963 ± 0.654
1.654LysLys: 1.654 ± 0.818
6.617LysLeu: 6.617 ± 1.09
0.414LysMet: 0.414 ± 0.376
2.068LysAsn: 2.068 ± 0.959
2.481LysPro: 2.481 ± 0.456
4.136LysGln: 4.136 ± 0.586
2.068LysArg: 2.068 ± 0.833
4.963LysSer: 4.963 ± 0.654
3.722LysThr: 3.722 ± 1.582
0.827LysVal: 0.827 ± 0.471
0.827LysTrp: 0.827 ± 0.409
1.654LysTyr: 1.654 ± 0.818
0.0LysXaa: 0.0 ± 0.0
Leu
2.481LeuAla: 2.481 ± 0.679
3.309LeuCys: 3.309 ± 0.838
2.068LeuAsp: 2.068 ± 0.444
4.136LeuGlu: 4.136 ± 1.46
3.309LeuPhe: 3.309 ± 0.842
6.203LeuGly: 6.203 ± 0.75
2.068LeuHis: 2.068 ± 1.073
6.203LeuIle: 6.203 ± 0.637
4.963LeuLys: 4.963 ± 1.722
6.203LeuLeu: 6.203 ± 1.459
2.068LeuMet: 2.068 ± 0.959
4.549LeuAsn: 4.549 ± 1.131
0.827LeuPro: 0.827 ± 0.409
1.654LeuGln: 1.654 ± 0.903
2.895LeuArg: 2.895 ± 0.37
3.722LeuSer: 3.722 ± 1.149
4.963LeuThr: 4.963 ± 0.887
5.79LeuVal: 5.79 ± 0.657
2.895LeuTrp: 2.895 ± 0.54
1.654LeuTyr: 1.654 ± 0.691
0.0LeuXaa: 0.0 ± 0.0
Met
0.414MetAla: 0.414 ± 0.433
0.0MetCys: 0.0 ± 0.0
0.414MetAsp: 0.414 ± 0.433
1.241MetGlu: 1.241 ± 1.298
0.0MetPhe: 0.0 ± 0.0
2.895MetGly: 2.895 ± 1.109
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.827MetLys: 0.827 ± 0.409
0.827MetLeu: 0.827 ± 0.471
0.0MetMet: 0.0 ± 0.0
2.481MetAsn: 2.481 ± 0.609
1.654MetPro: 1.654 ± 0.218
1.241MetGln: 1.241 ± 0.576
2.481MetArg: 2.481 ± 0.799
0.827MetSer: 0.827 ± 0.531
0.827MetThr: 0.827 ± 0.471
2.068MetVal: 2.068 ± 1.258
0.827MetTrp: 0.827 ± 0.409
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.963AsnAla: 4.963 ± 1.077
1.654AsnCys: 1.654 ± 0.818
6.203AsnAsp: 6.203 ± 0.79
4.549AsnGlu: 4.549 ± 0.641
0.0AsnPhe: 0.0 ± 0.0
8.271AsnGly: 8.271 ± 1.864
0.0AsnHis: 0.0 ± 0.0
5.376AsnIle: 5.376 ± 0.912
1.654AsnLys: 1.654 ± 0.818
2.068AsnLeu: 2.068 ± 0.259
0.414AsnMet: 0.414 ± 0.433
6.203AsnAsn: 6.203 ± 0.409
2.068AsnPro: 2.068 ± 0.959
2.481AsnGln: 2.481 ± 0.609
6.617AsnArg: 6.617 ± 0.737
6.617AsnSer: 6.617 ± 1.026
2.068AsnThr: 2.068 ± 0.259
3.309AsnVal: 3.309 ± 0.838
3.309AsnTrp: 3.309 ± 0.436
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
0.827ProAla: 0.827 ± 0.409
0.0ProCys: 0.0 ± 0.0
2.481ProAsp: 2.481 ± 0.609
1.241ProGlu: 1.241 ± 0.518
2.481ProPhe: 2.481 ± 0.609
3.309ProGly: 3.309 ± 0.838
0.827ProHis: 0.827 ± 0.409
3.309ProIle: 3.309 ± 0.664
1.654ProLys: 1.654 ± 0.942
1.654ProLeu: 1.654 ± 1.279
0.0ProMet: 0.0 ± 0.0
7.031ProAsn: 7.031 ± 0.747
0.827ProPro: 0.827 ± 0.471
1.654ProGln: 1.654 ± 0.218
4.136ProArg: 4.136 ± 0.586
2.481ProSer: 2.481 ± 1.211
0.827ProThr: 0.827 ± 0.471
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
2.895ProTyr: 2.895 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
3.722GlnAla: 3.722 ± 1.542
1.654GlnCys: 1.654 ± 0.218
0.827GlnAsp: 0.827 ± 0.409
0.827GlnGlu: 0.827 ± 0.471
0.827GlnPhe: 0.827 ± 0.471
0.827GlnGly: 0.827 ± 0.471
2.068GlnHis: 2.068 ± 0.259
6.203GlnIle: 6.203 ± 1.939
2.895GlnLys: 2.895 ± 0.37
0.827GlnLeu: 0.827 ± 0.409
1.654GlnMet: 1.654 ± 1.731
4.963GlnAsn: 4.963 ± 1.919
0.0GlnPro: 0.0 ± 0.0
1.241GlnGln: 1.241 ± 0.518
1.241GlnArg: 1.241 ± 1.298
0.827GlnSer: 0.827 ± 0.409
2.895GlnThr: 2.895 ± 0.37
2.481GlnVal: 2.481 ± 1.413
0.0GlnTrp: 0.0 ± 0.0
0.827GlnTyr: 0.827 ± 0.471
0.0GlnXaa: 0.0 ± 0.0
Arg
0.414ArgAla: 0.414 ± 0.433
2.895ArgCys: 2.895 ± 0.755
2.481ArgAsp: 2.481 ± 0.437
2.481ArgGlu: 2.481 ± 0.381
1.654ArgPhe: 1.654 ± 0.858
4.136ArgGly: 4.136 ± 0.322
1.241ArgHis: 1.241 ± 0.514
3.309ArgIle: 3.309 ± 1.636
2.481ArgLys: 2.481 ± 1.055
5.376ArgLeu: 5.376 ± 0.972
1.241ArgMet: 1.241 ± 1.298
4.549ArgAsn: 4.549 ± 0.641
2.068ArgPro: 2.068 ± 0.259
1.654ArgGln: 1.654 ± 0.903
1.654ArgArg: 1.654 ± 0.858
6.203ArgSer: 6.203 ± 1.939
3.309ArgThr: 3.309 ± 1.884
0.827ArgVal: 0.827 ± 0.409
0.0ArgTrp: 0.0 ± 0.0
1.654ArgTyr: 1.654 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
2.895SerAla: 2.895 ± 1.397
5.376SerCys: 5.376 ± 1.087
1.241SerAsp: 1.241 ± 0.518
5.79SerGlu: 5.79 ± 0.74
3.309SerPhe: 3.309 ± 0.838
8.685SerGly: 8.685 ± 1.351
1.654SerHis: 1.654 ± 0.69
7.858SerIle: 7.858 ± 0.749
4.136SerLys: 4.136 ± 1.533
3.309SerLeu: 3.309 ± 0.386
0.414SerMet: 0.414 ± 0.408
4.963SerAsn: 4.963 ± 1.219
2.068SerPro: 2.068 ± 0.259
1.654SerGln: 1.654 ± 0.858
3.309SerArg: 3.309 ± 0.838
13.234SerSer: 13.234 ± 1.405
6.617SerThr: 6.617 ± 1.256
1.654SerVal: 1.654 ± 0.942
1.654SerTrp: 1.654 ± 0.942
2.895SerTyr: 2.895 ± 0.37
0.0SerXaa: 0.0 ± 0.0
Thr
2.895ThrAla: 2.895 ± 0.56
1.241ThrCys: 1.241 ± 0.576
2.895ThrAsp: 2.895 ± 0.37
4.136ThrGlu: 4.136 ± 1.037
1.241ThrPhe: 1.241 ± 0.576
6.203ThrGly: 6.203 ± 0.409
1.241ThrHis: 1.241 ± 0.514
9.098ThrIle: 9.098 ± 1.386
4.549ThrLys: 4.549 ± 0.323
6.617ThrLeu: 6.617 ± 0.548
0.827ThrMet: 0.827 ± 0.409
3.722ThrAsn: 3.722 ± 0.736
2.895ThrPro: 2.895 ± 0.755
2.481ThrGln: 2.481 ± 0.456
2.068ThrArg: 2.068 ± 0.259
3.309ThrSer: 3.309 ± 1.067
3.722ThrThr: 3.722 ± 1.728
4.549ThrVal: 4.549 ± 0.625
0.0ThrTrp: 0.0 ± 0.0
3.309ThrTyr: 3.309 ± 0.664
0.0ThrXaa: 0.0 ± 0.0
Val
3.722ValAla: 3.722 ± 1.24
4.549ValCys: 4.549 ± 2.312
3.722ValAsp: 3.722 ± 1.178
6.617ValGlu: 6.617 ± 0.382
2.481ValPhe: 2.481 ± 0.679
0.827ValGly: 0.827 ± 0.471
2.895ValHis: 2.895 ± 0.862
2.481ValIle: 2.481 ± 0.609
3.722ValLys: 3.722 ± 0.736
1.654ValLeu: 1.654 ± 0.903
1.654ValMet: 1.654 ± 0.942
0.827ValAsn: 0.827 ± 0.409
4.549ValPro: 4.549 ± 0.793
2.068ValGln: 2.068 ± 1.258
2.068ValArg: 2.068 ± 0.533
4.549ValSer: 4.549 ± 1.627
4.549ValThr: 4.549 ± 0.323
2.068ValVal: 2.068 ± 1.073
0.827ValTrp: 0.827 ± 0.471
2.068ValTyr: 2.068 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
1.654TrpAla: 1.654 ± 0.218
0.0TrpCys: 0.0 ± 0.0
0.827TrpAsp: 0.827 ± 0.409
0.827TrpGlu: 0.827 ± 0.409
0.0TrpPhe: 0.0 ± 0.0
1.241TrpGly: 1.241 ± 0.518
0.0TrpHis: 0.0 ± 0.0
2.068TrpIle: 2.068 ± 0.839
0.827TrpLys: 0.827 ± 0.471
2.068TrpLeu: 2.068 ± 0.259
0.827TrpMet: 0.827 ± 0.471
0.0TrpAsn: 0.0 ± 0.0
0.827TrpPro: 0.827 ± 0.471
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
4.136TrpSer: 4.136 ± 1.533
2.481TrpThr: 2.481 ± 0.456
0.827TrpVal: 0.827 ± 0.471
0.0TrpTrp: 0.0 ± 0.0
0.827TrpTyr: 0.827 ± 0.409
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.654TyrAla: 1.654 ± 0.818
0.0TyrCys: 0.0 ± 0.0
1.654TyrAsp: 1.654 ± 0.818
1.654TyrGlu: 1.654 ± 0.942
0.827TyrPhe: 0.827 ± 0.409
2.481TyrGly: 2.481 ± 0.456
0.827TyrHis: 0.827 ± 0.409
1.654TyrIle: 1.654 ± 0.818
0.827TyrLys: 0.827 ± 0.409
0.827TyrLeu: 0.827 ± 0.471
0.0TyrMet: 0.0 ± 0.0
2.068TyrAsn: 2.068 ± 0.259
3.309TyrPro: 3.309 ± 0.436
1.241TyrGln: 1.241 ± 0.576
3.722TyrArg: 3.722 ± 0.949
2.068TyrSer: 2.068 ± 0.259
0.0TyrThr: 0.0 ± 0.0
4.549TyrVal: 4.549 ± 0.625
0.827TyrTrp: 0.827 ± 0.409
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski