Amino acid dipepetide frequency for Influenza A virus (A/Canada goose/New York/475813-2/2007(H5N2))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.81AlaAla: 3.81 ± 1.011
0.896AlaCys: 0.896 ± 0.412
2.465AlaAsp: 2.465 ± 0.49
3.362AlaGlu: 3.362 ± 0.868
1.569AlaPhe: 1.569 ± 0.603
4.258AlaGly: 4.258 ± 1.056
0.672AlaHis: 0.672 ± 0.377
4.931AlaIle: 4.931 ± 1.238
2.241AlaLys: 2.241 ± 0.546
5.379AlaLeu: 5.379 ± 0.907
2.689AlaMet: 2.689 ± 0.741
2.689AlaAsn: 2.689 ± 0.585
2.241AlaPro: 2.241 ± 0.441
1.793AlaGln: 1.793 ± 0.492
2.913AlaArg: 2.913 ± 0.698
5.155AlaSer: 5.155 ± 1.121
5.155AlaThr: 5.155 ± 0.651
2.689AlaVal: 2.689 ± 0.754
1.121AlaTrp: 1.121 ± 0.561
1.345AlaTyr: 1.345 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.448CysAla: 0.448 ± 0.291
0.224CysCys: 0.224 ± 0.208
0.672CysAsp: 0.672 ± 0.608
0.896CysGlu: 0.896 ± 0.35
1.345CysPhe: 1.345 ± 0.55
0.448CysGly: 0.448 ± 0.444
0.896CysHis: 0.896 ± 0.249
1.793CysIle: 1.793 ± 0.788
0.896CysLys: 0.896 ± 0.348
1.121CysLeu: 1.121 ± 0.516
0.896CysMet: 0.896 ± 0.29
0.896CysAsn: 0.896 ± 0.428
0.672CysPro: 0.672 ± 0.452
0.448CysGln: 0.448 ± 0.293
1.569CysArg: 1.569 ± 0.735
2.241CysSer: 2.241 ± 0.776
0.672CysThr: 0.672 ± 0.3
1.345CysVal: 1.345 ± 0.623
0.224CysTrp: 0.224 ± 0.195
0.672CysTyr: 0.672 ± 0.478
0.0CysXaa: 0.0 ± 0.0
Asp
3.138AspAla: 3.138 ± 0.569
1.345AspCys: 1.345 ± 0.37
1.793AspAsp: 1.793 ± 0.704
3.138AspGlu: 3.138 ± 0.684
2.241AspPhe: 2.241 ± 0.828
3.81AspGly: 3.81 ± 0.991
0.224AspHis: 0.224 ± 0.195
2.017AspIle: 2.017 ± 0.626
2.017AspLys: 2.017 ± 0.653
3.586AspLeu: 3.586 ± 0.743
1.569AspMet: 1.569 ± 0.455
3.138AspAsn: 3.138 ± 0.649
3.138AspPro: 3.138 ± 0.948
1.793AspGln: 1.793 ± 0.848
2.689AspArg: 2.689 ± 0.778
3.81AspSer: 3.81 ± 0.974
2.017AspThr: 2.017 ± 0.576
3.362AspVal: 3.362 ± 0.516
0.448AspTrp: 0.448 ± 0.32
1.569AspTyr: 1.569 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
2.465GluAla: 2.465 ± 0.477
1.569GluCys: 1.569 ± 0.783
4.706GluAsp: 4.706 ± 0.828
6.948GluGlu: 6.948 ± 1.119
2.465GluPhe: 2.465 ± 0.724
4.258GluGly: 4.258 ± 1.201
0.672GluHis: 0.672 ± 0.288
5.155GluIle: 5.155 ± 0.781
5.827GluLys: 5.827 ± 1.256
5.827GluLeu: 5.827 ± 0.595
2.241GluMet: 2.241 ± 0.655
3.362GluAsn: 3.362 ± 1.014
2.465GluPro: 2.465 ± 1.06
4.258GluGln: 4.258 ± 1.179
4.706GluArg: 4.706 ± 1.028
6.499GluSer: 6.499 ± 1.381
4.034GluThr: 4.034 ± 0.682
4.706GluVal: 4.706 ± 1.227
0.896GluTrp: 0.896 ± 0.379
2.465GluTyr: 2.465 ± 0.584
0.0GluXaa: 0.0 ± 0.0
Phe
2.017PheAla: 2.017 ± 0.621
0.224PheCys: 0.224 ± 0.222
1.345PheAsp: 1.345 ± 0.417
5.379PheGlu: 5.379 ± 1.303
1.793PhePhe: 1.793 ± 0.513
1.569PheGly: 1.569 ± 0.292
1.121PheHis: 1.121 ± 0.47
2.017PheIle: 2.017 ± 0.631
0.896PheLys: 0.896 ± 0.461
4.034PheLeu: 4.034 ± 0.679
0.896PheMet: 0.896 ± 0.387
1.793PheAsn: 1.793 ± 0.688
0.672PhePro: 0.672 ± 0.407
2.465PheGln: 2.465 ± 0.815
1.793PheArg: 1.793 ± 0.311
3.586PheSer: 3.586 ± 0.452
2.241PheThr: 2.241 ± 0.518
2.465PheVal: 2.465 ± 0.774
0.896PheTrp: 0.896 ± 0.419
1.121PheTyr: 1.121 ± 0.404
0.0PheXaa: 0.0 ± 0.0
Gly
2.465GlyAla: 2.465 ± 0.607
0.448GlyCys: 0.448 ± 0.245
3.81GlyAsp: 3.81 ± 0.485
3.586GlyGlu: 3.586 ± 1.454
2.689GlyPhe: 2.689 ± 0.432
3.362GlyGly: 3.362 ± 0.664
0.672GlyHis: 0.672 ± 0.439
4.482GlyIle: 4.482 ± 0.782
4.258GlyLys: 4.258 ± 0.487
4.706GlyLeu: 4.706 ± 0.955
2.241GlyMet: 2.241 ± 0.433
2.913GlyAsn: 2.913 ± 0.853
3.138GlyPro: 3.138 ± 0.875
2.017GlyGln: 2.017 ± 0.427
5.155GlyArg: 5.155 ± 0.769
4.258GlySer: 4.258 ± 1.3
6.275GlyThr: 6.275 ± 1.162
4.706GlyVal: 4.706 ± 0.438
1.569GlyTrp: 1.569 ± 0.735
2.465GlyTyr: 2.465 ± 0.685
0.0GlyXaa: 0.0 ± 0.0
His
0.896HisAla: 0.896 ± 0.395
0.224HisCys: 0.224 ± 0.197
0.896HisAsp: 0.896 ± 0.585
1.121HisGlu: 1.121 ± 0.377
1.569HisPhe: 1.569 ± 0.376
0.896HisGly: 0.896 ± 0.408
0.448HisHis: 0.448 ± 0.405
1.345HisIle: 1.345 ± 0.669
1.121HisLys: 1.121 ± 0.388
1.345HisLeu: 1.345 ± 0.487
0.224HisMet: 0.224 ± 0.195
0.448HisAsn: 0.448 ± 0.405
1.121HisPro: 1.121 ± 0.429
0.448HisGln: 0.448 ± 0.242
1.569HisArg: 1.569 ± 0.596
1.569HisSer: 1.569 ± 0.392
0.224HisThr: 0.224 ± 0.271
0.672HisVal: 0.672 ± 0.378
0.0HisTrp: 0.0 ± 0.0
0.224HisTyr: 0.224 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.706IleAla: 4.706 ± 1.035
2.017IleCys: 2.017 ± 0.479
3.362IleAsp: 3.362 ± 0.987
6.275IleGlu: 6.275 ± 1.654
1.121IlePhe: 1.121 ± 0.328
4.706IleGly: 4.706 ± 0.823
0.896IleHis: 0.896 ± 0.477
4.258IleIle: 4.258 ± 1.169
3.586IleLys: 3.586 ± 0.85
6.499IleLeu: 6.499 ± 1.545
2.913IleMet: 2.913 ± 0.669
3.81IleAsn: 3.81 ± 0.779
2.465IlePro: 2.465 ± 0.615
1.793IleGln: 1.793 ± 0.403
5.379IleArg: 5.379 ± 1.268
2.465IleSer: 2.465 ± 0.365
4.034IleThr: 4.034 ± 0.776
3.81IleVal: 3.81 ± 0.682
0.896IleTrp: 0.896 ± 0.407
1.345IleTyr: 1.345 ± 0.467
0.0IleXaa: 0.0 ± 0.0
Lys
4.034LysAla: 4.034 ± 0.828
1.569LysCys: 1.569 ± 0.499
2.913LysAsp: 2.913 ± 0.363
5.155LysGlu: 5.155 ± 0.911
1.345LysPhe: 1.345 ± 0.624
3.138LysGly: 3.138 ± 0.544
0.896LysHis: 0.896 ± 0.285
2.913LysIle: 2.913 ± 0.539
3.81LysLys: 3.81 ± 1.842
5.155LysLeu: 5.155 ± 0.968
3.138LysMet: 3.138 ± 0.703
2.241LysAsn: 2.241 ± 0.661
0.672LysPro: 0.672 ± 0.43
1.793LysGln: 1.793 ± 0.832
4.482LysArg: 4.482 ± 1.475
3.362LysSer: 3.362 ± 0.712
3.138LysThr: 3.138 ± 1.241
2.689LysVal: 2.689 ± 0.654
2.017LysTrp: 2.017 ± 0.689
1.569LysTyr: 1.569 ± 0.362
0.0LysXaa: 0.0 ± 0.0
Leu
4.931LeuAla: 4.931 ± 0.721
0.896LeuCys: 0.896 ± 0.469
1.569LeuAsp: 1.569 ± 0.691
6.723LeuGlu: 6.723 ± 1.363
2.017LeuPhe: 2.017 ± 0.498
4.482LeuGly: 4.482 ± 0.713
1.345LeuHis: 1.345 ± 0.534
6.723LeuIle: 6.723 ± 0.994
6.948LeuLys: 6.948 ± 1.305
6.275LeuLeu: 6.275 ± 1.41
2.689LeuMet: 2.689 ± 0.429
4.482LeuAsn: 4.482 ± 1.074
3.586LeuPro: 3.586 ± 0.824
3.362LeuGln: 3.362 ± 0.6
5.379LeuArg: 5.379 ± 1.623
4.931LeuSer: 4.931 ± 0.668
5.603LeuThr: 5.603 ± 1.5
3.362LeuVal: 3.362 ± 0.706
1.121LeuTrp: 1.121 ± 0.393
2.913LeuTyr: 2.913 ± 1.019
0.0LeuXaa: 0.0 ± 0.0
Met
3.586MetAla: 3.586 ± 0.649
1.345MetCys: 1.345 ± 0.63
3.138MetAsp: 3.138 ± 1.101
5.603MetGlu: 5.603 ± 1.016
1.121MetPhe: 1.121 ± 0.818
2.241MetGly: 2.241 ± 0.763
0.224MetHis: 0.224 ± 0.195
2.689MetIle: 2.689 ± 0.552
2.241MetLys: 2.241 ± 0.898
1.569MetLeu: 1.569 ± 0.414
1.569MetMet: 1.569 ± 0.635
0.672MetAsn: 0.672 ± 0.478
0.896MetPro: 0.896 ± 0.365
1.569MetGln: 1.569 ± 0.504
2.689MetArg: 2.689 ± 0.809
2.465MetSer: 2.465 ± 0.462
1.793MetThr: 1.793 ± 0.656
3.362MetVal: 3.362 ± 1.005
0.224MetTrp: 0.224 ± 0.195
0.672MetTyr: 0.672 ± 0.237
0.0MetXaa: 0.0 ± 0.0
Asn
3.81AsnAla: 3.81 ± 0.869
0.448AsnCys: 0.448 ± 0.293
2.689AsnAsp: 2.689 ± 0.379
4.258AsnGlu: 4.258 ± 1.006
2.017AsnPhe: 2.017 ± 0.405
4.931AsnGly: 4.931 ± 1.472
0.672AsnHis: 0.672 ± 0.411
2.017AsnIle: 2.017 ± 0.625
2.913AsnLys: 2.913 ± 0.588
3.138AsnLeu: 3.138 ± 0.624
2.241AsnMet: 2.241 ± 0.703
3.138AsnAsn: 3.138 ± 1.24
4.258AsnPro: 4.258 ± 0.6
1.793AsnGln: 1.793 ± 0.619
2.913AsnArg: 2.913 ± 0.784
4.034AsnSer: 4.034 ± 0.868
3.586AsnThr: 3.586 ± 0.771
2.913AsnVal: 2.913 ± 1.442
1.121AsnTrp: 1.121 ± 0.709
0.672AsnTyr: 0.672 ± 0.376
0.0AsnXaa: 0.0 ± 0.0
Pro
2.465ProAla: 2.465 ± 1.157
0.448ProCys: 0.448 ± 0.28
1.345ProAsp: 1.345 ± 0.547
3.138ProGlu: 3.138 ± 0.587
2.241ProPhe: 2.241 ± 0.378
2.241ProGly: 2.241 ± 0.412
0.672ProHis: 0.672 ± 0.43
2.465ProIle: 2.465 ± 0.441
2.913ProLys: 2.913 ± 0.556
4.034ProLeu: 4.034 ± 0.724
1.121ProMet: 1.121 ± 0.53
2.913ProAsn: 2.913 ± 0.809
1.345ProPro: 1.345 ± 0.551
1.121ProGln: 1.121 ± 0.446
2.465ProArg: 2.465 ± 0.719
3.138ProSer: 3.138 ± 0.812
1.569ProThr: 1.569 ± 0.572
1.569ProVal: 1.569 ± 0.561
0.224ProTrp: 0.224 ± 0.197
0.896ProTyr: 0.896 ± 0.428
0.0ProXaa: 0.0 ± 0.0
Gln
2.241GlnAla: 2.241 ± 0.916
0.672GlnCys: 0.672 ± 0.308
1.345GlnAsp: 1.345 ± 0.359
1.793GlnGlu: 1.793 ± 0.635
0.896GlnPhe: 0.896 ± 0.362
2.913GlnGly: 2.913 ± 0.867
0.448GlnHis: 0.448 ± 0.329
3.586GlnIle: 3.586 ± 0.48
2.465GlnLys: 2.465 ± 0.904
3.362GlnLeu: 3.362 ± 0.97
2.465GlnMet: 2.465 ± 0.884
2.465GlnAsn: 2.465 ± 0.531
0.672GlnPro: 0.672 ± 0.421
1.121GlnGln: 1.121 ± 0.336
3.81GlnArg: 3.81 ± 0.872
3.138GlnSer: 3.138 ± 0.887
2.465GlnThr: 2.465 ± 0.958
2.465GlnVal: 2.465 ± 0.822
0.448GlnTrp: 0.448 ± 0.39
0.672GlnTyr: 0.672 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
4.034ArgAla: 4.034 ± 0.85
1.121ArgCys: 1.121 ± 0.524
3.362ArgAsp: 3.362 ± 0.804
2.913ArgGlu: 2.913 ± 0.747
2.465ArgPhe: 2.465 ± 0.729
6.499ArgGly: 6.499 ± 1.094
0.672ArgHis: 0.672 ± 0.344
4.034ArgIle: 4.034 ± 0.597
2.465ArgLys: 2.465 ± 0.717
4.931ArgLeu: 4.931 ± 0.64
4.034ArgMet: 4.034 ± 1.371
5.827ArgAsn: 5.827 ± 0.808
2.689ArgPro: 2.689 ± 0.525
2.913ArgGln: 2.913 ± 0.639
5.827ArgArg: 5.827 ± 1.093
4.482ArgSer: 4.482 ± 1.078
6.275ArgThr: 6.275 ± 0.713
2.913ArgVal: 2.913 ± 1.042
0.224ArgTrp: 0.224 ± 0.223
1.793ArgTyr: 1.793 ± 0.524
0.0ArgXaa: 0.0 ± 0.0
Ser
3.81SerAla: 3.81 ± 0.86
1.793SerCys: 1.793 ± 0.866
2.913SerAsp: 2.913 ± 0.742
3.81SerGlu: 3.81 ± 0.735
4.482SerPhe: 4.482 ± 0.82
5.603SerGly: 5.603 ± 1.275
1.569SerHis: 1.569 ± 0.761
5.827SerIle: 5.827 ± 0.919
3.362SerLys: 3.362 ± 1.072
6.051SerLeu: 6.051 ± 1.222
2.689SerMet: 2.689 ± 0.912
4.931SerAsn: 4.931 ± 1.739
2.017SerPro: 2.017 ± 0.505
4.258SerGln: 4.258 ± 0.832
3.138SerArg: 3.138 ± 0.651
9.413SerSer: 9.413 ± 1.7
4.706SerThr: 4.706 ± 0.956
3.81SerVal: 3.81 ± 0.748
1.569SerTrp: 1.569 ± 0.632
1.793SerTyr: 1.793 ± 0.594
0.0SerXaa: 0.0 ± 0.0
Thr
3.81ThrAla: 3.81 ± 0.599
0.896ThrCys: 0.896 ± 0.32
2.017ThrAsp: 2.017 ± 0.63
4.258ThrGlu: 4.258 ± 1.101
2.913ThrPhe: 2.913 ± 0.575
4.258ThrGly: 4.258 ± 1.035
2.017ThrHis: 2.017 ± 0.686
4.931ThrIle: 4.931 ± 0.91
4.034ThrLys: 4.034 ± 0.644
4.931ThrLeu: 4.931 ± 1.168
2.017ThrMet: 2.017 ± 0.593
2.465ThrAsn: 2.465 ± 0.557
1.569ThrPro: 1.569 ± 0.376
2.465ThrGln: 2.465 ± 0.796
5.155ThrArg: 5.155 ± 0.648
3.362ThrSer: 3.362 ± 0.644
3.586ThrThr: 3.586 ± 1.145
4.706ThrVal: 4.706 ± 1.149
0.448ThrTrp: 0.448 ± 0.225
3.138ThrTyr: 3.138 ± 0.929
0.0ThrXaa: 0.0 ± 0.0
Val
3.138ValAla: 3.138 ± 0.685
2.017ValCys: 2.017 ± 1.128
3.81ValAsp: 3.81 ± 1.007
3.81ValGlu: 3.81 ± 0.696
2.241ValPhe: 2.241 ± 0.587
2.913ValGly: 2.913 ± 0.71
1.121ValHis: 1.121 ± 0.543
1.793ValIle: 1.793 ± 0.691
2.689ValLys: 2.689 ± 0.831
4.931ValLeu: 4.931 ± 1.514
2.241ValMet: 2.241 ± 0.556
3.362ValAsn: 3.362 ± 0.647
2.689ValPro: 2.689 ± 0.692
2.017ValGln: 2.017 ± 0.798
4.482ValArg: 4.482 ± 1.079
5.155ValSer: 5.155 ± 0.739
2.689ValThr: 2.689 ± 0.697
3.138ValVal: 3.138 ± 0.618
1.121ValTrp: 1.121 ± 0.572
1.345ValTyr: 1.345 ± 0.292
0.0ValXaa: 0.0 ± 0.0
Trp
0.672TrpAla: 0.672 ± 0.3
0.0TrpCys: 0.0 ± 0.0
0.448TrpAsp: 0.448 ± 0.258
1.569TrpGlu: 1.569 ± 0.594
0.672TrpPhe: 0.672 ± 0.32
0.672TrpGly: 0.672 ± 0.249
0.448TrpHis: 0.448 ± 0.323
1.345TrpIle: 1.345 ± 0.441
0.672TrpLys: 0.672 ± 0.43
1.121TrpLeu: 1.121 ± 0.532
1.121TrpMet: 1.121 ± 0.393
0.672TrpAsn: 0.672 ± 0.344
0.448TrpPro: 0.448 ± 0.28
0.224TrpGln: 0.224 ± 0.203
0.896TrpArg: 0.896 ± 0.519
1.793TrpSer: 1.793 ± 0.951
1.345TrpThr: 1.345 ± 0.47
0.448TrpVal: 0.448 ± 0.28
0.672TrpTrp: 0.672 ± 0.271
0.224TrpTyr: 0.224 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.896TyrAla: 0.896 ± 0.391
0.224TyrCys: 0.224 ± 0.197
2.241TyrAsp: 2.241 ± 0.679
1.793TyrGlu: 1.793 ± 0.57
1.121TyrPhe: 1.121 ± 0.371
2.017TyrGly: 2.017 ± 0.384
0.672TyrHis: 0.672 ± 0.608
2.017TyrIle: 2.017 ± 0.45
0.672TyrLys: 0.672 ± 0.249
1.345TyrLeu: 1.345 ± 0.416
0.448TyrMet: 0.448 ± 0.242
1.569TyrAsn: 1.569 ± 0.613
1.569TyrPro: 1.569 ± 0.756
1.793TyrGln: 1.793 ± 0.414
2.465TyrArg: 2.465 ± 1.056
2.689TyrSer: 2.689 ± 0.423
1.569TyrThr: 1.569 ± 0.768
1.569TyrVal: 1.569 ± 0.641
0.224TyrTrp: 0.224 ± 0.178
0.224TyrTyr: 0.224 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4463 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski