Amino acid dipepetide frequency for Influenza A virus (A/swine/Korea/S452/2004(H9N2))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.766AlaAla: 3.766 ± 0.916
0.941AlaCys: 0.941 ± 0.409
2.589AlaAsp: 2.589 ± 0.717
2.824AlaGlu: 2.824 ± 0.904
2.118AlaPhe: 2.118 ± 0.709
4.707AlaGly: 4.707 ± 1.006
0.706AlaHis: 0.706 ± 0.414
4.707AlaIle: 4.707 ± 1.043
2.118AlaLys: 2.118 ± 0.583
5.648AlaLeu: 5.648 ± 1.0
2.824AlaMet: 2.824 ± 0.743
3.06AlaAsn: 3.06 ± 0.663
2.353AlaPro: 2.353 ± 0.368
1.647AlaGln: 1.647 ± 0.511
3.766AlaArg: 3.766 ± 0.558
4.942AlaSer: 4.942 ± 1.032
5.648AlaThr: 5.648 ± 0.925
2.824AlaVal: 2.824 ± 0.709
1.177AlaTrp: 1.177 ± 0.616
0.941AlaTyr: 0.941 ± 0.312
0.0AlaXaa: 0.0 ± 0.0
Cys
0.941CysAla: 0.941 ± 0.3
0.235CysCys: 0.235 ± 0.179
0.471CysAsp: 0.471 ± 0.463
0.941CysGlu: 0.941 ± 0.365
1.412CysPhe: 1.412 ± 0.517
0.235CysGly: 0.235 ± 0.228
0.941CysHis: 0.941 ± 0.275
1.412CysIle: 1.412 ± 0.485
1.177CysLys: 1.177 ± 0.324
1.177CysLeu: 1.177 ± 0.503
0.941CysMet: 0.941 ± 0.312
0.941CysAsn: 0.941 ± 0.385
0.471CysPro: 0.471 ± 0.281
0.471CysGln: 0.471 ± 0.281
1.177CysArg: 1.177 ± 0.628
2.118CysSer: 2.118 ± 0.873
0.706CysThr: 0.706 ± 0.331
1.883CysVal: 1.883 ± 0.849
0.235CysTrp: 0.235 ± 0.191
0.706CysTyr: 0.706 ± 0.457
0.0CysXaa: 0.0 ± 0.0
Asp
3.06AspAla: 3.06 ± 0.47
1.177AspCys: 1.177 ± 0.385
2.118AspAsp: 2.118 ± 0.382
3.295AspGlu: 3.295 ± 0.7
1.883AspPhe: 1.883 ± 0.928
3.295AspGly: 3.295 ± 1.115
0.941AspHis: 0.941 ± 0.314
1.883AspIle: 1.883 ± 0.453
2.353AspLys: 2.353 ± 0.856
3.06AspLeu: 3.06 ± 0.484
1.647AspMet: 1.647 ± 0.385
2.824AspAsn: 2.824 ± 0.898
3.295AspPro: 3.295 ± 1.009
2.118AspGln: 2.118 ± 0.936
2.353AspArg: 2.353 ± 0.365
3.06AspSer: 3.06 ± 0.868
2.824AspThr: 2.824 ± 0.846
3.06AspVal: 3.06 ± 0.583
0.706AspTrp: 0.706 ± 0.357
1.647AspTyr: 1.647 ± 0.545
0.0AspXaa: 0.0 ± 0.0
Glu
2.353GluAla: 2.353 ± 0.548
1.177GluCys: 1.177 ± 0.719
4.472GluAsp: 4.472 ± 0.762
6.354GluGlu: 6.354 ± 1.15
1.883GluPhe: 1.883 ± 0.537
4.942GluGly: 4.942 ± 1.116
0.941GluHis: 0.941 ± 0.423
4.707GluIle: 4.707 ± 1.022
5.648GluLys: 5.648 ± 1.503
6.354GluLeu: 6.354 ± 0.694
2.118GluMet: 2.118 ± 0.653
3.766GluAsn: 3.766 ± 0.978
2.824GluPro: 2.824 ± 1.249
3.53GluGln: 3.53 ± 0.986
4.236GluArg: 4.236 ± 0.851
5.648GluSer: 5.648 ± 1.14
4.001GluThr: 4.001 ± 0.669
4.472GluVal: 4.472 ± 1.244
0.941GluTrp: 0.941 ± 0.38
1.412GluTyr: 1.412 ± 0.387
0.0GluXaa: 0.0 ± 0.0
Phe
2.353PheAla: 2.353 ± 0.514
0.235PheCys: 0.235 ± 0.228
1.177PheAsp: 1.177 ± 0.419
4.236PheGlu: 4.236 ± 1.085
1.412PhePhe: 1.412 ± 0.45
1.883PheGly: 1.883 ± 0.412
0.941PheHis: 0.941 ± 0.313
1.883PheIle: 1.883 ± 0.646
1.177PheLys: 1.177 ± 0.467
3.766PheLeu: 3.766 ± 0.921
1.177PheMet: 1.177 ± 0.397
2.353PheAsn: 2.353 ± 0.695
0.941PhePro: 0.941 ± 0.404
2.353PheGln: 2.353 ± 0.606
1.412PheArg: 1.412 ± 0.406
3.53PheSer: 3.53 ± 0.464
2.353PheThr: 2.353 ± 0.593
2.353PheVal: 2.353 ± 0.727
0.471PheTrp: 0.471 ± 0.292
1.177PheTyr: 1.177 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
3.06GlyAla: 3.06 ± 0.844
0.471GlyCys: 0.471 ± 0.244
2.589GlyAsp: 2.589 ± 0.48
3.53GlyGlu: 3.53 ± 1.607
3.06GlyPhe: 3.06 ± 0.571
4.001GlyGly: 4.001 ± 0.927
0.941GlyHis: 0.941 ± 0.535
4.236GlyIle: 4.236 ± 0.78
3.766GlyLys: 3.766 ± 0.511
5.413GlyLeu: 5.413 ± 0.996
2.589GlyMet: 2.589 ± 0.615
3.295GlyAsn: 3.295 ± 0.77
3.295GlyPro: 3.295 ± 0.652
2.353GlyGln: 2.353 ± 0.489
6.119GlyArg: 6.119 ± 0.738
4.707GlySer: 4.707 ± 1.639
6.825GlyThr: 6.825 ± 1.209
4.942GlyVal: 4.942 ± 0.269
1.177GlyTrp: 1.177 ± 0.506
2.118GlyTyr: 2.118 ± 0.628
0.0GlyXaa: 0.0 ± 0.0
His
0.706HisAla: 0.706 ± 0.264
0.235HisCys: 0.235 ± 0.233
0.706HisAsp: 0.706 ± 0.457
1.647HisGlu: 1.647 ± 0.329
1.177HisPhe: 1.177 ± 0.324
1.412HisGly: 1.412 ± 0.472
0.235HisHis: 0.235 ± 0.232
2.118HisIle: 2.118 ± 0.833
1.647HisLys: 1.647 ± 0.643
0.941HisLeu: 0.941 ± 0.468
0.471HisMet: 0.471 ± 0.24
0.471HisAsn: 0.471 ± 0.463
1.177HisPro: 1.177 ± 0.493
0.471HisGln: 0.471 ± 0.203
1.177HisArg: 1.177 ± 0.5
1.177HisSer: 1.177 ± 0.471
0.235HisThr: 0.235 ± 0.254
0.471HisVal: 0.471 ± 0.317
0.0HisTrp: 0.0 ± 0.0
0.235HisTyr: 0.235 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
3.53IleAla: 3.53 ± 0.771
1.647IleCys: 1.647 ± 0.466
4.236IleAsp: 4.236 ± 1.097
7.06IleGlu: 7.06 ± 2.171
1.647IlePhe: 1.647 ± 0.415
4.707IleGly: 4.707 ± 0.788
1.177IleHis: 1.177 ± 0.368
4.001IleIle: 4.001 ± 1.073
4.236IleLys: 4.236 ± 0.665
6.354IleLeu: 6.354 ± 1.75
2.118IleMet: 2.118 ± 0.346
3.53IleAsn: 3.53 ± 0.617
2.353IlePro: 2.353 ± 0.685
2.118IleGln: 2.118 ± 0.475
4.707IleArg: 4.707 ± 1.027
2.353IleSer: 2.353 ± 0.448
4.001IleThr: 4.001 ± 0.521
4.001IleVal: 4.001 ± 0.896
0.941IleTrp: 0.941 ± 0.469
1.177IleTyr: 1.177 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
4.707LysAla: 4.707 ± 0.761
1.177LysCys: 1.177 ± 0.403
2.824LysAsp: 2.824 ± 0.644
5.884LysGlu: 5.884 ± 0.917
0.941LysPhe: 0.941 ± 0.678
3.295LysGly: 3.295 ± 0.618
0.706LysHis: 0.706 ± 0.302
4.942LysIle: 4.942 ± 0.925
3.295LysLys: 3.295 ± 1.197
3.766LysLeu: 3.766 ± 0.941
2.824LysMet: 2.824 ± 0.599
2.353LysAsn: 2.353 ± 0.452
1.177LysPro: 1.177 ± 0.468
1.883LysGln: 1.883 ± 0.619
5.178LysArg: 5.178 ± 1.439
3.53LysSer: 3.53 ± 0.562
3.53LysThr: 3.53 ± 1.101
3.06LysVal: 3.06 ± 0.926
1.412LysTrp: 1.412 ± 0.406
1.647LysTyr: 1.647 ± 0.664
0.0LysXaa: 0.0 ± 0.0
Leu
4.942LeuAla: 4.942 ± 0.912
0.941LeuCys: 0.941 ± 0.353
1.883LeuAsp: 1.883 ± 0.647
6.354LeuGlu: 6.354 ± 1.318
2.353LeuPhe: 2.353 ± 0.667
4.236LeuGly: 4.236 ± 0.618
1.177LeuHis: 1.177 ± 0.45
5.884LeuIle: 5.884 ± 0.982
5.178LeuLys: 5.178 ± 1.086
6.825LeuLeu: 6.825 ± 1.453
1.883LeuMet: 1.883 ± 0.531
4.001LeuAsn: 4.001 ± 1.003
3.766LeuPro: 3.766 ± 0.818
2.824LeuGln: 2.824 ± 0.562
6.119LeuArg: 6.119 ± 1.459
4.942LeuSer: 4.942 ± 0.644
5.884LeuThr: 5.884 ± 1.448
4.707LeuVal: 4.707 ± 1.636
0.941LeuTrp: 0.941 ± 0.298
3.06LeuTyr: 3.06 ± 1.104
0.0LeuXaa: 0.0 ± 0.0
Met
4.001MetAla: 4.001 ± 0.526
0.941MetCys: 0.941 ± 0.441
2.589MetAsp: 2.589 ± 1.013
4.472MetGlu: 4.472 ± 0.733
1.412MetPhe: 1.412 ± 0.702
2.118MetGly: 2.118 ± 0.726
0.235MetHis: 0.235 ± 0.191
2.824MetIle: 2.824 ± 0.478
2.353MetLys: 2.353 ± 1.033
1.883MetLeu: 1.883 ± 0.307
1.412MetMet: 1.412 ± 0.495
0.941MetAsn: 0.941 ± 0.563
0.706MetPro: 0.706 ± 0.337
1.177MetGln: 1.177 ± 0.299
2.353MetArg: 2.353 ± 0.675
2.118MetSer: 2.118 ± 0.35
1.647MetThr: 1.647 ± 0.46
3.295MetVal: 3.295 ± 0.983
0.471MetTrp: 0.471 ± 0.299
0.706MetTyr: 0.706 ± 0.224
0.0MetXaa: 0.0 ± 0.0
Asn
3.766AsnAla: 3.766 ± 0.765
0.706AsnCys: 0.706 ± 0.462
2.589AsnAsp: 2.589 ± 0.315
4.001AsnGlu: 4.001 ± 0.772
1.647AsnPhe: 1.647 ± 0.608
4.236AsnGly: 4.236 ± 0.99
0.0AsnHis: 0.0 ± 0.0
2.824AsnIle: 2.824 ± 0.658
3.295AsnLys: 3.295 ± 0.568
3.766AsnLeu: 3.766 ± 1.007
3.06AsnMet: 3.06 ± 0.76
3.295AsnAsn: 3.295 ± 1.271
4.001AsnPro: 4.001 ± 0.515
2.118AsnGln: 2.118 ± 0.554
3.53AsnArg: 3.53 ± 0.642
3.53AsnSer: 3.53 ± 0.773
3.53AsnThr: 3.53 ± 0.743
2.353AsnVal: 2.353 ± 1.275
0.941AsnTrp: 0.941 ± 0.535
0.706AsnTyr: 0.706 ± 0.449
0.0AsnXaa: 0.0 ± 0.0
Pro
2.589ProAla: 2.589 ± 0.991
0.471ProCys: 0.471 ± 0.312
1.647ProAsp: 1.647 ± 0.397
2.589ProGlu: 2.589 ± 0.602
2.353ProPhe: 2.353 ± 0.425
3.06ProGly: 3.06 ± 0.458
0.706ProHis: 0.706 ± 0.501
3.295ProIle: 3.295 ± 0.49
3.295ProLys: 3.295 ± 0.87
3.295ProLeu: 3.295 ± 0.737
0.941ProMet: 0.941 ± 0.668
2.589ProAsn: 2.589 ± 1.095
1.412ProPro: 1.412 ± 0.354
0.941ProGln: 0.941 ± 0.484
2.118ProArg: 2.118 ± 0.677
3.295ProSer: 3.295 ± 0.733
1.647ProThr: 1.647 ± 0.597
1.412ProVal: 1.412 ± 0.615
0.471ProTrp: 0.471 ± 0.244
0.706ProTyr: 0.706 ± 0.396
0.0ProXaa: 0.0 ± 0.0
Gln
2.353GlnAla: 2.353 ± 0.828
0.941GlnCys: 0.941 ± 0.403
1.647GlnAsp: 1.647 ± 0.593
1.647GlnGlu: 1.647 ± 0.6
0.471GlnPhe: 0.471 ± 0.268
2.824GlnGly: 2.824 ± 0.8
0.941GlnHis: 0.941 ± 0.335
3.295GlnIle: 3.295 ± 0.539
2.824GlnLys: 2.824 ± 0.97
2.824GlnLeu: 2.824 ± 0.713
2.824GlnMet: 2.824 ± 0.757
2.589GlnAsn: 2.589 ± 0.564
0.706GlnPro: 0.706 ± 0.386
1.412GlnGln: 1.412 ± 0.327
3.295GlnArg: 3.295 ± 0.831
2.589GlnSer: 2.589 ± 0.813
2.589GlnThr: 2.589 ± 1.095
1.883GlnVal: 1.883 ± 0.732
0.706GlnTrp: 0.706 ± 0.385
1.177GlnTyr: 1.177 ± 0.409
0.0GlnXaa: 0.0 ± 0.0
Arg
4.236ArgAla: 4.236 ± 0.64
1.177ArgCys: 1.177 ± 0.432
2.824ArgAsp: 2.824 ± 0.862
2.118ArgGlu: 2.118 ± 0.533
2.589ArgPhe: 2.589 ± 0.816
6.354ArgGly: 6.354 ± 1.15
0.471ArgHis: 0.471 ± 0.278
4.001ArgIle: 4.001 ± 0.913
2.118ArgLys: 2.118 ± 0.792
4.707ArgLeu: 4.707 ± 0.453
2.824ArgMet: 2.824 ± 0.998
4.472ArgAsn: 4.472 ± 0.716
2.589ArgPro: 2.589 ± 0.672
3.53ArgGln: 3.53 ± 0.456
4.942ArgArg: 4.942 ± 0.981
5.648ArgSer: 5.648 ± 1.144
6.354ArgThr: 6.354 ± 1.025
2.589ArgVal: 2.589 ± 0.695
0.706ArgTrp: 0.706 ± 0.268
2.118ArgTyr: 2.118 ± 0.717
0.0ArgXaa: 0.0 ± 0.0
Ser
3.53SerAla: 3.53 ± 1.255
2.353SerCys: 2.353 ± 0.825
3.06SerAsp: 3.06 ± 0.823
2.589SerGlu: 2.589 ± 0.609
3.766SerPhe: 3.766 ± 0.488
5.884SerGly: 5.884 ± 1.336
2.118SerHis: 2.118 ± 0.898
5.648SerIle: 5.648 ± 1.158
4.942SerLys: 4.942 ± 0.701
5.413SerLeu: 5.413 ± 1.228
2.118SerMet: 2.118 ± 0.723
4.472SerAsn: 4.472 ± 1.742
1.883SerPro: 1.883 ± 0.613
4.472SerGln: 4.472 ± 0.939
2.589SerArg: 2.589 ± 0.693
7.767SerSer: 7.767 ± 1.226
4.001SerThr: 4.001 ± 1.133
4.001SerVal: 4.001 ± 0.895
1.647SerTrp: 1.647 ± 0.903
2.353SerTyr: 2.353 ± 0.831
0.0SerXaa: 0.0 ± 0.0
Thr
3.766ThrAla: 3.766 ± 0.611
1.177ThrCys: 1.177 ± 0.312
2.589ThrAsp: 2.589 ± 0.548
5.178ThrGlu: 5.178 ± 1.396
2.824ThrPhe: 2.824 ± 0.277
4.707ThrGly: 4.707 ± 0.971
2.118ThrHis: 2.118 ± 0.593
4.472ThrIle: 4.472 ± 0.831
4.236ThrLys: 4.236 ± 0.95
5.413ThrLeu: 5.413 ± 0.918
2.118ThrMet: 2.118 ± 0.476
3.06ThrAsn: 3.06 ± 0.532
1.412ThrPro: 1.412 ± 0.339
2.118ThrGln: 2.118 ± 0.703
4.236ThrArg: 4.236 ± 0.777
4.236ThrSer: 4.236 ± 0.709
4.707ThrThr: 4.707 ± 1.004
4.942ThrVal: 4.942 ± 1.256
0.706ThrTrp: 0.706 ± 0.264
3.06ThrTyr: 3.06 ± 0.848
0.0ThrXaa: 0.0 ± 0.0
Val
3.766ValAla: 3.766 ± 0.926
2.353ValCys: 2.353 ± 1.387
3.766ValAsp: 3.766 ± 0.815
4.236ValGlu: 4.236 ± 0.597
2.353ValPhe: 2.353 ± 0.553
3.295ValGly: 3.295 ± 0.975
0.706ValHis: 0.706 ± 0.302
1.647ValIle: 1.647 ± 0.543
2.353ValLys: 2.353 ± 0.722
5.178ValLeu: 5.178 ± 1.382
2.118ValMet: 2.118 ± 0.555
3.53ValAsn: 3.53 ± 0.617
2.353ValPro: 2.353 ± 0.663
2.353ValGln: 2.353 ± 0.669
4.236ValArg: 4.236 ± 1.435
4.942ValSer: 4.942 ± 0.713
2.824ValThr: 2.824 ± 0.64
3.53ValVal: 3.53 ± 0.538
0.706ValTrp: 0.706 ± 0.496
1.647ValTyr: 1.647 ± 0.222
0.0ValXaa: 0.0 ± 0.0
Trp
1.177TrpAla: 1.177 ± 0.41
0.0TrpCys: 0.0 ± 0.0
0.471TrpAsp: 0.471 ± 0.258
1.412TrpGlu: 1.412 ± 0.481
0.706TrpPhe: 0.706 ± 0.32
0.941TrpGly: 0.941 ± 0.426
0.471TrpHis: 0.471 ± 0.337
0.706TrpIle: 0.706 ± 0.449
0.706TrpLys: 0.706 ± 0.501
0.706TrpLeu: 0.706 ± 0.306
0.941TrpMet: 0.941 ± 0.436
0.941TrpAsn: 0.941 ± 0.372
0.706TrpPro: 0.706 ± 0.276
0.0TrpGln: 0.0 ± 0.0
0.706TrpArg: 0.706 ± 0.491
1.883TrpSer: 1.883 ± 0.918
1.647TrpThr: 1.647 ± 0.631
0.471TrpVal: 0.471 ± 0.312
0.941TrpTrp: 0.941 ± 0.275
0.471TrpTyr: 0.471 ± 0.463
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.706TyrAla: 0.706 ± 0.268
0.235TyrCys: 0.235 ± 0.233
2.353TyrAsp: 2.353 ± 0.708
1.412TyrGlu: 1.412 ± 0.371
1.177TyrPhe: 1.177 ± 0.291
2.353TyrGly: 2.353 ± 0.517
0.235TyrHis: 0.235 ± 0.232
1.647TyrIle: 1.647 ± 0.337
1.412TyrLys: 1.412 ± 0.853
1.412TyrLeu: 1.412 ± 0.484
0.471TyrMet: 0.471 ± 0.203
1.412TyrAsn: 1.412 ± 0.573
1.412TyrPro: 1.412 ± 0.626
1.647TyrGln: 1.647 ± 0.359
2.118TyrArg: 2.118 ± 0.878
2.589TyrSer: 2.589 ± 0.385
2.118TyrThr: 2.118 ± 0.694
1.647TyrVal: 1.647 ± 0.725
0.706TyrTrp: 0.706 ± 0.329
0.471TyrTyr: 0.471 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4250 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski