Amino acid dipeptide frequency for Staphylococcus virus Baq_Sau1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
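As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter amino acid codes, and the normalization by the total number of counted dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: sequences are plain strings of one-letter codes.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, the single sequence "AAK" contains the two dipeptides AA and AK, so each would be reported at 500‰.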

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
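The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and whether the page's colouring uses the 400-dipeptide or the 441-dipeptide baseline is not stated, so the alphabet size is left as a parameter.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected per mille frequency if all dipeptides were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"      # would be marked in red
    if observed_per_mille < expected:
        return "under"     # would be marked in blue
    return "expected"
```

With 20 amino acids the expectation is 2.5‰, so the LysLys value of 8.525 from the table would count as overrepresented and the CysCys value of 0.0 as underrepresented.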

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
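A minimal sketch of the unit conversion, with helper names chosen only for this example:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

The AlaAla entry of 1.445‰, for instance, corresponds to a fraction of about 0.001445, and the uniform expectation of 2.5‰ corresponds to 0.25%.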

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.445AlaAla: 1.445 ± 0.469
0.217AlaCys: 0.217 ± 0.12
3.107AlaAsp: 3.107 ± 0.502
4.19AlaGlu: 4.19 ± 0.56
2.962AlaPhe: 2.962 ± 0.444
2.818AlaGly: 2.818 ± 0.479
1.011AlaHis: 1.011 ± 0.283
4.985AlaIle: 4.985 ± 1.007
5.13AlaLys: 5.13 ± 0.683
3.974AlaLeu: 3.974 ± 0.663
1.734AlaMet: 1.734 ± 0.443
4.118AlaAsn: 4.118 ± 0.471
1.589AlaPro: 1.589 ± 0.306
2.24AlaGln: 2.24 ± 0.398
2.384AlaArg: 2.384 ± 0.409
3.54AlaSer: 3.54 ± 0.638
3.179AlaThr: 3.179 ± 0.488
3.757AlaVal: 3.757 ± 0.569
0.722AlaTrp: 0.722 ± 0.304
2.818AlaTyr: 2.818 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.144CysAla: 0.144 ± 0.094
0.0CysCys: 0.0 ± 0.0
0.144CysAsp: 0.144 ± 0.107
0.433CysGlu: 0.433 ± 0.208
0.361CysPhe: 0.361 ± 0.167
0.144CysGly: 0.144 ± 0.097
0.0CysHis: 0.0 ± 0.0
0.289CysIle: 0.289 ± 0.129
0.433CysLys: 0.433 ± 0.171
0.289CysLeu: 0.289 ± 0.18
0.144CysMet: 0.144 ± 0.151
0.217CysAsn: 0.217 ± 0.133
0.072CysPro: 0.072 ± 0.062
0.217CysGln: 0.217 ± 0.121
0.361CysArg: 0.361 ± 0.149
0.144CysSer: 0.144 ± 0.124
0.289CysThr: 0.289 ± 0.128
0.289CysVal: 0.289 ± 0.127
0.072CysTrp: 0.072 ± 0.069
0.072CysTyr: 0.072 ± 0.069
0.0CysXaa: 0.0 ± 0.0
Asp
3.829AspAla: 3.829 ± 0.638
0.289AspCys: 0.289 ± 0.14
4.479AspAsp: 4.479 ± 0.775
5.13AspGlu: 5.13 ± 0.745
3.974AspPhe: 3.974 ± 0.629
3.034AspGly: 3.034 ± 0.456
0.433AspHis: 0.433 ± 0.213
5.708AspIle: 5.708 ± 0.65
6.43AspLys: 6.43 ± 0.73
4.985AspLeu: 4.985 ± 0.707
1.734AspMet: 1.734 ± 0.318
4.118AspAsn: 4.118 ± 0.564
1.3AspPro: 1.3 ± 0.294
1.084AspGln: 1.084 ± 0.252
1.951AspArg: 1.951 ± 0.416
4.19AspSer: 4.19 ± 0.579
3.54AspThr: 3.54 ± 0.545
3.54AspVal: 3.54 ± 0.537
0.795AspTrp: 0.795 ± 0.224
2.529AspTyr: 2.529 ± 0.521
0.0AspXaa: 0.0 ± 0.0
Glu
5.419GluAla: 5.419 ± 0.761
0.361GluCys: 0.361 ± 0.15
3.685GluAsp: 3.685 ± 0.735
6.286GluGlu: 6.286 ± 1.039
2.962GluPhe: 2.962 ± 0.459
3.179GluGly: 3.179 ± 0.477
1.011GluHis: 1.011 ± 0.256
6.286GluIle: 6.286 ± 0.83
6.213GluLys: 6.213 ± 1.022
6.791GluLeu: 6.791 ± 0.737
2.023GluMet: 2.023 ± 0.384
5.274GluAsn: 5.274 ± 0.623
1.734GluPro: 1.734 ± 0.356
4.841GluGln: 4.841 ± 0.704
3.901GluArg: 3.901 ± 0.538
3.974GluSer: 3.974 ± 0.575
3.107GluThr: 3.107 ± 0.436
5.708GluVal: 5.708 ± 0.791
0.578GluTrp: 0.578 ± 0.189
5.202GluTyr: 5.202 ± 0.653
0.0GluXaa: 0.0 ± 0.0
Phe
2.312PheAla: 2.312 ± 0.338
0.144PheCys: 0.144 ± 0.102
3.179PheAsp: 3.179 ± 0.49
3.901PheGlu: 3.901 ± 0.539
1.228PhePhe: 1.228 ± 0.277
2.962PheGly: 2.962 ± 0.458
0.795PheHis: 0.795 ± 0.292
2.89PheIle: 2.89 ± 0.402
4.19PheLys: 4.19 ± 0.468
2.601PheLeu: 2.601 ± 0.377
1.084PheMet: 1.084 ± 0.296
3.179PheAsn: 3.179 ± 0.445
0.939PhePro: 0.939 ± 0.327
1.373PheGln: 1.373 ± 0.353
1.445PheArg: 1.445 ± 0.306
3.396PheSer: 3.396 ± 0.469
2.962PheThr: 2.962 ± 0.389
2.384PheVal: 2.384 ± 0.583
0.144PheTrp: 0.144 ± 0.099
1.734PheTyr: 1.734 ± 0.348
0.0PheXaa: 0.0 ± 0.0
Gly
2.745GlyAla: 2.745 ± 0.463
0.361GlyCys: 0.361 ± 0.173
3.612GlyAsp: 3.612 ± 0.494
3.54GlyGlu: 3.54 ± 0.614
2.384GlyPhe: 2.384 ± 0.473
3.034GlyGly: 3.034 ± 0.555
1.517GlyHis: 1.517 ± 0.422
4.19GlyIle: 4.19 ± 0.554
4.768GlyLys: 4.768 ± 0.576
5.491GlyLeu: 5.491 ± 0.834
1.589GlyMet: 1.589 ± 0.301
2.89GlyAsn: 2.89 ± 0.446
0.65GlyPro: 0.65 ± 0.276
1.662GlyGln: 1.662 ± 0.405
2.023GlyArg: 2.023 ± 0.329
3.034GlySer: 3.034 ± 0.44
3.54GlyThr: 3.54 ± 0.459
4.407GlyVal: 4.407 ± 0.522
0.65GlyTrp: 0.65 ± 0.254
2.456GlyTyr: 2.456 ± 0.414
0.0GlyXaa: 0.0 ± 0.0
His
1.373HisAla: 1.373 ± 0.387
0.144HisCys: 0.144 ± 0.096
1.011HisAsp: 1.011 ± 0.243
0.867HisGlu: 0.867 ± 0.282
1.011HisPhe: 1.011 ± 0.272
1.3HisGly: 1.3 ± 0.255
0.506HisHis: 0.506 ± 0.156
1.228HisIle: 1.228 ± 0.432
0.722HisLys: 0.722 ± 0.243
1.3HisLeu: 1.3 ± 0.322
0.506HisMet: 0.506 ± 0.199
1.011HisAsn: 1.011 ± 0.276
0.361HisPro: 0.361 ± 0.136
0.361HisGln: 0.361 ± 0.189
0.361HisArg: 0.361 ± 0.157
1.3HisSer: 1.3 ± 0.271
0.939HisThr: 0.939 ± 0.264
1.156HisVal: 1.156 ± 0.318
0.0HisTrp: 0.0 ± 0.0
0.939HisTyr: 0.939 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
5.13IleAla: 5.13 ± 0.827
0.144IleCys: 0.144 ± 0.103
5.635IleAsp: 5.635 ± 0.613
6.502IleGlu: 6.502 ± 0.699
2.673IlePhe: 2.673 ± 0.512
3.757IleGly: 3.757 ± 0.583
1.156IleHis: 1.156 ± 0.279
4.19IleIle: 4.19 ± 0.685
7.803IleLys: 7.803 ± 0.678
4.552IleLeu: 4.552 ± 0.475
1.806IleMet: 1.806 ± 0.408
4.985IleAsn: 4.985 ± 0.496
2.167IlePro: 2.167 ± 0.314
2.167IleGln: 2.167 ± 0.398
3.468IleArg: 3.468 ± 0.538
4.696IleSer: 4.696 ± 0.591
5.274IleThr: 5.274 ± 0.785
4.479IleVal: 4.479 ± 0.554
1.373IleTrp: 1.373 ± 0.681
3.323IleTyr: 3.323 ± 0.664
0.0IleXaa: 0.0 ± 0.0
Lys
5.852LysAla: 5.852 ± 0.616
0.072LysCys: 0.072 ± 0.069
6.43LysAsp: 6.43 ± 0.673
8.309LysGlu: 8.309 ± 1.145
3.323LysPhe: 3.323 ± 0.592
5.491LysGly: 5.491 ± 0.604
2.095LysHis: 2.095 ± 0.484
5.852LysIle: 5.852 ± 0.607
8.525LysLys: 8.525 ± 0.652
6.791LysLeu: 6.791 ± 0.698
2.167LysMet: 2.167 ± 0.426
5.346LysAsn: 5.346 ± 0.74
3.179LysPro: 3.179 ± 0.534
4.552LysGln: 4.552 ± 0.65
3.54LysArg: 3.54 ± 0.54
5.708LysSer: 5.708 ± 0.642
5.563LysThr: 5.563 ± 0.631
6.213LysVal: 6.213 ± 0.749
0.578LysTrp: 0.578 ± 0.175
5.13LysTyr: 5.13 ± 0.804
0.0LysXaa: 0.0 ± 0.0
Leu
3.323LeuAla: 3.323 ± 0.564
0.144LeuCys: 0.144 ± 0.081
4.335LeuAsp: 4.335 ± 0.526
5.78LeuGlu: 5.78 ± 0.673
3.468LeuPhe: 3.468 ± 0.525
3.757LeuGly: 3.757 ± 0.501
0.795LeuHis: 0.795 ± 0.222
4.479LeuIle: 4.479 ± 0.6
7.369LeuLys: 7.369 ± 0.637
5.491LeuLeu: 5.491 ± 0.626
1.517LeuMet: 1.517 ± 0.326
5.997LeuAsn: 5.997 ± 0.513
2.529LeuPro: 2.529 ± 0.367
3.251LeuGln: 3.251 ± 0.563
3.54LeuArg: 3.54 ± 0.587
4.985LeuSer: 4.985 ± 0.57
4.985LeuThr: 4.985 ± 0.601
4.046LeuVal: 4.046 ± 0.619
0.722LeuTrp: 0.722 ± 0.286
3.251LeuTyr: 3.251 ± 0.583
0.0LeuXaa: 0.0 ± 0.0
Met
1.445MetAla: 1.445 ± 0.333
0.217MetCys: 0.217 ± 0.115
1.228MetAsp: 1.228 ± 0.269
1.445MetGlu: 1.445 ± 0.329
1.011MetPhe: 1.011 ± 0.271
0.867MetGly: 0.867 ± 0.293
0.361MetHis: 0.361 ± 0.168
1.589MetIle: 1.589 ± 0.368
2.167MetLys: 2.167 ± 0.362
2.312MetLeu: 2.312 ± 0.326
0.795MetMet: 0.795 ± 0.223
2.24MetAsn: 2.24 ± 0.412
0.867MetPro: 0.867 ± 0.219
1.156MetGln: 1.156 ± 0.307
1.011MetArg: 1.011 ± 0.34
1.662MetSer: 1.662 ± 0.471
2.529MetThr: 2.529 ± 0.542
0.795MetVal: 0.795 ± 0.212
0.433MetTrp: 0.433 ± 0.174
1.084MetTyr: 1.084 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
4.696AsnAla: 4.696 ± 0.849
0.361AsnCys: 0.361 ± 0.181
4.624AsnAsp: 4.624 ± 0.523
5.563AsnGlu: 5.563 ± 0.78
2.962AsnPhe: 2.962 ± 0.595
5.202AsnGly: 5.202 ± 0.529
1.011AsnHis: 1.011 ± 0.275
4.118AsnIle: 4.118 ± 0.511
6.647AsnLys: 6.647 ± 0.626
4.913AsnLeu: 4.913 ± 0.525
1.517AsnMet: 1.517 ± 0.323
5.274AsnAsn: 5.274 ± 0.99
2.601AsnPro: 2.601 ± 0.502
2.24AsnGln: 2.24 ± 0.352
2.745AsnArg: 2.745 ± 0.382
3.251AsnSer: 3.251 ± 0.369
3.974AsnThr: 3.974 ± 0.486
4.19AsnVal: 4.19 ± 0.599
0.867AsnTrp: 0.867 ± 0.232
3.468AsnTyr: 3.468 ± 0.588
0.0AsnXaa: 0.0 ± 0.0
Pro
1.156ProAla: 1.156 ± 0.27
0.289ProCys: 0.289 ± 0.145
1.084ProAsp: 1.084 ± 0.247
2.456ProGlu: 2.456 ± 0.387
1.3ProPhe: 1.3 ± 0.354
1.589ProGly: 1.589 ± 0.437
0.289ProHis: 0.289 ± 0.127
1.806ProIle: 1.806 ± 0.321
2.89ProLys: 2.89 ± 0.685
1.3ProLeu: 1.3 ± 0.279
1.156ProMet: 1.156 ± 0.265
2.312ProAsn: 2.312 ± 0.392
0.578ProPro: 0.578 ± 0.222
1.445ProGln: 1.445 ± 0.347
1.084ProArg: 1.084 ± 0.323
1.589ProSer: 1.589 ± 0.334
1.806ProThr: 1.806 ± 0.357
1.951ProVal: 1.951 ± 0.403
0.144ProTrp: 0.144 ± 0.113
1.011ProTyr: 1.011 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
2.095GlnAla: 2.095 ± 0.365
0.433GlnCys: 0.433 ± 0.173
2.673GlnAsp: 2.673 ± 0.434
2.962GlnGlu: 2.962 ± 0.578
1.517GlnPhe: 1.517 ± 0.388
1.951GlnGly: 1.951 ± 0.353
0.65GlnHis: 0.65 ± 0.205
2.89GlnIle: 2.89 ± 0.443
3.757GlnLys: 3.757 ± 0.47
2.818GlnLeu: 2.818 ± 0.474
1.3GlnMet: 1.3 ± 0.332
2.167GlnAsn: 2.167 ± 0.366
1.3GlnPro: 1.3 ± 0.314
1.734GlnGln: 1.734 ± 0.413
2.312GlnArg: 2.312 ± 0.36
2.456GlnSer: 2.456 ± 0.408
1.373GlnThr: 1.373 ± 0.33
2.167GlnVal: 2.167 ± 0.447
0.289GlnTrp: 0.289 ± 0.161
1.445GlnTyr: 1.445 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
1.517ArgAla: 1.517 ± 0.277
0.289ArgCys: 0.289 ± 0.132
1.662ArgAsp: 1.662 ± 0.372
3.179ArgGlu: 3.179 ± 0.412
1.878ArgPhe: 1.878 ± 0.363
2.312ArgGly: 2.312 ± 0.422
1.156ArgHis: 1.156 ± 0.323
3.757ArgIle: 3.757 ± 0.527
4.263ArgLys: 4.263 ± 0.582
3.396ArgLeu: 3.396 ± 0.489
0.795ArgMet: 0.795 ± 0.249
3.251ArgAsn: 3.251 ± 0.56
1.084ArgPro: 1.084 ± 0.291
1.589ArgGln: 1.589 ± 0.328
1.951ArgArg: 1.951 ± 0.49
1.806ArgSer: 1.806 ± 0.308
2.095ArgThr: 2.095 ± 0.454
2.529ArgVal: 2.529 ± 0.464
0.433ArgTrp: 0.433 ± 0.192
1.734ArgTyr: 1.734 ± 0.44
0.0ArgXaa: 0.0 ± 0.0
Ser
3.829SerAla: 3.829 ± 0.62
0.144SerCys: 0.144 ± 0.101
4.552SerAsp: 4.552 ± 0.642
3.54SerGlu: 3.54 ± 0.58
2.601SerPhe: 2.601 ± 0.495
3.757SerGly: 3.757 ± 0.557
1.156SerHis: 1.156 ± 0.326
5.274SerIle: 5.274 ± 0.452
5.635SerLys: 5.635 ± 0.583
3.612SerLeu: 3.612 ± 0.437
1.662SerMet: 1.662 ± 0.374
4.624SerAsn: 4.624 ± 0.473
1.3SerPro: 1.3 ± 0.338
2.023SerGln: 2.023 ± 0.377
2.24SerArg: 2.24 ± 0.318
3.251SerSer: 3.251 ± 0.45
3.685SerThr: 3.685 ± 0.401
3.107SerVal: 3.107 ± 0.527
0.289SerTrp: 0.289 ± 0.114
2.745SerTyr: 2.745 ± 0.404
0.0SerXaa: 0.0 ± 0.0
Thr
3.251ThrAla: 3.251 ± 0.507
0.072ThrCys: 0.072 ± 0.075
4.118ThrAsp: 4.118 ± 0.589
4.263ThrGlu: 4.263 ± 0.571
2.601ThrPhe: 2.601 ± 0.396
3.829ThrGly: 3.829 ± 0.549
0.867ThrHis: 0.867 ± 0.207
5.635ThrIle: 5.635 ± 1.38
5.924ThrLys: 5.924 ± 0.623
4.913ThrLeu: 4.913 ± 0.487
1.084ThrMet: 1.084 ± 0.411
3.757ThrAsn: 3.757 ± 0.581
1.806ThrPro: 1.806 ± 0.343
2.529ThrGln: 2.529 ± 0.44
1.951ThrArg: 1.951 ± 0.293
2.962ThrSer: 2.962 ± 0.517
3.757ThrThr: 3.757 ± 0.662
4.479ThrVal: 4.479 ± 0.674
0.867ThrTrp: 0.867 ± 0.256
2.167ThrTyr: 2.167 ± 0.495
0.0ThrXaa: 0.0 ± 0.0
Val
3.757ValAla: 3.757 ± 0.894
0.289ValCys: 0.289 ± 0.132
4.624ValAsp: 4.624 ± 0.688
5.563ValGlu: 5.563 ± 0.707
2.312ValPhe: 2.312 ± 0.519
2.456ValGly: 2.456 ± 0.547
0.578ValHis: 0.578 ± 0.203
5.924ValIle: 5.924 ± 0.613
6.069ValLys: 6.069 ± 0.513
3.974ValLeu: 3.974 ± 0.537
1.445ValMet: 1.445 ± 0.356
4.768ValAsn: 4.768 ± 0.553
1.806ValPro: 1.806 ± 0.441
1.662ValGln: 1.662 ± 0.296
2.095ValArg: 2.095 ± 0.457
4.046ValSer: 4.046 ± 0.706
3.685ValThr: 3.685 ± 0.589
3.396ValVal: 3.396 ± 0.474
0.867ValTrp: 0.867 ± 0.37
2.745ValTyr: 2.745 ± 0.557
0.0ValXaa: 0.0 ± 0.0
Trp
0.722TrpAla: 0.722 ± 0.299
0.072TrpCys: 0.072 ± 0.069
0.289TrpAsp: 0.289 ± 0.127
0.65TrpGlu: 0.65 ± 0.184
0.289TrpPhe: 0.289 ± 0.135
0.433TrpGly: 0.433 ± 0.278
0.072TrpHis: 0.072 ± 0.078
1.011TrpIle: 1.011 ± 0.276
0.795TrpLys: 0.795 ± 0.27
0.65TrpLeu: 0.65 ± 0.238
0.144TrpMet: 0.144 ± 0.107
1.734TrpAsn: 1.734 ± 1.004
0.144TrpPro: 0.144 ± 0.107
0.506TrpGln: 0.506 ± 0.175
0.144TrpArg: 0.144 ± 0.1
0.939TrpSer: 0.939 ± 0.292
1.228TrpThr: 1.228 ± 0.347
0.65TrpVal: 0.65 ± 0.195
0.144TrpTrp: 0.144 ± 0.107
0.506TrpTyr: 0.506 ± 0.213
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.878TyrAla: 1.878 ± 0.357
0.072TyrCys: 0.072 ± 0.071
2.601TyrAsp: 2.601 ± 0.495
3.974TyrGlu: 3.974 ± 0.562
2.023TyrPhe: 2.023 ± 0.529
2.673TyrGly: 2.673 ± 0.565
0.939TyrHis: 0.939 ± 0.276
3.179TyrIle: 3.179 ± 0.543
4.985TyrLys: 4.985 ± 0.712
3.612TyrLeu: 3.612 ± 0.59
0.939TyrMet: 0.939 ± 0.269
2.962TyrAsn: 2.962 ± 0.437
1.228TyrPro: 1.228 ± 0.337
1.662TyrGln: 1.662 ± 0.359
2.167TyrArg: 2.167 ± 0.503
2.167TyrSer: 2.167 ± 0.432
3.323TyrThr: 3.323 ± 0.456
2.818TyrVal: 2.818 ± 0.449
1.156TyrTrp: 1.156 ± 0.405
1.734TyrTyr: 1.734 ± 0.336
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13842 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
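The exact procedure behind these error values is not described here, so the following is only a sketch of one common way to bootstrap at the protein level: whole proteins are resampled with replacement, the frequency of a dipeptide is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the use of the standard deviation are assumptions.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.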

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski