Amino acid dipepetide frequency for Arthrobacter phage Nandita

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.381AlaAla: 19.381 ± 2.597
0.891AlaCys: 0.891 ± 0.242
8.168AlaAsp: 8.168 ± 0.757
7.871AlaGlu: 7.871 ± 1.054
3.193AlaPhe: 3.193 ± 0.487
12.326AlaGly: 12.326 ± 1.196
2.005AlaHis: 2.005 ± 0.28
4.975AlaIle: 4.975 ± 0.693
6.312AlaLys: 6.312 ± 0.706
10.544AlaLeu: 10.544 ± 0.765
2.896AlaMet: 2.896 ± 0.467
3.119AlaAsn: 3.119 ± 0.549
6.832AlaPro: 6.832 ± 0.657
5.049AlaGln: 5.049 ± 0.626
7.351AlaArg: 7.351 ± 0.919
5.94AlaSer: 5.94 ± 0.739
6.386AlaThr: 6.386 ± 0.688
9.282AlaVal: 9.282 ± 0.718
2.45AlaTrp: 2.45 ± 0.399
1.634AlaTyr: 1.634 ± 0.382
0.0AlaXaa: 0.0 ± 0.0
Cys
0.594CysAla: 0.594 ± 0.21
0.074CysCys: 0.074 ± 0.077
0.371CysAsp: 0.371 ± 0.192
0.594CysGlu: 0.594 ± 0.191
0.074CysPhe: 0.074 ± 0.077
0.817CysGly: 0.817 ± 0.226
0.149CysHis: 0.149 ± 0.101
0.149CysIle: 0.149 ± 0.123
0.223CysLys: 0.223 ± 0.117
0.668CysLeu: 0.668 ± 0.263
0.149CysMet: 0.149 ± 0.094
0.149CysAsn: 0.149 ± 0.107
0.446CysPro: 0.446 ± 0.188
0.223CysGln: 0.223 ± 0.127
0.446CysArg: 0.446 ± 0.161
0.594CysSer: 0.594 ± 0.275
0.371CysThr: 0.371 ± 0.215
0.371CysVal: 0.371 ± 0.157
0.149CysTrp: 0.149 ± 0.11
0.149CysTyr: 0.149 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
6.089AspAla: 6.089 ± 0.628
0.371AspCys: 0.371 ± 0.183
2.97AspAsp: 2.97 ± 0.514
4.084AspGlu: 4.084 ± 0.652
1.856AspPhe: 1.856 ± 0.295
6.163AspGly: 6.163 ± 0.706
0.965AspHis: 0.965 ± 0.318
2.302AspIle: 2.302 ± 0.471
1.559AspLys: 1.559 ± 0.405
6.312AspLeu: 6.312 ± 0.653
1.262AspMet: 1.262 ± 0.311
1.485AspAsn: 1.485 ± 0.277
3.639AspPro: 3.639 ± 0.372
1.485AspGln: 1.485 ± 0.375
4.233AspArg: 4.233 ± 0.611
3.49AspSer: 3.49 ± 0.476
3.193AspThr: 3.193 ± 0.482
4.678AspVal: 4.678 ± 0.609
1.411AspTrp: 1.411 ± 0.363
1.188AspTyr: 1.188 ± 0.336
0.0AspXaa: 0.0 ± 0.0
Glu
6.832GluAla: 6.832 ± 0.902
0.149GluCys: 0.149 ± 0.105
3.936GluAsp: 3.936 ± 0.678
4.678GluGlu: 4.678 ± 0.896
1.559GluPhe: 1.559 ± 0.366
3.787GluGly: 3.787 ± 0.648
1.114GluHis: 1.114 ± 0.283
2.896GluIle: 2.896 ± 0.504
2.747GluLys: 2.747 ± 0.402
6.163GluLeu: 6.163 ± 0.89
0.891GluMet: 0.891 ± 0.207
1.559GluAsn: 1.559 ± 0.315
2.822GluPro: 2.822 ± 0.421
2.525GluGln: 2.525 ± 0.438
3.861GluArg: 3.861 ± 0.548
3.119GluSer: 3.119 ± 0.458
2.822GluThr: 2.822 ± 0.411
4.084GluVal: 4.084 ± 0.57
1.634GluTrp: 1.634 ± 0.379
1.188GluTyr: 1.188 ± 0.293
0.0GluXaa: 0.0 ± 0.0
Phe
4.158PheAla: 4.158 ± 0.546
0.446PheCys: 0.446 ± 0.207
2.079PheAsp: 2.079 ± 0.43
1.411PheGlu: 1.411 ± 0.338
0.446PhePhe: 0.446 ± 0.169
2.525PheGly: 2.525 ± 0.43
0.371PheHis: 0.371 ± 0.18
1.559PheIle: 1.559 ± 0.326
0.594PheLys: 0.594 ± 0.219
2.005PheLeu: 2.005 ± 0.41
0.371PheMet: 0.371 ± 0.192
1.04PheAsn: 1.04 ± 0.241
2.079PhePro: 2.079 ± 0.407
1.04PheGln: 1.04 ± 0.326
1.04PheArg: 1.04 ± 0.337
1.262PheSer: 1.262 ± 0.323
2.747PheThr: 2.747 ± 0.499
1.782PheVal: 1.782 ± 0.34
0.074PheTrp: 0.074 ± 0.077
0.743PheTyr: 0.743 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
8.985GlyAla: 8.985 ± 1.04
0.223GlyCys: 0.223 ± 0.133
4.827GlyAsp: 4.827 ± 0.531
3.416GlyGlu: 3.416 ± 0.425
3.787GlyPhe: 3.787 ± 0.592
7.574GlyGly: 7.574 ± 0.795
1.708GlyHis: 1.708 ± 0.418
3.787GlyIle: 3.787 ± 0.807
3.267GlyLys: 3.267 ± 0.455
6.163GlyLeu: 6.163 ± 0.766
1.634GlyMet: 1.634 ± 0.341
3.119GlyAsn: 3.119 ± 0.44
3.342GlyPro: 3.342 ± 0.447
2.822GlyGln: 2.822 ± 0.466
4.827GlyArg: 4.827 ± 0.699
5.643GlySer: 5.643 ± 0.747
7.574GlyThr: 7.574 ± 0.955
6.089GlyVal: 6.089 ± 0.835
2.45GlyTrp: 2.45 ± 0.433
2.376GlyTyr: 2.376 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
1.931HisAla: 1.931 ± 0.409
0.149HisCys: 0.149 ± 0.111
0.743HisAsp: 0.743 ± 0.239
0.965HisGlu: 0.965 ± 0.252
0.594HisPhe: 0.594 ± 0.176
1.559HisGly: 1.559 ± 0.417
0.297HisHis: 0.297 ± 0.143
1.04HisIle: 1.04 ± 0.277
0.446HisLys: 0.446 ± 0.134
2.376HisLeu: 2.376 ± 0.384
0.52HisMet: 0.52 ± 0.193
0.594HisAsn: 0.594 ± 0.206
1.337HisPro: 1.337 ± 0.382
0.743HisGln: 0.743 ± 0.249
1.485HisArg: 1.485 ± 0.339
0.891HisSer: 0.891 ± 0.249
1.114HisThr: 1.114 ± 0.32
1.782HisVal: 1.782 ± 0.376
0.223HisTrp: 0.223 ± 0.107
0.371HisTyr: 0.371 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
5.94IleAla: 5.94 ± 0.87
0.52IleCys: 0.52 ± 0.191
2.97IleAsp: 2.97 ± 0.428
2.376IleGlu: 2.376 ± 0.479
1.931IlePhe: 1.931 ± 0.381
3.342IleGly: 3.342 ± 0.496
1.114IleHis: 1.114 ± 0.227
1.485IleIle: 1.485 ± 0.299
1.337IleLys: 1.337 ± 0.222
2.079IleLeu: 2.079 ± 0.383
0.965IleMet: 0.965 ± 0.267
1.337IleAsn: 1.337 ± 0.28
1.634IlePro: 1.634 ± 0.28
2.153IleGln: 2.153 ± 0.491
3.044IleArg: 3.044 ± 0.507
2.896IleSer: 2.896 ± 0.429
4.381IleThr: 4.381 ± 0.654
2.822IleVal: 2.822 ± 0.449
0.223IleTrp: 0.223 ± 0.116
0.891IleTyr: 0.891 ± 0.326
0.0IleXaa: 0.0 ± 0.0
Lys
6.906LysAla: 6.906 ± 0.863
0.149LysCys: 0.149 ± 0.105
2.228LysAsp: 2.228 ± 0.456
1.782LysGlu: 1.782 ± 0.395
0.594LysPhe: 0.594 ± 0.19
2.525LysGly: 2.525 ± 0.553
0.965LysHis: 0.965 ± 0.284
1.634LysIle: 1.634 ± 0.313
2.005LysLys: 2.005 ± 0.462
3.267LysLeu: 3.267 ± 0.53
0.817LysMet: 0.817 ± 0.256
0.965LysAsn: 0.965 ± 0.294
2.747LysPro: 2.747 ± 0.361
1.114LysGln: 1.114 ± 0.314
2.896LysArg: 2.896 ± 0.486
2.673LysSer: 2.673 ± 0.488
3.044LysThr: 3.044 ± 0.491
2.45LysVal: 2.45 ± 0.397
0.817LysTrp: 0.817 ± 0.26
1.262LysTyr: 1.262 ± 0.383
0.0LysXaa: 0.0 ± 0.0
Leu
9.95LeuAla: 9.95 ± 0.736
0.52LeuCys: 0.52 ± 0.192
4.975LeuAsp: 4.975 ± 0.642
5.198LeuGlu: 5.198 ± 0.706
1.708LeuPhe: 1.708 ± 0.364
6.683LeuGly: 6.683 ± 0.889
1.188LeuHis: 1.188 ± 0.288
4.381LeuIle: 4.381 ± 0.597
3.564LeuLys: 3.564 ± 0.417
6.237LeuLeu: 6.237 ± 0.75
2.228LeuMet: 2.228 ± 0.368
3.193LeuAsn: 3.193 ± 0.491
6.237LeuPro: 6.237 ± 0.796
2.747LeuGln: 2.747 ± 0.52
5.866LeuArg: 5.866 ± 0.715
4.307LeuSer: 4.307 ± 0.62
5.792LeuThr: 5.792 ± 0.5
6.237LeuVal: 6.237 ± 0.965
1.262LeuTrp: 1.262 ± 0.297
1.262LeuTyr: 1.262 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
2.822MetAla: 2.822 ± 0.535
0.149MetCys: 0.149 ± 0.105
0.891MetAsp: 0.891 ± 0.275
1.04MetGlu: 1.04 ± 0.297
0.371MetPhe: 0.371 ± 0.136
1.856MetGly: 1.856 ± 0.342
0.223MetHis: 0.223 ± 0.147
0.743MetIle: 0.743 ± 0.201
0.891MetLys: 0.891 ± 0.209
1.634MetLeu: 1.634 ± 0.345
0.297MetMet: 0.297 ± 0.165
0.817MetAsn: 0.817 ± 0.245
1.856MetPro: 1.856 ± 0.325
0.446MetGln: 0.446 ± 0.184
1.188MetArg: 1.188 ± 0.301
1.411MetSer: 1.411 ± 0.298
1.782MetThr: 1.782 ± 0.434
0.891MetVal: 0.891 ± 0.219
0.0MetTrp: 0.0 ± 0.0
0.371MetTyr: 0.371 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
4.752AsnAla: 4.752 ± 0.734
0.149AsnCys: 0.149 ± 0.091
2.376AsnAsp: 2.376 ± 0.337
1.634AsnGlu: 1.634 ± 0.361
0.52AsnPhe: 0.52 ± 0.192
2.747AsnGly: 2.747 ± 0.432
0.817AsnHis: 0.817 ± 0.219
1.188AsnIle: 1.188 ± 0.269
0.817AsnLys: 0.817 ± 0.193
2.005AsnLeu: 2.005 ± 0.338
0.297AsnMet: 0.297 ± 0.124
0.594AsnAsn: 0.594 ± 0.238
2.376AsnPro: 2.376 ± 0.421
1.114AsnGln: 1.114 ± 0.256
2.376AsnArg: 2.376 ± 0.368
1.782AsnSer: 1.782 ± 0.456
2.673AsnThr: 2.673 ± 0.433
1.856AsnVal: 1.856 ± 0.475
1.188AsnTrp: 1.188 ± 0.324
0.668AsnTyr: 0.668 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
8.614ProAla: 8.614 ± 0.974
0.446ProCys: 0.446 ± 0.186
3.936ProAsp: 3.936 ± 0.799
3.639ProGlu: 3.639 ± 0.603
1.262ProPhe: 1.262 ± 0.276
5.049ProGly: 5.049 ± 0.627
1.708ProHis: 1.708 ± 0.419
2.302ProIle: 2.302 ± 0.313
2.005ProLys: 2.005 ± 0.421
3.936ProLeu: 3.936 ± 0.522
0.965ProMet: 0.965 ± 0.258
1.634ProAsn: 1.634 ± 0.384
3.787ProPro: 3.787 ± 0.752
2.005ProGln: 2.005 ± 0.359
4.01ProArg: 4.01 ± 0.496
3.416ProSer: 3.416 ± 0.617
3.044ProThr: 3.044 ± 0.365
5.346ProVal: 5.346 ± 0.819
0.52ProTrp: 0.52 ± 0.206
0.817ProTyr: 0.817 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
4.752GlnAla: 4.752 ± 0.787
0.074GlnCys: 0.074 ± 0.069
1.708GlnAsp: 1.708 ± 0.268
2.525GlnGlu: 2.525 ± 0.458
0.891GlnPhe: 0.891 ± 0.301
1.856GlnGly: 1.856 ± 0.306
0.965GlnHis: 0.965 ± 0.319
1.931GlnIle: 1.931 ± 0.422
1.188GlnLys: 1.188 ± 0.329
3.49GlnLeu: 3.49 ± 0.592
0.668GlnMet: 0.668 ± 0.233
1.188GlnAsn: 1.188 ± 0.259
2.302GlnPro: 2.302 ± 0.56
1.782GlnGln: 1.782 ± 0.321
2.005GlnArg: 2.005 ± 0.402
2.525GlnSer: 2.525 ± 0.498
2.599GlnThr: 2.599 ± 0.441
2.45GlnVal: 2.45 ± 0.447
0.668GlnTrp: 0.668 ± 0.182
0.668GlnTyr: 0.668 ± 0.221
0.0GlnXaa: 0.0 ± 0.0
Arg
6.534ArgAla: 6.534 ± 0.617
0.594ArgCys: 0.594 ± 0.247
4.084ArgAsp: 4.084 ± 0.478
3.936ArgGlu: 3.936 ± 0.605
1.411ArgPhe: 1.411 ± 0.312
3.267ArgGly: 3.267 ± 0.56
1.634ArgHis: 1.634 ± 0.445
2.376ArgIle: 2.376 ± 0.456
3.713ArgLys: 3.713 ± 0.506
5.346ArgLeu: 5.346 ± 0.768
1.04ArgMet: 1.04 ± 0.251
2.525ArgAsn: 2.525 ± 0.418
3.639ArgPro: 3.639 ± 0.586
2.228ArgGln: 2.228 ± 0.409
6.089ArgArg: 6.089 ± 0.674
3.193ArgSer: 3.193 ± 0.389
4.158ArgThr: 4.158 ± 0.687
4.604ArgVal: 4.604 ± 0.603
1.262ArgTrp: 1.262 ± 0.371
1.634ArgTyr: 1.634 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
6.906SerAla: 6.906 ± 0.587
0.446SerCys: 0.446 ± 0.202
2.822SerAsp: 2.822 ± 0.537
2.599SerGlu: 2.599 ± 0.366
1.337SerPhe: 1.337 ± 0.368
5.94SerGly: 5.94 ± 0.812
0.668SerHis: 0.668 ± 0.211
2.822SerIle: 2.822 ± 0.403
2.079SerLys: 2.079 ± 0.454
3.787SerLeu: 3.787 ± 0.549
1.262SerMet: 1.262 ± 0.268
1.708SerAsn: 1.708 ± 0.353
2.97SerPro: 2.97 ± 0.431
2.525SerGln: 2.525 ± 0.429
2.228SerArg: 2.228 ± 0.385
3.564SerSer: 3.564 ± 0.513
4.455SerThr: 4.455 ± 0.563
5.346SerVal: 5.346 ± 0.587
1.188SerTrp: 1.188 ± 0.336
1.262SerTyr: 1.262 ± 0.335
0.0SerXaa: 0.0 ± 0.0
Thr
9.059ThrAla: 9.059 ± 1.065
0.371ThrCys: 0.371 ± 0.149
3.49ThrAsp: 3.49 ± 0.513
3.713ThrGlu: 3.713 ± 0.513
2.599ThrPhe: 2.599 ± 0.474
6.98ThrGly: 6.98 ± 0.701
1.485ThrHis: 1.485 ± 0.344
2.97ThrIle: 2.97 ± 0.384
2.153ThrLys: 2.153 ± 0.404
6.386ThrLeu: 6.386 ± 0.681
1.188ThrMet: 1.188 ± 0.224
3.193ThrAsn: 3.193 ± 0.611
4.53ThrPro: 4.53 ± 0.636
2.302ThrGln: 2.302 ± 0.353
2.97ThrArg: 2.97 ± 0.521
3.564ThrSer: 3.564 ± 0.475
5.198ThrThr: 5.198 ± 0.616
5.792ThrVal: 5.792 ± 0.878
1.337ThrTrp: 1.337 ± 0.41
1.931ThrTyr: 1.931 ± 0.35
0.0ThrXaa: 0.0 ± 0.0
Val
9.133ValAla: 9.133 ± 0.857
0.668ValCys: 0.668 ± 0.201
3.936ValAsp: 3.936 ± 0.608
3.936ValGlu: 3.936 ± 0.537
2.302ValPhe: 2.302 ± 0.41
5.272ValGly: 5.272 ± 0.696
1.262ValHis: 1.262 ± 0.343
3.713ValIle: 3.713 ± 0.489
3.936ValLys: 3.936 ± 0.614
5.94ValLeu: 5.94 ± 0.674
1.188ValMet: 1.188 ± 0.341
2.599ValAsn: 2.599 ± 0.451
3.044ValPro: 3.044 ± 0.461
2.747ValGln: 2.747 ± 0.522
4.678ValArg: 4.678 ± 0.559
3.639ValSer: 3.639 ± 0.48
6.832ValThr: 6.832 ± 0.828
5.94ValVal: 5.94 ± 0.769
1.114ValTrp: 1.114 ± 0.237
2.228ValTyr: 2.228 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
1.485TrpAla: 1.485 ± 0.373
0.074TrpCys: 0.074 ± 0.064
0.817TrpAsp: 0.817 ± 0.196
1.188TrpGlu: 1.188 ± 0.297
0.817TrpPhe: 0.817 ± 0.204
1.485TrpGly: 1.485 ± 0.286
0.371TrpHis: 0.371 ± 0.171
0.371TrpIle: 0.371 ± 0.144
0.594TrpLys: 0.594 ± 0.249
2.97TrpLeu: 2.97 ± 0.447
0.668TrpMet: 0.668 ± 0.203
0.52TrpAsn: 0.52 ± 0.22
1.411TrpPro: 1.411 ± 0.313
0.52TrpGln: 0.52 ± 0.215
0.891TrpArg: 0.891 ± 0.246
1.04TrpSer: 1.04 ± 0.212
1.485TrpThr: 1.485 ± 0.369
1.485TrpVal: 1.485 ± 0.314
0.446TrpTrp: 0.446 ± 0.23
0.297TrpTyr: 0.297 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.856TyrAla: 1.856 ± 0.303
0.297TyrCys: 0.297 ± 0.152
1.411TyrAsp: 1.411 ± 0.33
1.782TyrGlu: 1.782 ± 0.395
0.52TyrPhe: 0.52 ± 0.242
1.856TyrGly: 1.856 ± 0.427
0.149TyrHis: 0.149 ± 0.112
0.817TyrIle: 0.817 ± 0.256
1.411TyrLys: 1.411 ± 0.319
2.228TyrLeu: 2.228 ± 0.341
0.297TyrMet: 0.297 ± 0.139
0.594TyrAsn: 0.594 ± 0.21
1.411TyrPro: 1.411 ± 0.323
0.668TyrGln: 0.668 ± 0.238
1.782TyrArg: 1.782 ± 0.4
0.965TyrSer: 0.965 ± 0.21
1.559TyrThr: 1.559 ± 0.411
0.817TyrVal: 0.817 ± 0.232
0.52TyrTrp: 0.52 ± 0.212
0.52TyrTyr: 0.52 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13468 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski