Amino acid dipepetide frequency for Nairobi sheep disease virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.281AlaAla: 3.281 ± 2.692
1.312AlaCys: 1.312 ± 0.322
2.789AlaAsp: 2.789 ± 0.874
3.117AlaGlu: 3.117 ± 0.699
1.476AlaPhe: 1.476 ± 0.453
3.773AlaGly: 3.773 ± 1.539
0.82AlaHis: 0.82 ± 0.382
3.281AlaIle: 3.281 ± 0.674
2.789AlaLys: 2.789 ± 0.427
5.413AlaLeu: 5.413 ± 2.526
1.476AlaMet: 1.476 ± 0.397
1.64AlaAsn: 1.64 ± 0.562
1.804AlaPro: 1.804 ± 0.554
1.969AlaGln: 1.969 ± 1.47
2.789AlaArg: 2.789 ± 0.608
4.593AlaSer: 4.593 ± 0.366
2.461AlaThr: 2.461 ± 1.313
4.265AlaVal: 4.265 ± 1.093
1.148AlaTrp: 1.148 ± 1.201
0.656AlaTyr: 0.656 ± 0.328
0.0AlaXaa: 0.0 ± 0.0
Cys
1.148CysAla: 1.148 ± 1.201
1.148CysCys: 1.148 ± 0.536
1.804CysAsp: 1.804 ± 0.574
1.64CysGlu: 1.64 ± 0.386
1.476CysPhe: 1.476 ± 0.238
0.492CysGly: 0.492 ± 0.543
0.656CysHis: 0.656 ± 0.244
1.804CysIle: 1.804 ± 0.418
1.312CysLys: 1.312 ± 0.319
2.461CysLeu: 2.461 ± 0.817
0.328CysMet: 0.328 ± 0.362
0.492CysAsn: 0.492 ± 0.296
1.969CysPro: 1.969 ± 0.95
1.148CysGln: 1.148 ± 0.261
1.476CysArg: 1.476 ± 0.453
3.117CysSer: 3.117 ± 0.753
2.461CysThr: 2.461 ± 1.007
1.312CysVal: 1.312 ± 0.488
0.492CysTrp: 0.492 ± 0.296
0.656CysTyr: 0.656 ± 0.475
0.0CysXaa: 0.0 ± 0.0
Asp
2.133AspAla: 2.133 ± 0.922
1.804AspCys: 1.804 ± 0.543
2.133AspAsp: 2.133 ± 0.51
4.265AspGlu: 4.265 ± 0.507
2.461AspPhe: 2.461 ± 0.576
3.937AspGly: 3.937 ± 0.768
0.492AspHis: 0.492 ± 0.263
3.773AspIle: 3.773 ± 0.567
4.429AspLys: 4.429 ± 1.5
4.921AspLeu: 4.921 ± 0.307
0.82AspMet: 0.82 ± 0.765
3.609AspAsn: 3.609 ± 0.962
0.984AspPro: 0.984 ± 0.223
1.148AspGln: 1.148 ± 0.334
3.281AspArg: 3.281 ± 0.928
4.593AspSer: 4.593 ± 0.417
2.789AspThr: 2.789 ± 0.849
3.117AspVal: 3.117 ± 0.486
0.984AspTrp: 0.984 ± 0.712
1.64AspTyr: 1.64 ± 0.171
0.0AspXaa: 0.0 ± 0.0
Glu
2.789GluAla: 2.789 ± 1.048
2.133GluCys: 2.133 ± 0.649
4.429GluAsp: 4.429 ± 1.355
8.694GluGlu: 8.694 ± 2.054
2.461GluPhe: 2.461 ± 1.089
3.117GluGly: 3.117 ± 0.249
1.148GluHis: 1.148 ± 0.837
3.773GluIle: 3.773 ± 0.845
3.773GluLys: 3.773 ± 0.085
9.514GluLeu: 9.514 ± 1.256
2.953GluMet: 2.953 ± 0.464
3.445GluAsn: 3.445 ± 0.59
1.804GluPro: 1.804 ± 0.574
1.804GluGln: 1.804 ± 0.071
3.609GluArg: 3.609 ± 0.935
6.234GluSer: 6.234 ± 0.032
5.085GluThr: 5.085 ± 0.858
6.07GluVal: 6.07 ± 0.595
0.984GluTrp: 0.984 ± 0.366
1.64GluTyr: 1.64 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
2.133PheAla: 2.133 ± 0.392
1.148PheCys: 1.148 ± 0.715
1.476PheAsp: 1.476 ± 0.689
3.117PheGlu: 3.117 ± 1.14
2.789PhePhe: 2.789 ± 0.766
1.804PheGly: 1.804 ± 0.3
0.656PheHis: 0.656 ± 0.724
1.969PheIle: 1.969 ± 0.478
2.461PheLys: 2.461 ± 0.696
4.921PheLeu: 4.921 ± 1.092
0.82PheMet: 0.82 ± 0.217
2.133PheAsn: 2.133 ± 0.53
1.312PhePro: 1.312 ± 0.172
0.984PheGln: 0.984 ± 0.257
1.312PheArg: 1.312 ± 0.319
3.609PheSer: 3.609 ± 1.15
2.789PheThr: 2.789 ± 0.175
2.133PheVal: 2.133 ± 0.478
0.164PheTrp: 0.164 ± 0.088
1.312PheTyr: 1.312 ± 0.481
0.0PheXaa: 0.0 ± 0.0
Gly
2.461GlyAla: 2.461 ± 0.403
1.804GlyCys: 1.804 ± 0.574
2.625GlyAsp: 2.625 ± 0.677
3.773GlyGlu: 3.773 ± 0.729
1.804GlyPhe: 1.804 ± 0.81
2.461GlyGly: 2.461 ± 0.558
0.656GlyHis: 0.656 ± 0.363
2.953GlyIle: 2.953 ± 0.476
4.429GlyLys: 4.429 ± 2.436
7.054GlyLeu: 7.054 ± 0.793
2.133GlyMet: 2.133 ± 0.53
2.461GlyAsn: 2.461 ± 0.986
2.625GlyPro: 2.625 ± 0.452
1.312GlyGln: 1.312 ± 0.481
4.265GlyArg: 4.265 ± 0.622
4.757GlySer: 4.757 ± 0.999
2.953GlyThr: 2.953 ± 1.313
2.789GlyVal: 2.789 ± 0.849
0.492GlyTrp: 0.492 ± 0.263
1.476GlyTyr: 1.476 ± 0.689
0.0GlyXaa: 0.0 ± 0.0
His
1.312HisAla: 1.312 ± 0.481
1.476HisCys: 1.476 ± 0.657
0.0HisAsp: 0.0 ± 0.0
0.656HisGlu: 0.656 ± 0.244
0.492HisPhe: 0.492 ± 0.112
0.656HisGly: 0.656 ± 0.244
0.164HisHis: 0.164 ± 0.088
0.82HisIle: 0.82 ± 0.754
1.312HisLys: 1.312 ± 1.236
2.625HisLeu: 2.625 ± 0.586
0.492HisMet: 0.492 ± 0.112
0.492HisAsn: 0.492 ± 0.112
0.984HisPro: 0.984 ± 0.257
0.656HisGln: 0.656 ± 0.328
0.984HisArg: 0.984 ± 0.223
1.804HisSer: 1.804 ± 0.418
0.82HisThr: 0.82 ± 0.217
0.984HisVal: 0.984 ± 0.366
0.328HisTrp: 0.328 ± 0.176
0.656HisTyr: 0.656 ± 0.475
0.0HisXaa: 0.0 ± 0.0
Ile
2.625IleAla: 2.625 ± 0.344
1.476IleCys: 1.476 ± 0.657
2.789IleAsp: 2.789 ± 0.562
3.773IleGlu: 3.773 ± 0.265
1.804IlePhe: 1.804 ± 0.418
2.297IleGly: 2.297 ± 0.855
0.984IleHis: 0.984 ± 0.223
3.445IleIle: 3.445 ± 0.839
6.234IleLys: 6.234 ± 0.984
5.906IleLeu: 5.906 ± 0.753
1.476IleMet: 1.476 ± 0.212
3.281IleAsn: 3.281 ± 1.324
1.476IlePro: 1.476 ± 0.624
2.953IleGln: 2.953 ± 0.77
2.133IleArg: 2.133 ± 0.53
5.085IleSer: 5.085 ± 1.212
3.281IleThr: 3.281 ± 0.736
3.773IleVal: 3.773 ± 1.041
0.656IleTrp: 0.656 ± 0.328
0.984IleTyr: 0.984 ± 0.257
0.0IleXaa: 0.0 ± 0.0
Lys
4.593LysAla: 4.593 ± 1.403
1.64LysCys: 1.64 ± 0.61
5.085LysAsp: 5.085 ± 1.392
7.546LysGlu: 7.546 ± 0.973
2.953LysPhe: 2.953 ± 0.69
4.101LysGly: 4.101 ± 0.972
1.312LysHis: 1.312 ± 0.727
3.445LysIle: 3.445 ± 0.22
7.382LysLys: 7.382 ± 0.886
8.858LysLeu: 8.858 ± 0.916
1.312LysMet: 1.312 ± 0.325
2.789LysAsn: 2.789 ± 0.308
1.969LysPro: 1.969 ± 0.446
2.953LysGln: 2.953 ± 0.557
3.937LysArg: 3.937 ± 0.687
3.937LysSer: 3.937 ± 0.879
3.773LysThr: 3.773 ± 0.527
4.429LysVal: 4.429 ± 0.586
0.984LysTrp: 0.984 ± 0.779
1.476LysTyr: 1.476 ± 0.345
0.0LysXaa: 0.0 ± 0.0
Leu
5.085LeuAla: 5.085 ± 1.652
3.117LeuCys: 3.117 ± 1.06
5.577LeuAsp: 5.577 ± 1.185
9.843LeuGlu: 9.843 ± 1.722
4.429LeuPhe: 4.429 ± 0.41
4.429LeuGly: 4.429 ± 0.804
2.133LeuHis: 2.133 ± 0.123
5.413LeuIle: 5.413 ± 1.133
8.694LeuLys: 8.694 ± 0.571
12.631LeuLeu: 12.631 ± 2.346
2.297LeuMet: 2.297 ± 0.667
6.398LeuAsn: 6.398 ± 1.378
3.281LeuPro: 3.281 ± 0.797
4.265LeuGln: 4.265 ± 0.626
4.921LeuArg: 4.921 ± 1.156
9.022LeuSer: 9.022 ± 0.721
8.694LeuThr: 8.694 ± 1.347
6.234LeuVal: 6.234 ± 0.928
0.328LeuTrp: 0.328 ± 0.392
3.609LeuTyr: 3.609 ± 0.809
0.0LeuXaa: 0.0 ± 0.0
Met
2.133MetAla: 2.133 ± 0.135
0.164MetCys: 0.164 ± 0.088
1.312MetAsp: 1.312 ± 0.56
1.148MetGlu: 1.148 ± 0.396
0.984MetPhe: 0.984 ± 0.303
1.476MetGly: 1.476 ± 0.388
0.492MetHis: 0.492 ± 0.548
1.148MetIle: 1.148 ± 0.726
1.804MetLys: 1.804 ± 0.481
3.281MetLeu: 3.281 ± 0.503
0.492MetMet: 0.492 ± 0.263
0.82MetAsn: 0.82 ± 0.232
0.656MetPro: 0.656 ± 0.328
1.312MetGln: 1.312 ± 0.293
1.148MetArg: 1.148 ± 0.673
2.461MetSer: 2.461 ± 0.496
0.984MetThr: 0.984 ± 0.592
0.492MetVal: 0.492 ± 0.367
0.492MetTrp: 0.492 ± 0.112
0.164MetTyr: 0.164 ± 0.088
0.0MetXaa: 0.0 ± 0.0
Asn
1.476AsnAla: 1.476 ± 1.168
1.476AsnCys: 1.476 ± 0.335
1.804AsnAsp: 1.804 ± 0.418
1.476AsnGlu: 1.476 ± 0.335
1.312AsnPhe: 1.312 ± 0.481
2.133AsnGly: 2.133 ± 0.912
0.492AsnHis: 0.492 ± 0.263
4.265AsnIle: 4.265 ± 0.93
4.429AsnLys: 4.429 ± 1.381
4.921AsnLeu: 4.921 ± 1.116
0.656AsnMet: 0.656 ± 0.159
1.476AsnAsn: 1.476 ± 0.59
2.133AsnPro: 2.133 ± 1.009
0.984AsnGln: 0.984 ± 0.303
2.953AsnArg: 2.953 ± 0.464
4.101AsnSer: 4.101 ± 0.725
3.281AsnThr: 3.281 ± 0.317
3.445AsnVal: 3.445 ± 0.82
0.984AsnTrp: 0.984 ± 0.313
1.476AsnTyr: 1.476 ± 0.147
0.0AsnXaa: 0.0 ± 0.0
Pro
2.297ProAla: 2.297 ± 0.522
0.492ProCys: 0.492 ± 0.389
2.625ProAsp: 2.625 ± 0.65
2.133ProGlu: 2.133 ± 0.38
0.82ProPhe: 0.82 ± 0.38
2.461ProGly: 2.461 ± 0.737
0.656ProHis: 0.656 ± 0.244
1.476ProIle: 1.476 ± 0.238
2.297ProLys: 2.297 ± 0.616
2.461ProLeu: 2.461 ± 0.496
0.328ProMet: 0.328 ± 0.176
0.656ProAsn: 0.656 ± 0.159
1.148ProPro: 1.148 ± 0.894
0.984ProGln: 0.984 ± 0.366
1.969ProArg: 1.969 ± 0.301
3.445ProSer: 3.445 ± 0.407
2.461ProThr: 2.461 ± 0.751
2.789ProVal: 2.789 ± 0.374
0.82ProTrp: 0.82 ± 0.281
0.82ProTyr: 0.82 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
2.297GlnAla: 2.297 ± 0.208
0.656GlnCys: 0.656 ± 0.159
2.133GlnAsp: 2.133 ± 0.547
2.953GlnGlu: 2.953 ± 0.678
1.64GlnPhe: 1.64 ± 0.409
1.804GlnGly: 1.804 ± 0.556
1.476GlnHis: 1.476 ± 0.453
1.476GlnIle: 1.476 ± 1.064
2.297GlnLys: 2.297 ± 0.42
3.609GlnLeu: 3.609 ± 0.474
1.312GlnMet: 1.312 ± 0.322
2.133GlnAsn: 2.133 ± 1.009
0.82GlnPro: 0.82 ± 0.232
2.297GlnGln: 2.297 ± 0.792
1.804GlnArg: 1.804 ± 0.531
2.953GlnSer: 2.953 ± 0.295
1.969GlnThr: 1.969 ± 0.625
2.297GlnVal: 2.297 ± 0.62
0.0GlnTrp: 0.0 ± 0.0
0.492GlnTyr: 0.492 ± 0.112
0.0GlnXaa: 0.0 ± 0.0
Arg
1.969ArgAla: 1.969 ± 0.605
1.64ArgCys: 1.64 ± 0.368
3.937ArgAsp: 3.937 ± 0.602
1.969ArgGlu: 1.969 ± 0.446
2.133ArgPhe: 2.133 ± 0.547
2.953ArgGly: 2.953 ± 1.52
1.148ArgHis: 1.148 ± 0.261
2.625ArgIle: 2.625 ± 0.677
2.625ArgLys: 2.625 ± 0.382
8.038ArgLeu: 8.038 ± 1.345
1.64ArgMet: 1.64 ± 0.171
2.133ArgAsn: 2.133 ± 0.38
1.476ArgPro: 1.476 ± 0.335
2.625ArgGln: 2.625 ± 1.177
3.445ArgArg: 3.445 ± 0.22
4.265ArgSer: 4.265 ± 0.626
2.297ArgThr: 2.297 ± 0.074
2.953ArgVal: 2.953 ± 0.329
0.328ArgTrp: 0.328 ± 0.122
1.312ArgTyr: 1.312 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
3.937SerAla: 3.937 ± 0.516
2.133SerCys: 2.133 ± 1.42
4.757SerAsp: 4.757 ± 1.069
8.038SerGlu: 8.038 ± 1.402
3.773SerPhe: 3.773 ± 0.277
5.249SerGly: 5.249 ± 0.498
1.476SerHis: 1.476 ± 0.388
5.085SerIle: 5.085 ± 0.246
5.413SerLys: 5.413 ± 0.752
7.71SerLeu: 7.71 ± 1.808
1.804SerMet: 1.804 ± 1.087
3.937SerAsn: 3.937 ± 0.893
2.625SerPro: 2.625 ± 1.12
2.461SerGln: 2.461 ± 0.496
4.265SerArg: 4.265 ± 0.955
8.53SerSer: 8.53 ± 1.52
5.906SerThr: 5.906 ± 1.339
6.234SerVal: 6.234 ± 0.473
1.148SerTrp: 1.148 ± 0.771
2.133SerTyr: 2.133 ± 0.894
0.0SerXaa: 0.0 ± 0.0
Thr
3.609ThrAla: 3.609 ± 1.056
1.148ThrCys: 1.148 ± 0.771
4.593ThrAsp: 4.593 ± 0.606
5.413ThrGlu: 5.413 ± 0.284
1.969ThrPhe: 1.969 ± 0.478
5.413ThrGly: 5.413 ± 1.035
1.312ThrHis: 1.312 ± 0.322
3.445ThrIle: 3.445 ± 1.331
3.773ThrLys: 3.773 ± 1.509
5.741ThrLeu: 5.741 ± 1.793
0.82ThrMet: 0.82 ± 0.217
2.297ThrAsn: 2.297 ± 0.208
2.133ThrPro: 2.133 ± 0.547
2.625ThrGln: 2.625 ± 0.162
1.969ThrArg: 1.969 ± 0.446
4.593ThrSer: 4.593 ± 0.631
5.249ThrThr: 5.249 ± 1.199
5.085ThrVal: 5.085 ± 0.922
0.82ThrTrp: 0.82 ± 0.415
1.476ThrTyr: 1.476 ± 0.147
0.0ThrXaa: 0.0 ± 0.0
Val
3.609ValAla: 3.609 ± 2.57
1.312ValCys: 1.312 ± 0.325
2.133ValAsp: 2.133 ± 0.123
4.265ValGlu: 4.265 ± 0.811
2.297ValPhe: 2.297 ± 0.522
3.117ValGly: 3.117 ± 0.486
0.82ValHis: 0.82 ± 0.232
4.265ValIle: 4.265 ± 0.994
5.741ValLys: 5.741 ± 0.404
6.562ValLeu: 6.562 ± 0.545
0.656ValMet: 0.656 ± 0.159
2.297ValAsn: 2.297 ± 0.315
3.117ValPro: 3.117 ± 1.174
3.281ValGln: 3.281 ± 0.797
3.937ValArg: 3.937 ± 1.444
6.89ValSer: 6.89 ± 1.123
3.773ValThr: 3.773 ± 0.395
5.249ValVal: 5.249 ± 0.986
0.0ValTrp: 0.0 ± 0.0
1.312ValTyr: 1.312 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
0.492TrpAla: 0.492 ± 0.548
0.492TrpCys: 0.492 ± 0.296
0.328TrpAsp: 0.328 ± 0.392
0.656TrpGlu: 0.656 ± 0.244
0.492TrpPhe: 0.492 ± 0.389
1.476TrpGly: 1.476 ± 0.605
0.164TrpHis: 0.164 ± 0.433
0.492TrpIle: 0.492 ± 0.389
1.969TrpLys: 1.969 ± 0.049
1.476TrpLeu: 1.476 ± 0.345
0.164TrpMet: 0.164 ± 0.181
0.328TrpAsn: 0.328 ± 0.362
0.492TrpPro: 0.492 ± 0.543
0.164TrpGln: 0.164 ± 0.088
0.328TrpArg: 0.328 ± 0.459
0.82TrpSer: 0.82 ± 0.232
0.492TrpThr: 0.492 ± 0.389
0.492TrpVal: 0.492 ± 0.263
0.164TrpTrp: 0.164 ± 0.088
0.164TrpTyr: 0.164 ± 0.088
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.476TyrAla: 1.476 ± 0.345
0.82TyrCys: 0.82 ± 0.415
0.984TyrAsp: 0.984 ± 0.223
0.984TyrGlu: 0.984 ± 0.415
1.476TyrPhe: 1.476 ± 0.335
2.133TyrGly: 2.133 ± 0.453
0.656TyrHis: 0.656 ± 0.159
1.64TyrIle: 1.64 ± 0.368
1.312TyrLys: 1.312 ± 0.524
2.297TyrLeu: 2.297 ± 0.792
0.82TyrMet: 0.82 ± 0.281
2.461TyrAsn: 2.461 ± 0.496
0.328TyrPro: 0.328 ± 0.176
0.492TyrGln: 0.492 ± 0.389
0.984TyrArg: 0.984 ± 0.257
1.804TyrSer: 1.804 ± 0.229
1.969TyrThr: 1.969 ± 0.732
0.492TyrVal: 0.492 ± 0.112
0.328TyrTrp: 0.328 ± 0.392
0.492TyrTyr: 0.492 ± 0.367
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (6097 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski