Amino acid dipeptide frequency for Sulfolobus islandicus filamentous virus (isolate Iceland/Hveragerdi) (SIFV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
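As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and normalising by the total number of dipeptides is an assumption; the database may use a different denominator.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical one-letter sequences, not SIFV data.
toy_proteins = ["MKKLA", "MKV"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here `freqs` maps each observed pair (in one-letter code) to its frequency; pairs that never occur are simply absent from the dictionary.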

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random, with every amino acid equally likely, each dipeptide would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
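The arithmetic behind the uniform expectation and the unit conversion can be sketched as follows. The helper `classify` is hypothetical, and the simple comparison against 2.5‰ is an assumption: the page does not state the exact rule used for the red and blue marking. The example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation over 20 x 20 dipeptides, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2   # 1/400 = 0.25% = 2.5 per mille

def classify(observed_permille, expected=expected_permille):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"IleIle": 9.994, "GluIle": 8.52, "CysCys": 0.0, "AlaAsp": 2.294}
labels = {pair: classify(value) for pair, value in examples.items()}

# Converting a per mille value to a plain fraction: multiply by 10^-3.
ile_ile_fraction = examples["IleIle"] * 1e-3
```

With these inputs IleIle and GluIle come out as overrepresented, while CysCys and AlaAsp fall below the 2.5‰ expectation.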

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and given as dipeptide: frequency ± error (‰).

Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.246 ± 0.142
AlaAsp: 2.294 ± 0.471
AlaGlu: 2.949 ± 0.576
AlaPhe: 2.376 ± 0.505
AlaGly: 2.376 ± 0.688
AlaHis: 0.41 ± 0.169
AlaIle: 4.424 ± 0.547
AlaLys: 3.85 ± 0.522
AlaLeu: 6.062 ± 0.791
AlaMet: 1.311 ± 0.386
AlaAsn: 2.13 ± 0.444
AlaPro: 1.311 ± 0.351
AlaGln: 1.229 ± 0.391
AlaArg: 1.475 ± 0.379
AlaSer: 3.195 ± 0.772
AlaThr: 3.768 ± 0.645
AlaVal: 3.277 ± 0.475
AlaTrp: 0.41 ± 0.171
AlaTyr: 2.785 ± 0.418
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.492 ± 0.207
CysCys: 0.0 ± 0.0
CysAsp: 0.655 ± 0.213
CysGlu: 0.983 ± 0.301
CysPhe: 0.492 ± 0.253
CysGly: 0.819 ± 0.275
CysHis: 0.164 ± 0.134
CysIle: 1.72 ± 0.373
CysLys: 0.901 ± 0.311
CysLeu: 0.492 ± 0.222
CysMet: 0.164 ± 0.106
CysAsn: 0.655 ± 0.184
CysPro: 0.573 ± 0.231
CysGln: 0.246 ± 0.141
CysArg: 0.246 ± 0.153
CysSer: 1.065 ± 0.306
CysThr: 0.819 ± 0.278
CysVal: 0.901 ± 0.269
CysTrp: 0.082 ± 0.085
CysTyr: 0.573 ± 0.208
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.376 ± 0.461
AspCys: 0.737 ± 0.307
AspAsp: 2.703 ± 0.545
AspGlu: 3.932 ± 0.594
AspPhe: 2.294 ± 0.376
AspGly: 2.13 ± 0.328
AspHis: 0.573 ± 0.227
AspIle: 4.096 ± 0.593
AspLys: 2.949 ± 0.492
AspLeu: 4.014 ± 0.609
AspMet: 1.229 ± 0.351
AspAsn: 2.13 ± 0.395
AspPro: 1.556 ± 0.348
AspGln: 1.147 ± 0.304
AspArg: 1.311 ± 0.337
AspSer: 2.212 ± 0.452
AspThr: 1.638 ± 0.379
AspVal: 5.407 ± 0.708
AspTrp: 0.573 ± 0.198
AspTyr: 1.802 ± 0.445
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.785 ± 0.424
GluCys: 0.983 ± 0.322
GluAsp: 3.523 ± 0.544
GluGlu: 6.39 ± 0.963
GluPhe: 3.031 ± 0.557
GluGly: 2.703 ± 0.435
GluHis: 0.901 ± 0.276
GluIle: 8.52 ± 0.937
GluLys: 6.799 ± 1.001
GluLeu: 6.39 ± 0.673
GluMet: 2.13 ± 0.416
GluAsn: 4.014 ± 0.564
GluPro: 1.147 ± 0.318
GluGln: 1.884 ± 0.511
GluArg: 2.867 ± 0.449
GluSer: 2.703 ± 0.474
GluThr: 3.113 ± 0.535
GluVal: 4.096 ± 0.644
GluTrp: 0.819 ± 0.272
GluTyr: 3.932 ± 0.614
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.376 ± 0.53
PheCys: 0.573 ± 0.266
PheAsp: 2.54 ± 0.439
PheGlu: 2.458 ± 0.374
PhePhe: 2.458 ± 0.515
PheGly: 2.048 ± 0.44
PheHis: 0.655 ± 0.246
PheIle: 3.359 ± 0.507
PheLys: 3.441 ± 0.645
PheLeu: 4.751 ± 0.504
PheMet: 1.229 ± 0.315
PheAsn: 2.867 ± 0.509
PhePro: 0.901 ± 0.226
PheGln: 1.393 ± 0.332
PheArg: 1.311 ± 0.337
PheSer: 3.441 ± 0.613
PheThr: 3.686 ± 0.605
PheVal: 3.277 ± 0.514
PheTrp: 0.164 ± 0.112
PheTyr: 2.54 ± 0.402
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.458 ± 0.585
GlyCys: 1.065 ± 0.303
GlyAsp: 1.884 ± 0.429
GlyGlu: 2.621 ± 0.499
GlyPhe: 1.802 ± 0.411
GlyGly: 1.966 ± 0.639
GlyHis: 0.737 ± 0.201
GlyIle: 5.161 ± 0.586
GlyLys: 4.997 ± 0.652
GlyLeu: 4.096 ± 0.601
GlyMet: 1.065 ± 0.312
GlyAsn: 4.096 ± 0.571
GlyPro: 0.0 ± 0.0
GlyGln: 0.819 ± 0.245
GlyArg: 1.966 ± 0.529
GlySer: 1.966 ± 0.451
GlyThr: 3.277 ± 0.489
GlyVal: 3.604 ± 0.673
GlyTrp: 0.492 ± 0.204
GlyTyr: 3.113 ± 0.496
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.901 ± 0.25
HisCys: 0.328 ± 0.158
HisAsp: 0.901 ± 0.26
HisGlu: 0.901 ± 0.32
HisPhe: 0.573 ± 0.238
HisGly: 0.492 ± 0.201
HisHis: 0.246 ± 0.12
HisIle: 1.311 ± 0.353
HisLys: 0.655 ± 0.268
HisLeu: 0.901 ± 0.284
HisMet: 0.655 ± 0.223
HisAsn: 1.147 ± 0.294
HisPro: 0.492 ± 0.22
HisGln: 0.164 ± 0.111
HisArg: 0.328 ± 0.189
HisSer: 0.983 ± 0.235
HisThr: 0.655 ± 0.246
HisVal: 1.966 ± 0.386
HisTrp: 0.082 ± 0.107
HisTyr: 0.983 ± 0.236
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.652 ± 0.851
IleCys: 0.737 ± 0.239
IleAsp: 4.506 ± 0.512
IleGlu: 6.963 ± 0.785
IlePhe: 3.932 ± 0.584
IleGly: 4.014 ± 0.573
IleHis: 1.802 ± 0.359
IleIle: 9.994 ± 1.044
IleLys: 8.028 ± 0.942
IleLeu: 6.717 ± 0.817
IleMet: 1.884 ± 0.351
IleAsn: 5.652 ± 0.783
IlePro: 4.751 ± 0.576
IleGln: 3.359 ± 0.527
IleArg: 3.277 ± 0.655
IleSer: 5.571 ± 0.609
IleThr: 5.243 ± 0.616
IleVal: 6.39 ± 0.781
IleTrp: 0.41 ± 0.184
IleTyr: 5.816 ± 0.717
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.949 ± 0.642
LysCys: 0.655 ± 0.284
LysAsp: 3.359 ± 0.624
LysGlu: 7.127 ± 0.721
LysPhe: 2.54 ± 0.54
LysGly: 3.113 ± 0.508
LysHis: 1.72 ± 0.315
LysIle: 7.619 ± 0.969
LysLys: 7.782 ± 0.957
LysLeu: 7.7 ± 0.796
LysMet: 2.54 ± 0.509
LysAsn: 5.571 ± 0.648
LysPro: 1.884 ± 0.328
LysGln: 2.785 ± 0.524
LysArg: 2.785 ± 0.458
LysSer: 3.604 ± 0.599
LysThr: 4.014 ± 0.528
LysVal: 4.588 ± 0.596
LysTrp: 0.492 ± 0.194
LysTyr: 5.489 ± 0.78
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.26 ± 0.602
LeuCys: 1.393 ± 0.357
LeuAsp: 3.686 ± 0.641
LeuGlu: 5.652 ± 0.718
LeuPhe: 4.833 ± 0.621
LeuGly: 3.441 ± 0.633
LeuHis: 0.983 ± 0.269
LeuIle: 6.39 ± 0.764
LeuLys: 7.455 ± 0.863
LeuLeu: 8.847 ± 0.941
LeuMet: 2.621 ± 0.477
LeuAsn: 5.243 ± 0.682
LeuPro: 3.85 ± 0.498
LeuGln: 3.686 ± 0.62
LeuArg: 4.342 ± 0.616
LeuSer: 6.717 ± 0.929
LeuThr: 6.144 ± 0.635
LeuVal: 6.062 ± 0.635
LeuTrp: 1.065 ± 0.243
LeuTyr: 4.26 ± 0.748
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.229 ± 0.419
MetCys: 0.41 ± 0.21
MetAsp: 0.328 ± 0.148
MetGlu: 2.13 ± 0.322
MetPhe: 0.901 ± 0.247
MetGly: 1.311 ± 0.346
MetHis: 0.246 ± 0.168
MetIle: 2.376 ± 0.413
MetLys: 1.966 ± 0.343
MetLeu: 1.802 ± 0.443
MetMet: 0.328 ± 0.172
MetAsn: 0.983 ± 0.264
MetPro: 1.065 ± 0.286
MetGln: 0.819 ± 0.272
MetArg: 1.638 ± 0.519
MetSer: 2.54 ± 0.455
MetThr: 1.311 ± 0.353
MetVal: 2.621 ± 0.547
MetTrp: 0.164 ± 0.113
MetTyr: 1.147 ± 0.34
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.195 ± 0.382
AsnCys: 0.492 ± 0.268
AsnAsp: 2.867 ± 0.47
AsnGlu: 4.751 ± 0.629
AsnPhe: 2.621 ± 0.535
AsnGly: 3.768 ± 0.493
AsnHis: 0.819 ± 0.245
AsnIle: 5.489 ± 0.626
AsnLys: 4.26 ± 0.719
AsnLeu: 5.571 ± 0.661
AsnMet: 2.048 ± 0.404
AsnAsn: 3.85 ± 0.637
AsnPro: 1.802 ± 0.4
AsnGln: 1.393 ± 0.303
AsnArg: 1.475 ± 0.363
AsnSer: 4.178 ± 0.53
AsnThr: 3.768 ± 0.815
AsnVal: 5.407 ± 0.574
AsnTrp: 0.492 ± 0.206
AsnTyr: 3.523 ± 0.488
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.475 ± 0.353
ProCys: 0.655 ± 0.216
ProAsp: 1.393 ± 0.396
ProGlu: 2.13 ± 0.393
ProPhe: 1.475 ± 0.247
ProGly: 1.556 ± 0.302
ProHis: 0.492 ± 0.267
ProIle: 3.195 ± 0.537
ProLys: 1.884 ± 0.428
ProLeu: 2.621 ± 0.4
ProMet: 0.492 ± 0.244
ProAsn: 2.294 ± 0.419
ProPro: 2.13 ± 0.565
ProGln: 1.393 ± 0.394
ProArg: 1.065 ± 0.29
ProSer: 4.588 ± 0.753
ProThr: 1.638 ± 0.401
ProVal: 1.802 ± 0.414
ProTrp: 0.164 ± 0.108
ProTyr: 1.638 ± 0.598
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.393 ± 0.387
GlnCys: 0.492 ± 0.173
GlnAsp: 1.229 ± 0.356
GlnGlu: 1.638 ± 0.41
GlnPhe: 1.556 ± 0.337
GlnGly: 2.458 ± 0.452
GlnHis: 0.246 ± 0.187
GlnIle: 3.031 ± 0.494
GlnLys: 2.13 ± 0.362
GlnLeu: 4.014 ± 0.699
GlnMet: 0.655 ± 0.234
GlnAsn: 2.294 ± 0.48
GlnPro: 1.229 ± 0.43
GlnGln: 1.802 ± 0.478
GlnArg: 1.065 ± 0.333
GlnSer: 1.966 ± 0.446
GlnThr: 1.393 ± 0.44
GlnVal: 1.638 ± 0.353
GlnTrp: 0.655 ± 0.276
GlnTyr: 1.802 ± 0.399
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.556 ± 0.375
ArgCys: 0.246 ± 0.15
ArgAsp: 1.556 ± 0.318
ArgGlu: 3.277 ± 0.635
ArgPhe: 1.556 ± 0.348
ArgGly: 2.621 ± 0.587
ArgHis: 0.819 ± 0.257
ArgIle: 3.768 ± 0.536
ArgLys: 2.949 ± 0.477
ArgLeu: 3.277 ± 0.512
ArgMet: 1.229 ± 0.345
ArgAsn: 1.966 ± 0.38
ArgPro: 0.819 ± 0.279
ArgGln: 1.065 ± 0.258
ArgArg: 1.72 ± 0.398
ArgSer: 1.475 ± 0.32
ArgThr: 1.147 ± 0.256
ArgVal: 2.13 ± 0.402
ArgTrp: 0.164 ± 0.109
ArgTyr: 1.966 ± 0.371
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.195 ± 0.555
SerCys: 1.065 ± 0.346
SerAsp: 3.195 ± 0.614
SerGlu: 4.26 ± 0.667
SerPhe: 3.195 ± 0.579
SerGly: 3.932 ± 0.741
SerHis: 0.983 ± 0.265
SerIle: 5.243 ± 0.613
SerLys: 3.523 ± 0.531
SerLeu: 5.652 ± 0.732
SerMet: 1.802 ± 0.348
SerAsn: 3.768 ± 0.627
SerPro: 2.54 ± 0.474
SerGln: 2.294 ± 0.427
SerArg: 1.475 ± 0.296
SerSer: 5.98 ± 0.917
SerThr: 4.751 ± 0.844
SerVal: 5.571 ± 0.548
SerTrp: 0.492 ± 0.182
SerTyr: 4.669 ± 0.628
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.54 ± 0.569
ThrCys: 0.164 ± 0.109
ThrAsp: 1.72 ± 0.346
ThrGlu: 2.54 ± 0.388
ThrPhe: 3.277 ± 0.597
ThrGly: 2.54 ± 0.547
ThrHis: 1.147 ± 0.33
ThrIle: 6.554 ± 0.739
ThrLys: 4.178 ± 0.701
ThrLeu: 7.209 ± 0.814
ThrMet: 0.819 ± 0.276
ThrAsn: 3.523 ± 0.643
ThrPro: 2.458 ± 0.357
ThrGln: 3.195 ± 0.86
ThrArg: 1.966 ± 0.449
ThrSer: 5.652 ± 0.888
ThrThr: 3.768 ± 0.86
ThrVal: 4.26 ± 0.617
ThrTrp: 0.328 ± 0.155
ThrTyr: 4.014 ± 0.552
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.686 ± 0.619
ValCys: 0.655 ± 0.223
ValAsp: 2.949 ± 0.484
ValGlu: 4.178 ± 0.679
ValPhe: 3.277 ± 0.495
ValGly: 2.867 ± 0.549
ValHis: 0.819 ± 0.251
ValIle: 6.144 ± 0.62
ValLys: 6.799 ± 0.76
ValLeu: 5.898 ± 0.603
ValMet: 1.311 ± 0.249
ValAsn: 5.571 ± 0.738
ValPro: 3.031 ± 0.451
ValGln: 2.048 ± 0.413
ValArg: 2.703 ± 0.5
ValSer: 5.898 ± 0.741
ValThr: 6.062 ± 0.903
ValVal: 4.342 ± 0.745
ValTrp: 0.492 ± 0.241
ValTyr: 4.26 ± 0.619
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.246 ± 0.145
TrpCys: 0.328 ± 0.148
TrpAsp: 0.41 ± 0.172
TrpGlu: 0.819 ± 0.237
TrpPhe: 0.164 ± 0.135
TrpGly: 0.41 ± 0.175
TrpHis: 0.164 ± 0.111
TrpIle: 0.737 ± 0.219
TrpLys: 0.41 ± 0.158
TrpLeu: 0.492 ± 0.185
TrpMet: 0.246 ± 0.174
TrpAsn: 0.655 ± 0.187
TrpPro: 0.0 ± 0.0
TrpGln: 0.328 ± 0.169
TrpArg: 0.573 ± 0.196
TrpSer: 0.41 ± 0.184
TrpThr: 0.41 ± 0.173
TrpVal: 0.573 ± 0.212
TrpTrp: 0.328 ± 0.181
TrpTyr: 0.655 ± 0.319
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.195 ± 0.539
TyrCys: 0.819 ± 0.294
TyrAsp: 3.113 ± 0.589
TyrGlu: 3.031 ± 0.66
TyrPhe: 3.113 ± 0.55
TyrGly: 2.621 ± 0.507
TyrHis: 0.737 ± 0.25
TyrIle: 5.571 ± 0.858
TyrLys: 3.359 ± 0.546
TyrLeu: 4.424 ± 0.712
TyrMet: 1.475 ± 0.34
TyrAsn: 3.359 ± 0.457
TyrPro: 2.54 ± 0.482
TyrGln: 1.638 ± 0.303
TyrArg: 1.72 ± 0.372
TyrSer: 3.441 ± 0.569
TyrThr: 5.243 ± 0.65
TyrVal: 5.079 ± 0.755
TyrTrp: 0.492 ± 0.17
TyrTyr: 2.376 ± 0.451
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (12208 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
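A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the ± value are all assumptions; the page only states that bootstrapping with 100 repetitions at the protein level was used.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical one-letter sequences, not SIFV data.
toy_proteins = ["MKKLA", "MKV", "AKKKE", "LLKKA"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.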

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski