Skip to content
Permalink
Arunpravin/drm…
Switch branches/tags

Commits on Nov 16, 2021

  1. drm/amdgpu: add drm buddy support to amdgpu

    - Remove drm_mm references and replace with drm buddy functionalities
    - Add res cursor support for drm buddy
    
    v2(Matthew Auld):
      - replace spinlock with mutex as we call kmem_cache_zalloc(..., GFP_KERNEL)
        in drm_buddy_alloc() function
      - lock drm_buddy_block_trim() function as it calls mark_free/mark_split
        are all globally visible
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021
  2. drm/amdgpu: move vram inline functions into a header

    Move shared vram inline functions and structs
    into a header file
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021
  3. drm: implement a method to free unused pages

    On contiguous allocation, we round up the size
    to the *next* power of 2, implement a function
    to free the unused pages after the newly allocate block.
    
    v2(Matthew Auld):
      - replace function name 'drm_buddy_free_unused_pages' with
        drm_buddy_block_trim
      - replace input argument name 'actual_size' with 'new_size'
      - add more validation checks for input arguments
      - add overlaps check to avoid needless searching and splitting
      - merged the below patch to see the feature in action
        - add free unused pages support to i915 driver
      - lock drm_buddy_block_trim() function as it calls mark_free/mark_split
        are all globally visible
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021
  4. drm: implement top-down allocation method

    Implemented a function which walk through the order list,
    compares the offset and returns the maximum offset block,
    this method is unpredictable in obtaining the high range
    address blocks which depends on allocation and deallocation.
    for instance, if driver requests address at a low specific
    range, allocator traverses from the root block and splits
    the larger blocks until it reaches the specific block and
    in the process of splitting, lower orders in the freelist
    are occupied with low range address blocks and for the
    subsequent TOPDOWN memory request we may return the low
    range blocks.To overcome this issue, we may go with the
    below approach.
    
    The other approach, sorting each order list entries in
    ascending order and compares the last entry of each
    order list in the freelist and return the max block.
    This creates sorting overhead on every drm_buddy_free()
    request and split up of larger blocks for a single page
    request.
    
    v2:
      - Fix alignment issues(Matthew Auld)
      - Remove unnecessary list_empty check(Matthew Auld)
      - merged the below patch to see the feature in action
        - add top-down alloc support to i915 driver
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021
  5. drm: improve drm_buddy_alloc function

    - Make drm_buddy_alloc a single function to handle
      range allocation and non-range allocation demands
    
    - Implemented a new function alloc_range() which allocates
      the requested power-of-two block comply with range limitations
    
    - Moved order computation and memory alignment logic from
      i915 driver to drm buddy
    
    v2:
      merged below changes to keep the build unbroken
       - drm_buddy_alloc_range() becomes obsolete and may be removed
       - enable ttm range allocation (fpfn / lpfn) support in i915 driver
       - apply enhanced drm_buddy_alloc() function to i915 driver
    
    v3(Matthew Auld):
      - Fix alignment issues and remove unnecessary list_empty check
      - add more validation checks for input arguments
      - make alloc_range() block allocations as bottom-up
      - optimize order computation logic
      - replace uint64_t with u64, which is preferred in the kernel
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021
  6. drm: move the buddy allocator from i915 into common drm

    Move the base i915 buddy allocator code into drm
    - Move i915_buddy.h to include/drm
    - Move i915_buddy.c to drm root folder
    - Rename "i915" string with "drm" string wherever applicable
    - Rename "I915" string with "DRM" string wherever applicable
    - Fix header file dependencies
    - Fix alignment issues
    - add Makefile support for drm buddy
    - export functions and write kerneldoc description
    - Remove i915 selftest config check condition as buddy selftest
      will be moved to drm selftest folder
    
    cleanup i915 buddy references in i915 driver module
    and replace with drm buddy
    
    v2:
      - include header file in alphabetical order (Thomas)
      - merged changes listed in the body section into a single patch
        to keep the build intact (Christian, Jani)
    
    Signed-off-by: Arunpravin <Arunpravin.PaneerSelvam@amd.com>
    arunpravin24 authored and intel-lab-lkp committed Nov 16, 2021

Commits on Nov 12, 2021

  1. Merge tag 'drm-misc-fixes-2021-11-11' of git://anongit.freedesktop.or…

    …g/drm/drm-misc into drm-next
    
     * dma-buf: name_lock fixes
     * prime: Keep object ref during mmap
     * nouveau: Fix a refcount issue; Fix device removal; Protect client
       list with dedicated mutex; Fix address CE0 address calculation
     * ttm: Fix race condition during BO eviction
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    
    From: Thomas Zimmermann <tzimmermann@suse.de>
    Link: https://patchwork.freedesktop.org/patch/msgid/YYzY6jeox9EeI15i@linux-uq9g.fritz.box
    airlied committed Nov 12, 2021

Commits on Nov 11, 2021

  1. BackMerge tag 'v5.15' into drm-next

    I got a drm-fixes which had some 5.15 stuff in it, so to avoid
    the mess just backmerge here.
    
    Linux 5.15
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    airlied committed Nov 11, 2021
  2. drm/ttm: Double check mem_type of BO while eviction

    BO might sit in a wrong lru list as there is a small period of memory
    moving and lru list updating.
    
    Lets skip eviction if we hit such mismatch.
    
    Suggested-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: xinhui pan <xinhui.pan@amd.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211110043149.57554-2-xinhui.pan@amd.com
    Signed-off-by: Christian König <christian.koenig@amd.com>
    xinhui pan authored and ChristianKoenigAMD committed Nov 11, 2021
  3. Merge tag 'amd-drm-fixes-5.16-2021-11-10' of https://gitlab.freedeskt…

    …op.org/agd5f/linux into drm-next
    
    amd-drm-fixes-5.16-2021-11-10:
    
    amdgpu:
    - Don't allow partial copy from user for DC debugfs
    - SRIOV fixes
    - GFX9 CSB pin count fix
    - Various IP version check fixes
    - DP 2.0 fixes
    - Limit DCN1 MPO fix to DCN1
    
    amdkfd:
    - SVM fixes
    - Reset fixes
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    From: Alex Deucher <alexander.deucher@amd.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211110222536.7527-1-alexander.deucher@amd.com
    airlied committed Nov 11, 2021

Commits on Nov 10, 2021

  1. Merge tag 'drm-misc-next-fixes-2021-11-10' of git://anongit.freedeskt…

    …op.org/drm/drm-misc into drm-next
    
    Removed the TTM Huge Page functionnality to address a crash, a timeout
    fix for udl, CONFIG_FB dependency improvements, a fix for a circular
    locking depency in imx, a NULL pointer dereference fix for virtio, and a
    naming collision fix for drm/locking.
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    
    From: Maxime Ripard <maxime@cerno.tech>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211110082114.vfpkpnecwdfg27lk@gilmour
    airlied committed Nov 10, 2021
  2. drm/amdgpu: add missed support for UVD IP_VERSION(3, 0, 64)

    Fixes: 96b8dd4 ("drm/amdgpu/amdgpu_vcn: convert to IP version checking")
    Signed-off-by: Flora Cui <flora.cui@amd.com>
    Signed-off-by: Guchun Chen <guchun.chen@amd.com>
    Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Guchun Chen authored and Alex Deucher committed Nov 10, 2021
  3. drm/amdgpu: drop jpeg IP initialization in SRIOV case

    Fixes: b05b9c5 ("drm/amdgpu: clean up set IP function")
    Signed-off-by: Guchun Chen <guchun.chen@amd.com>
    Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Guchun Chen authored and Alex Deucher committed Nov 10, 2021
  4. drm/amd/display: reject both non-zero src_x and src_y only for DCN1x

    [Why]
    Video plane gets rejected for non-zero src_y and src_x on DCN2.x.
    
    [How]
    Limit the rejection till DCN1.x and verified MPO, by dragging video
    playback beyond display's left (0, 0) co-ordinates.
    
    Fixes: d89f604 ("drm/amd/display: Reject non-zero src_y and src_x for video planes")
    Signed-off-by: Shirish S <shirish.s@amd.com>
    Reviewed-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Shirish S authored and Alex Deucher committed Nov 10, 2021
  5. Merge tag 'drm-intel-next-fixes-2021-11-09' of git://anongit.freedesk…

    …top.org/drm/drm-intel into drm-next
    
    Couple Reverts, build fix, couple virtualization fixes,
    blank screen and other display rates fixes, and more.
    
    Four patches targeting stable in here.
    
    Display Fixes:
    - DP rates related fixes (Imre, Jani)
    - A Revert on disaling dual eDP that was causing state readout problems (Jani)
    - put the cdclk vtables in const data (Jani)
    - Fix DVO port type for moder platforms (Ville)
    - Fix blankscreen by turning DP++ TMDS output buffers on encoder->shutdown (Ville)
    - CCS FBs related fixes (Imre)
    
    GT fixes:
    - Fix recursive lock in GuC submission (Matt Brost)
    - Revert guc_id from i915_request tracepoint (Joonas)
    - Build fix around dmabuf (Matt Auld)
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    
    From: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/YYsBif3HMi8GjLoU@intel.com
    airlied committed Nov 10, 2021
  6. Merge tag 'drm-misc-next-fixes-2021-11-05' of git://anongit.freedeskt…

    …op.org/drm/drm-misc into drm-next
    
    A refcounting fix for outstanding fence callbacks.
    
    Signed-off-by: Dave Airlie <airlied@redhat.com>
    
    From: Maxime Ripard <maxime@cerno.tech>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211105074318.oy6rwjr5wcw6qpjj@gilmour
    airlied committed Nov 10, 2021

Commits on Nov 9, 2021

  1. drm/amd/display: Add callbacks for DMUB HPD IRQ notifications

    [Why]
    We need HPD IRQ notifications (RX, short pulse) to properly handle
    DP MST for DPIA connections.
    
    [How]
    A null pointer exception currently occurs when these are received
    so add a check to validate that we have a handler installed for
    the notification.
    
    Extend the HPD handler to also handle HPD IRQ (RX) since the logic is
    the same.
    
    Fixes: e27c41d ("drm/amd/display: Support for DMUB HPD interrupt handling")
    
    Reviewed-by: Wayne Lin <Wayne.Lin@amd.com>
    Reviewed-by: Jude Shih <shenshih@amd.com>
    Acked-by: Anson Jacob <Anson.Jacob@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Nicholas Kazlauskas authored and Alex Deucher committed Nov 9, 2021
  2. drm/amd/display: Don't lock connection_mutex for DMUB HPD

    [Why]
    Per DRM spec we only need to hold that lock when touching
    connector->state - which we do not do in that handler.
    
    Taking this locking introduces unnecessary dependencies with other
    threads which is bad for performance and opens up the potential for
    a deadlock since there are multiple locks being held at once.
    
    [How]
    Remove the connection_mutex lock/unlock routine and just iterate over
    the drm connectors normally. The iter helpers implicitly lock the
    connection list so this is safe to do.
    
    DC link access also does not need to be guarded since the link
    table is static at creation - we don't dynamically add or remove links,
    just streams.
    
    Fixes: e27c41d ("drm/amd/display: Support for DMUB HPD interrupt handling")
    
    Reviewed-by: Jude Shih <shenshih@amd.com>
    Acked-by: Anson Jacob <Anson.Jacob@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Nicholas Kazlauskas authored and Alex Deucher committed Nov 9, 2021
  3. drm/amd/display: Add comment where CONFIG_DRM_AMD_DC_DCN macro ends

    Trivial patch which adds a comment for macro
    endif's in amdgpu_dm.c
    
    Reviewed-by: Ariel Bernstein <Eric.Bernstein@amd.com>
    Reviewed-by: Harry Wentland <Harry.Wentland@amd.com>
    Acked-by: Anson Jacob <Anson.Jacob@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Anson Jacob <Anson.Jacob@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Anson Jacob authored and Alex Deucher committed Nov 9, 2021
  4. drm/amdkfd: Fix retry fault drain race conditions

    The check for whether to drain retry faults must be under the mmap write
    lock to serialize with munmap notifier callbacks.
    
    We were also missing checks on child ranges. To fix that, simplify the
    logic by using a flag rather than checking on each prange. That also
    allows draining less freqeuntly when many ranges are unmapped at once.
    
    Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Tested-by: Philip Yang <Philip.Yang@amd.com>
    Tested-by: Alex Sierra <Alex.Sierra@amd.com>
    Reviewed-by: Philip Yang <Philip.Yang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    fxkamd authored and Alex Deucher committed Nov 9, 2021
  5. drm/amdkfd: lower the VAs base offset to 8KB

    The low 16MB of virtual address space are currently reserved for kernel
    mode allocations mapped into user virtual address space. This causes
    conflicts with HMM/SVM mappings at low virtual addresses. We tried to
    move those kernel mode allocations to the upper half of the 64-bit
    virtual address space for GFX9, which is naturally reserved for kernel
    use. However, TBA (trap handler code) has problems to access addresses
    in the high virtual space. We have decided to set this to 8KB of the
    lower address space as a temporary fix, while investigate TBA address
    problem. It is very unlikely for user space to map memory at this low
    region.
    
    Signed-off-by: Alex Sierra <alex.sierra@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    alexsierrag authored and Alex Deucher committed Nov 9, 2021
  6. drm/amd/display: fix exit from amdgpu_dm_atomic_check() abruptly

    make action upon failure in "drm_atomic_add_affected_connectors()"
    consistent with the rest of failures in amdgpu_dm_atomic_check().
    
    Signed-off-by: Shirish S <shirish.s@amd.com>
    Reviewed-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Shirish S authored and Alex Deucher committed Nov 9, 2021
  7. drm/amd/amdgpu: fix the kfd pre_reset sequence in sriov

    The KFD pre_reset should be called before reset been executed, it will
    hold the lock to prevent other rocm process to sent the packlage to hiq
    during host execute the real reset on the HW
    
    Signed-off-by: shaoyunl <shaoyun.liu@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    shaoyunl authored and Alex Deucher committed Nov 9, 2021
  8. drm/amdgpu: fix uvd crash on Polaris12 during driver unloading

    There was a change(below) target for such issue:
    d82e2c2 ("drm/amdgpu: Fix crash on device remove/driver unload")
    But the fix for VI ASICs was missing there. This is a supplement for
    that.
    
    Fixes: d82e2c2 ("drm/amdgpu: Fix crash on device remove/driver unload")
    
    Signed-off-by: Evan Quan <evan.quan@amd.com>
    Acked-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Evan Quan authored and Alex Deucher committed Nov 9, 2021
  9. drm/i915/adlp/fb: Prevent the mapping of redundant trailing padding N…

    …ULL pages
    
    So far the remapped view size in GTT/DPT was padded to the next aligned
    offset unnecessarily after the last color plane with an unaligned size.
    Remove the unnecessary padding.
    
    Cc: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>
    Fixes: 3d1adc3 ("drm/i915/adlp: Add support for remapping CCS FBs")
    Signed-off-by: Imre Deak <imre.deak@intel.com>
    Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211026225105.2783797-3-imre.deak@intel.com
    (cherry picked from commit 6b6636e)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    ideak authored and rodrigovivi committed Nov 9, 2021
  10. drm/i915/fb: Fix rounding error in subsampled plane size calculation

    For NV12 FBs with odd main surface tile-row height the CCS surface
    height was incorrectly calculated 1 less than the actual value. Fix this
    by rounding up the result of divison. For consistency do the same for
    the CCS surface width calculation.
    
    Fixes: b3e57bc ("drm/i915/tgl: Gen-12 render decompression")
    Signed-off-by: Imre Deak <imre.deak@intel.com>
    Reviewed-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211026225105.2783797-2-imre.deak@intel.com
    (cherry picked from commit 2ee5ef9)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    ideak authored and rodrigovivi committed Nov 9, 2021
  11. drm/i915/hdmi: Turn DP++ TMDS output buffers back on in encoder->shut…

    …down()
    
    Looks like our VBIOS/GOP generally fail to turn the DP dual mode adater
    TMDS output buffers back on after a reboot. This leads to a black screen
    after reboot if we turned the TMDS output buffers off prior to reboot.
    And if i915 decides to do a fastboot the black screen will persist even
    after i915 takes over.
    
    Apparently this has been a problem ever since commit b2ccb82 ("drm/i915:
    Enable/disable TMDS output buffers in DP++ adaptor as needed") if one
    rebooted while the display was turned off. And things became worse with
    commit fe0f1e3 ("drm/i915: Shut down displays gracefully on reboot")
    since now we always turn the display off before a reboot.
    
    This was reported on a RKL, but I confirmed the same behaviour on my
    SNB as well. So looks pretty universal.
    
    Let's fix this by explicitly turning the TMDS output buffers back on
    in the encoder->shutdown() hook. Note that this gets called after irqs
    have been disabled, so the i2c communication with the DP dual mode
    adapter has to be performed via polling (which the gmbus code is
    perfectly happy to do for us).
    
    We also need a bit of care in handling DDI encoders which may or may
    not be set up for HDMI output. Specifically ddc_pin will not be
    populated for a DP only DDI encoder, in which case we don't want to
    call intel_gmbus_get_adapter(). We can handle that by simply doing
    the dual mode adapter type check before calling
    intel_gmbus_get_adapter().
    
    Cc: <stable@vger.kernel.org> # v5.11+
    Fixes: fe0f1e3 ("drm/i915: Shut down displays gracefully on reboot")
    Closes: https://gitlab.freedesktop.org/drm/intel/-/issues/4371
    Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211029191802.18448-2-ville.syrjala@linux.intel.com
    Reviewed-by: Stanislav Lisovskiy <stanislav.lisovskiy@intel.com>
    (cherry picked from commit 49c55f7)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    vsyrjala authored and rodrigovivi committed Nov 9, 2021
  12. drm/locking: fix __stack_depot_* name conflict

    Commit cd06ab2 ("drm/locking: add backtrace for locking contended
    locks without backoff") added functions named __stack_depot_* in drm
    which conflict with stack depot. Rename to __drm_stack_depot_*.
    
    v2 by Jani:
    - Also rename __stack_depot_print
    
    References: https://lore.kernel.org/r/20211015202648.258445ef@canb.auug.org.au
    Fixes: cd06ab2 ("drm/locking: add backtrace for locking contended locks without backoff")
    Cc: Daniel Vetter <daniel@ffwll.ch>
    Reviewed-by: Daniel Vetter <daniel@ffwll.ch>
    Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>
    Signed-off-by: Jani Nikula <jani.nikula@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20211018085113.27033-1-jani.nikula@intel.com
    (cherry picked from commit c4f08d7)
    sfrothwell authored and jnikula committed Nov 9, 2021

Commits on Nov 8, 2021

  1. drm/virtio: Fix NULL dereference error in virtio_gpu_poll

    When virgl is not enabled, vfpriv pointer would not be allocated.
    Therefore, check for a valid value before dereferencing.
    
    Reported-by: Christian Zigotzky <chzigotzky@xenosoft.de>
    Cc: Gurchetan Singh <gurchetansingh@chromium.org>
    Cc: Gerd Hoffmann <kraxel@redhat.com>
    Signed-off-by: Vivek Kasireddy <vivek.kasireddy@intel.com>
    Tested-by: Christian Zigotzky <chzigotzky@xenosoft.de>
    Link: http://patchwork.freedesktop.org/patch/msgid/20211104214249.1802789-1-vivek.kasireddy@intel.com
    Signed-off-by: Gerd Hoffmann <kraxel@redhat.com>
    vivekkreddy authored and kraxel committed Nov 8, 2021

Commits on Nov 5, 2021

  1. drm/amdgpu: fix SI handling in amdgpu_device_asic_has_dc_support()

    Properly handle SI DC support when CONFIG_DRM_AMD_DC_SI is not
    set.
    
    Fixes: f7f12b2 ("drm/amdgpu: default to true in amdgpu_device_asic_has_dc_support")
    Reviewed-by: Evan Quan <evan.quan@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Alex Deucher committed Nov 5, 2021
  2. drm/amdgpu: Fix dangling kfd_bo pointer for shared BOs

    If a kfd_bo was shared (e.g. a dmabuf export), the original kfd_bo may be
    freed when the amdgpu_bo still lives on. Free the kfd_bo struct in the
    release_notify callback then the amdgpu_bo is freed.
    
    Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Reviewed-By: Ramesh Errabolu <Ramesh.Errabolu@amd.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    fxkamd authored and Alex Deucher committed Nov 5, 2021
  3. drm/amd/amdkfd: Don't sent command to HWS on kfd reset

    When kfd need to be reset, sent command to HWS might cause hang and get unnecessary timeout.
    This change try not to touch HW in pre_reset and keep queues to be in the evicted state
    when the reset is done, so they are not put back on the runlist. These queues will be destroied
    on process termination.
    
    Signed-off-by: shaoyunl <shaoyun.liu@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    shaoyunl authored and Alex Deucher committed Nov 5, 2021
  4. drm/amdgpu: correctly toggle gfx on/off around RLC_SPM_* register access

    As part of the ib padding process, accessing the RLC_SPM_* register may
    trigger gfx hang. Since gfxoff may be already kicked during the whole period.
    To address that, we manually toggle gfx on/off around the RLC_SPM_*
    register access.
    
    This can resolve the gfx hang issue observed on running Talos with RDP launched
    in parallel.
    
    Signed-off-by: Evan Quan <evan.quan@amd.com>
    Acked-by: Guchun Chen <guchun.chen@amd.com>
    Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Evan Quan authored and Alex Deucher committed Nov 5, 2021
  5. drm/amdgpu: correct xgmi ras error count reset

    The error count reset for xgmi3x16 pcs is missed.
    
    Signed-off-by: Tao Zhou <tao.zhou1@amd.com>
    Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Tao Zhou authored and Alex Deucher committed Nov 5, 2021
  6. drm/amd/pm: Correct DPMS disable IP version check

    Previously there was a check based on chip # for chips that aligned to
    >=CHIP_NAVI10 to have RLC stopped as part of DPMS check.  This was because
    of gfxclk being controlled by RLC in the newer designs.
    
    As part of IP version checking though, this got changed to match IP
    version for SMU.  Because Renoir designs also include smu11 that meant
    that even GFX9 started to stop RLC earlier.
    
    Adjust to match GFX IP version instead of SMU IP version to restore the
    previous behavior.
    
    Fixes: a896796 ("drm/amdgpu/amdgpu_smu: convert to IP version checking")
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    superm1 authored and Alex Deucher committed Nov 5, 2021
Older